







Discrete Mathematics

Mathematical Reasoning and Proof with Puzzles, Patterns, and Games

Douglas E. Ensley
J. Winston Crawley





Copyrighted Material 







Discrete Mathematics 


Mathematical Reasoning and Proof with Puzzles, 
Patterns, and Games 


Douglas E. Ensley 
Shippensburg University 


J. Winston Crawley 
Shippensburg University 


WILEY 
John Wiley & Sons, Inc. 




To Amy, John, and Jessica, for making life fun. — D. E. 
To Margaret, for her unwavering support. — W. C. 


VICE PRESIDENT, PUBLISHER Laurie Rosatone
ASSOCIATE EDITOR Kelly Boyle
PROGRAM ASSISTANT Stacy French
SENIOR PRODUCTION EDITOR Lisa Wasserman
SENIOR DESIGNER Kevin Murphy
SENIOR ILLUSTRATION EDITOR Anna Melhorn
COVER PHOTO Chuck Carlton/Index Stock


This book was set in LaTeX by TechBooks and printed and bound by R.R. Donnelley. The cover was
printed by Brady Palmer.


This book is printed on acid-free paper.


Copyright © 2006 John Wiley & Sons, Inc. All rights reserved. No part of this publication may be
reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic,
mechanical, photocopying, recording, scanning, or otherwise, except as permitted under Sections 107 or
108 of the 1976 United States Copyright Act, without either the prior written permission of the
Publisher, or authorization through payment of the appropriate per-copy fee to the Copyright Clearance
Center, Inc., 222 Rosewood Drive, Danvers, MA 01923, website www.copyright.com. Requests to the
Publisher for permission should be addressed to the Permissions Department, John Wiley & Sons, Inc.,
111 River Street, Hoboken, NJ 07030-5774, (201) 748-6011, fax (201) 748-6008, or website
http://www.wiley.com/go/permissions.


To order books or for customer service, please call 1-800-CALL WILEY (225-5945).














Library of Congress Cataloging in Publication Data:
Ensley, Douglas E.
Discrete Mathematics: Mathematical Reasoning and Proof with Puzzles, Patterns and Games /
Douglas E. Ensley, J. Winston Crawley.
p. cm.
Includes bibliographical references and index.
ISBN 978-0-471-47602-3
ISBN-10 0-471-47602-1
1. Logic, Symbolic and mathematical. 2. Mathematical recreations. 3. Computer
science--Mathematics.
I. Crawley, J. Winston. II. Title.
QA9.25.E57 2006

SU der? 





2008007417 
Printed in the United States of America

10 9 8 7 6 5 4 3 2 1





Preface 
for Instructors 


This book is written for students who are prepared to make a first departure from the 
deeply worn path that leads from arithmetic to calculus in the typical mathematics 
curriculum. Because of its deliberate pace and emphasis on student readability as 
well as its treatment of engaging topics, this book is suitable for courses taught at
several different levels. 





• This text originated as notes for an enrichment course in discrete mathematics for
talented high school students. This course has been taught for over ten years as part
of the core curriculum of the Pennsylvania Governor's School for the Sciences.

• The book has been used for five years for a discrete math course at Shippensburg
University. This course is required for students majoring in mathematics or computer
science as well as those students majoring in elementary education with a
concentration in mathematics.

• In its most recent life, this book has also been used by several instructors as
the basis for a “transition” course in a mathematics major program. The gentle
introduction to abstraction and the emphasis on written proof have been well
received by students at this level as well.


Discrete mathematics suffers somewhat from having no historically established
place in the K-12 mathematics curriculum. According to the National Council of
Teachers of Mathematics, discrete math should be emphasized at every level of
mathematics. Meanwhile, college curriculum committees struggle with what one-semester
course ought to be used to address basic discrete math concepts. Since it seems
unlikely that there will be a trend in colleges to create a three or four semester sequence







in discrete mathematics, perhaps the best one can do is to apply some careful thought 
and sound pedagogy to what will happen in the crucial one or two semesters.

The approach taken in our book is one of establishing fundamental skills and
building connections between basic concepts instead of surveying as many topics as 
possible in one semester. There is enough variety of topics for the instructor to have 
some choice for the emphasis of his or her course. However, the instructor might find 
that a favorite topic is treated only in a specific context instead of more generally. 
For example, in our book, instead of an entire chapter on binary relations, there are 
three non-contiguous sections placed at different points in the overall development. 
By stressing connections between topics, we believe we are enhancing student un- 
derstanding and avoiding the time-consuming task of thoroughly discussing every 
aspect of each topic before moving on. 








Contents and Organization 





A list of mathematical topics that are addressed within the framework of this book
is shown below. This list covers most of the content and learning objectives
recommended by the ACM/IEEE (for a one-semester course for computer science majors),
the MAA CUPM (for math majors), and the CBMS document on the Mathematical
Education of Teachers (for future middle and secondary level teachers).






























Number sequences        Functions                    Markov chains
Truth tables            Binary relations             Eulerian graphs
Propositional logic     Equivalence relations        Planar graphs
Predicate logic         Numerical functions          Hamiltonian graphs
Mathematical proof      Iterated function systems    Traveling salesperson
Proof by induction      Counting techniques          ted graphs
Pigeonhole principle    RSA cryptography             Adjacency matrices
Modular arithmetic      Recurrence relations         Spanning trees
Representing numbers    Growth of functions          Solving puzzles
Sets                    Discrete probability         Analyzing games
Boolean algebra         Expected value
Logic circuits          Matrix arithmetic





These topics are broadly organized into four parts:


Thinking and writing about mathematics





The first two chapters introduce the fundamental emphasis of the course. Although
the content is discrete mathematics, the theme is learning to think and write
mathematically. The material is introduced in the context of puzzles concerning logic
and numbers. Although some symbolic logic is covered in the first chapter, it is not
directly used in the discussion of proof in Chapter 2. Instead, we build upon the
students’ innate ability to apply basic logic to their surroundings, and we focus our
efforts on attaining a level of comfort with formalization and abstraction. We
demonstrate that inductive reasoning and proof are very natural processes. The students


practice their newly developed proof-writing skills in connection with a number of
mathematical topics, including divisibility properties, rational numbers, sequences,
and summations. Chapter 2 ends with applications of these properties of numbers.





Sets, functions, and relations 


The next two chapters introduce these important topics. Students are encouraged to
apply their problem-solving and proof-writing skills with this new subject matter. An
important point in this discussion is the way that abstraction (like Boolean algebra)
can actually make certain computations (as with logic circuits) easier instead of
harder. Other excursions into iterated functions and growth of functions continue
the idea of recursive descriptions and inductive proofs from earlier chapters.


Combinatorics and probability








Chapters 5 and 6 discuss combinatorics, probabilities, and their applications to games
of chance. Although mathematical proof does not play as prominent a role in this
portion, mathematical reasoning is still paramount. An important example is the
emphasis on using the idea of one-to-one correspondence to realize that two
seemingly different problems must have the same solution. Excursions into the solution
of recurrence relations and the use of matrices to model probability and expected
value make connections to other topics in the book.





Graph theory 


The fourth part of the text develops graph theory, emphasizing a closer look at
puzzles and games. The topic is motivated by the famous Bridges of Königsberg puzzle.
Once introduced, graphs provide an opportunity to reinforce several topics from the
earlier portions of the book, including binary relations and induction proofs.
Alternatively, the classical applications of graph theory can be understood and appreciated
independently of any earlier material.


Threads 





One innovation in this book is the spiral approach to the development of some 
important concepts and topics. We achieve this by avoiding as much as possible the 
fragmentation of the course into disconnected pieces. The book uses several major
threads to help weave the core topics into a cohesive whole. 





• Throughout the book, we emphasize the application of mathematical reasoning
to the solution of problems, and we guide the student in thinking about, reading,
and writing proofs in a wide variety of contexts.

• An important content thread is the focus on (mathematical) puzzles, games or
magic tricks to engage students with problems that motivate the development of
specific topics.

• In addition, we have included a variety of “real world excursions” to which these
same concepts can be applied, in order to show students the utility of discrete
mathematics.

• A primary conceptual thread is the use of “algorithmic thinking” in solving
problems and exploring the connection between related problems.

• Another important feature is the recurrent (so to speak) use of recursive modeling
and its companion, mathematical induction. There are ample opportunities for
students to reason recursively and prove inductively throughout the book.


Four Possible Courses 





Discrete math courses vary so greatly that it is difficult for one book to serve every
purpose. There are four different types of courses for which this book is a good fit.
We briefly outline here the core sections that an instructor should consider for each
of these courses. Recommendations for emphasis within each section are given in
the Instructor's Guide.

1. A mixed math, computer science and math education audience requires
breadth among applications and conceptual topics. The following sections would
provide this experience:

94144,4547
554
• 6.1-6.3
• 7.1, 7.3, 7.5

2. A first-year course for computer science students would want to cover
specific computer science applications as well as the logic and proof topics
recommended by ACM:

921,32, 44-40. 48
• 5.1-5.3, 5.5, 5.6
• 7.1, 7.3, 7.6, 7.7

3. A sophomore or junior level math major transition course might focus on
mathematical proof and classical structures, hence the following core sections
would be appropriate:

eli
92127
©3133
eias
97123

4. Advanced high school students might have applications of discrete math as
their key interest, so the following sections could provide the core of a course
for this audience:

vida
. 27
03135
• 4.1, 4.2, 4.6, 4.7
.
• 6.1-6.3, 6.6
• 7.1, 7.3, 7.5, 7.7



Aids to Teaching and Learning 





Practice problems 


Each section contains a number of practice problems distributed throughout. The
intent is that students use these to test their understanding of what they have just read.
This encourages the students to slow down and think about what they are reading.
It also provides positive reinforcement to those who solve the problem correctly
and a mid-course correction to those who do not. Detailed solutions for the practice
problems are found at the end of each section.


Section reviews 


At the conclusion of each chapter, a section review outlines the major terms, concepts, 
and skills covered by each section within that chapter. In addition to providing a 
synopsis to the student, these reviews are useful to the instructor in planning which 
sections to cover. 


Exercises 


The book has over 1,100 exercises, ranging from routine reinforcement to conceptual
challenges. We have tried to include many exercises that are direct applications of
the examples in the section, several exercises that make connections to past topics or
foreshadow some future developments, and some problems that will challenge and
intrigue even the very best students.


Excursions 





Each chapter ends with at least one section that leads an excursion into applications 
of the concepts of the chapter. These independent sections can provide additional 
applications or individual enrichment, or they can be used as the basis of independent 
projects.


Appendices 


Matrix operations are presented in two different contexts (Markov chains and count-
ing walks in graphs) in this book, and either can be taught without the other. However, 
each treatment is fairly brief and offers no real chance for directed practice with ma- 
trices, so we have included an appendix on basic matrix operations for this purpose. 
In addition, we have included an appendix explaining the basic rules and terminology 
for many of the games we use for examples in the book. 





Answers and solutions


In addition to the solutions to the practice problems, we give answers or hints in the 
back of the book for many of the exercises. In the exercise sets themselves, we indicate 
which problems have answers in the back by highlighting the exercise number in 
blue. A supplementary Student Solutions Manual (ISBN 0-471-76097-8) contains more
detailed solutions to those same exercises. 


[Margin icon: Explore more on the Web.]




Electronic resources for students 


Every section has examples or exercises supplemented by web-based activities. This 
supplementary material primarily takes the form of either an online version of the 
problem or an interactive activity to develop an idea. When an item in the text is
connected to an online activity, it is identified with an icon in the margin. The URL 
for this material and other resources for instructors and students is 





http://www.wiley.com/college/ensley


The web-based material related to mathematical proof was funded through NSF
grant DUE-0230755. 


Resources for instructors 


The Instructor's Guide, located on the instructor companion site at
http://www.wiley.com/college/ensley, provides more details on how to use this book,
including course outlines and an annotated version of the section summaries that appear
within the text. In addition, the Instructor's Solution Manual (ISBN 0-471-48801-1)
provides complete solutions to all the exercises in the book.


Acknowledgements 





A textbook combines the clever ideas, generous time, and industrious efforts of many 
individuals. While the blame for shortcomings and errors always rests on the shoul-
ders of the authors, the credit for the book's successes should be spread generously 
among all of these people. 

For providing feedback from classroom testing, we thank the following professors
and their students: Madelaine Bates (Bronx Community College), Debra
Borkovitz (Wheelock College), Judith Covington (Louisiana State University,
Shreveport), Renee Fister (Murray State University), Catherine Murphy (Purdue
University, Calumet), Patricia Oakley (Goshen College), Sharon Robbert (Trinity
Christian College), and Madeleine Schep (Columbia College).

In addition, we gratefully acknowledge the following individuals for review- 
ing our manuscript and generously providing feedback that has greatly improved 
the book: David Barrington (University of Massachusetts, Amherst), George Davis 
(Georgia State University), Darin Goldstein (California State University, Long 
Beach), Jonathan Goldstine (Penn State University), Johannes Hattingh (Geor- 
gia State University), Matthew Hudelson (Washington State University), Heather 
Hulett (University of Wisconsin, La Crosse), Mark Jacobson (University of Northern
Iowa), Jay Kappraff (New Jersey Institute of Technology), Theodore Laetsch
(University of Arizona), Hong-Jian Lai (West Virginia University), Sheau-Dong
Lang (University of Central Florida), Lisa Markus (De Anza College), Jandelyn
Plane (University of Maryland, College Park), Jacek Polewczak (California State
University, Northridge), and Anne-Louise Radimsky (California State University, 
Sacramento). 

We also thank our patient colleagues at Shippensburg University, James Hamblin,
Kate McGivney, Fred Nordai, Cheryl Olsen, and Kim Presser, for their continuous
input into the process of developing this material over the past five years.
For detailed reviews and technical assistance, we would like to thank John and Amy
Ensley, Mark McKibben of Goucher University, Frank Purcell and Elka Block of
Twin Prime Editorial, and the entire team at John Wiley and Sons, especially Kelly
Boyle, Laurie Rosatone and Lisa Wasserman.






Preface 
for Students 





It is not worth the while to go round the world to count the cats in Zanzibar.
-Henry David Thoreau, in Walden 


Mathematics has often been saddled with the reputation of being a dry academic 
pursuit characterized by tedious rote. It is easy to lay the blame for this on some 
traditional mathematics programs, but much of the trouble truly lies with the subject 
itself. Before the college level, mathematics appears to be a tower of topics, each one 
requiring mastery before continuation is possible. Certainly no one will understand
algebra without first mastering addition and subtraction of numbers. No one will grasp
the great utility of trigonometry without understanding something about geometry
first. This pyramid of mathematical topics persists to the study of calculus. It is no
wonder that calculus is considered by so many outside of mathematical circles to be
the very pinnacle of achievement in mathematics.

The most recent national mathematics standards address these concerns, and 
their recommendations to incorporate many different mathematical topics (to some 
degree) at all levels might someday change this perception of mathematics, but today, 
college is likely to be the first place that a student finds out about entirely different
kinds of mathematics. There are many courses to take after the calculus sequence
which never mention calculus at all. There are even senior level courses which cover
new material drawing on no specific mathematical topics the students have ever had.
These are not reasons to avoid mathematics in college! On the contrary, you may
find that you can enjoy mathematics even if calculus is not your cup of tea. 

This discrete mathematics text is designed with two primary goals in mind. The 
first goal is to expose you to some mathematics that you have never seen before.
The topics have not been chosen at random; we will try to see how these new
mathematical topics can be applied to real problems and everyday situations. Our
















second goal is to develop your problem-solving skills. There is no better test of 
reasoning ability than to understand and give mathematical arguments based upon 
new material. We hope you will learn to value these skills regardless of your long-term 
career pursuits. 

This book is also designed with your personal intellectual growth valued more 
highly than your exposure to as many topics as possible. Instead of trying to “go round 
the world” to learn about discrete mathematics, we focus instead on establishing a 
solid understanding of each thing we do. In this course, you will be asked to think, 
explore, write, and discuss in order to make connections between different abstract 
and concrete mathematical concepts. It is these skills that will be the measure of your 
success in the course. 

The title of this book might give rise to skepticism about the usefulness of this 
subject. Let us therefore say a few words in defense of puzzles and games. The field 
of graph theory can be traced to the solution of a popular puzzle concerning crossing 
bridges in 1735. Today, this is one of the fastest growing areas of mathematics thanks 
to important ties with computer science and business. Games of chance gave birth 
to the now ubiquitous fields of probability and statistics. The ability to play chess 
remains an important test case for intelligence in computers. Mathematicians have 
shown how certain card tricks are related to the allocation of computer memory. In 
short, yesterday’s games and amusements are among today’s deepest mathematical 
subjects. Looking at it another way, it is also apparent that much of the mathematics 
that we encounter in our academic and worldly pursuits is not entirely different 
from “recreational” mathematics. Every math question is a puzzle. Every thought 
experiment is a game. 

The book is organized into four parts. In the first part we will practice thinking 
and writing about mathematics. The context for most of this activity is in puzzles
concerning logic and numbers. This part is built on the premise that logic is a natural 
function of the human brain, and that the real challenge for developing mathematical 
thinking rests with attaining comfort with formalization and abstraction and not so 
much with logic. This part of the course will build some important skills that we will 
subsequently use. 

The second part of the course is an excursion into abstract mathematics via the 
study of sets and functions. This will primarily give us a setting in which to exercise our 
problem-solving and proof-writing skills. Another important point in this discussion,
however, is the way that abstraction (like Boolean algebra) can actually make certain
computations (as with logic circuits) easier instead of harder. In this part of the 
course, we also explore the concept of function and its application to problems that 
are quite different from those in a precalculus setting. 

In the third part of the course, we will study probabilities and their applications 
to games of chance. In order to understand probability, we will have a somewhat 
lengthy introduction to the mathematical field of combinatorics or counting. The 
reason for this is simple. A basic premise of probability dictates that the probability 
of something happening (say rolling a 5 or greater on a toss of a fair die) is simply 
the ratio of the number of ways for the event to happen to the total number of ways 
in which any outcome may happen. In the example of rolling a 5 or greater on one 
toss of a die, there are two ways to have success (rolling a 5 or a 6) and six total 
equally-likely outcomes from a toss of a die, hence the probability of rolling a 5 or 
greater is 2/6 = 1/3. Of course in real problems, the events are more complicated and
the outcomes much more numerous. Therefore, in order to accomplish anything in
probability, we will first have to learn some techniques of counting. Along the way,










we will be sure to point out that combinatorics is a beautiful and valuable area of 
mathematics in its own right. 
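
The counting view of probability described above can be sketched in a few lines of Python. This is only an illustration under the stated premise that every outcome is equally likely; the function name `probability` and its parameters are our own choices for this sketch and do not come from the text.

```python
from fractions import Fraction


def probability(outcomes, is_success):
    """Ratio of successful outcomes to all outcomes.

    Assumes every outcome in `outcomes` is equally likely.
    """
    outcomes = list(outcomes)
    successes = [o for o in outcomes if is_success(o)]
    return Fraction(len(successes), len(outcomes))


# One toss of a fair die: the faces 1 through 6 are the equally likely outcomes.
die = range(1, 7)
print(probability(die, lambda face: face >= 5))  # 1/3
```

The hard part in realistic problems is not the division but producing the two counts, which is why the counting techniques come first.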

The fourth part of the course will be a closer look at puzzles and games. The route 
by which we will approach them will be the field of graph theory, which Gottfried 
Leibniz called the "geometry of position." Once we have seen some of the basic 
techniques of graph theory we will look in some detail at applications of this important 
area of mathematics. 

There are several threads that run through the course. The two most obvious 
ones are recursion and applications. Believe it or not, recursive reasoning is not 
really something new to you. For example, if you have just determined that the 
sum of the first 100 positive integers is 5,050, and someone asks what the sum of 
the first 101 positive integers is, you will not start over but will rather quickly add 
5050 + 101 = 5151. The inherent laziness of the human brain that reuses previous 
information is at the heart of recursive reasoning. The first step in coming to terms 
with this idea is in realizing that it is a natural form of reasoning, not something 
invented by your math teacher. 
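
As a small illustration of the reasoning just described, here is a Python sketch that defines the sum of the first n positive integers in terms of the sum of the first n - 1. The function name `sum_first` is a hypothetical choice for this example, not notation used in the book.

```python
def sum_first(n):
    """Sum of the first n positive integers, described recursively."""
    if n == 0:
        return 0  # the sum of no numbers at all
    # Reuse the answer to the smaller problem and add one more term.
    return sum_first(n - 1) + n


known = sum_first(100)
print(known)        # 5050
print(known + 101)  # 5151, without starting over
```

The last line mirrors the shortcut in the paragraph above: once the sum for 100 is known, the sum for 101 needs only one more addition.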

The thread of real world applications is rather overtly presented throughout 
sections whose main heading is "Excursion." These sections are intended to provide 
a flavor of application related to some of the ideas developed in the chapter. The 
intention of this thread is for you to leave the course with the understanding that 
discrete mathematics is relevant to important problems.

So the big plan is to introduce you to mathematical writing, abstract structures, 
counting, discrete probability, and graph theory. Along the way you will meet some
applications to puzzles and games and subsequently to problems in the real world. 
This course will probably seem different than most math courses you have had. Be 
prepared to work and be prepared to play, but most of all, be prepared to think. 















"I see you stand like greyhounds in the slips, 
Straining upon the start. The game's afoot!”
— William Shakespeare, in Henry V 


Contents 


1 Puzzles, Patterns, and Mathematical Language
  1.1 First Examples
  1.2 Number Puzzles and Sequences
  1.3 Truth-tellers, Liars, and Propositional Logic
  1.4 Predicates
  1.5 Implications
  1.6 Excursion: Validity of Arguments

2 A Primer of Mathematical Writing
  2.1 Mathematical Writing
  2.2 Proofs About Numbers
  2.3 Mathematical Induction
  2.4 More About Induction
  2.5 Contradiction and the Pigeonhole Principle
  2.6 Excursion: Representations of Numbers
  2.7 Excursion: Modular Arithmetic and Cryptography

3 Sets and Boolean Algebra
  3.1 Set Definitions and Operations
  3.2 More Operations on Sets
  3.3 Proving Set Properties
  3.4 Boolean Algebra
  3.5 Excursion: Logic Circuits

4 Functions and Relations
  4.1 Definitions, Diagrams, and Inverses
  4.2 The Composition Operation
  4.3 Properties of Functions and Set Cardinality
  4.4 Properties of Relations
  4.5 Equivalence Relations
  4.6 Numerical Functions in Discrete Math
  4.7 Excursion: Iterated Functions and Chaos
  4.8 Excursion: Growth of Functions

5 Combinatorics
  5.1 Introduction
  5.2 Basic Rules for Counting
  5.3 Combinations and the Binomial Theorem
  5.4 Binary Sequences
  5.5 Recursive Counting
  5.6 Excursion: Solving Recurrence Relations

6 Probability
  6.1 Introduction
  6.2 Sum and Product Rules for Probability
  6.3 Probability in Games of Chance
  6.4 Expected Value in Games of Chance
  6.5 Excursion: Recursion Revisited
  6.6 Excursion: Matrices and Markov Chains

7 Graphs and Trees
  7.1 Graph Theory
  7.2 Proofs About Graphs and Trees
  7.3 Isomorphism and Planarity
  7.4 Connections to Matrices and Relations
  7.5 Graphs in Puzzles and Games
  7.6 Excursion: Binary Trees
  7.7 Excursion: Hamiltonian Cycles and the TSP

A Rules of the Game
  cos on
  Som s
  Miscellaneous Games

B Matrices and Their Operations
  Matrix Operations
  Matrix Arithmetic with Technology

Selected Answers and Hints

References and Further Reading


Driver TWDDO03-Ensley-v6 July 9,205 2050 





Puzzles, Patterns, and 
Mathematical Language 


Ask five mathematicians what mathematics is, and you might very well get 
five different answers, but there will no doubt be some underlying themes. 
Mathematics is about solving problems. Mathematics explains patterns.
Mathematics is a set of statements deduced logically from axioms and definitions.
Mathematics uses abstraction to model the real world. This introductory chapter
lays a foundation for our study of discrete mathematics by addressing all
these themes. In the second section, we will examine patterns in number
sequences, and in particular, we will discuss the idea of a “recursive model” 
that will be important throughout the book. The subsequent sections address 
basic logic particularly as it applies to clarity of language. We are accustomed 
to people not saying exactly what they mean in conversation, and as a result, 
we are good at interpreting statements from context, but there is no room for 
ambiguity in mathematics. We end the chapter with a discussion of validity 
of the kinds of arguments that one might encounter in life. 





1.1 First Examples





In this section we will take a first look at some of the puzzles and games that will 
occupy us through the weeks ahead. We will not yet worry about the mathematics 
that lies behind these puzzles and games. The mathematical ideas will confront us 
soon enough. At this point, you should just immerse yourself in these examples to 
get a feeling for the type of problems we will be studying in this book. Make some 
time to perform the magic trick on a roommate, discuss the Josephus problem in 




class, solve the puzzles, and play the games. This will make the mathematics more 
meaningful to you as we get to each of the relevant topics behind these examples.


A Magic Trick 


At a recent magic show, the following trick was performed. Four playing cards* were
stacked face up, with a heart at the bottom, then a club, then a diamond, and then 
a spade. A spectator held this packet while the blindfolded magician gave these 
instructions: 


1. Turn the spade (the uppermost card) face down. 
2. Move any number of cards, one at a time, from the top of the packet to the 
bottom. 
3. Turn over the top two cards as one. 
4. Move any number of cards, one at a time, from the top of the packet to the 
bottom. 
5. Turn over the top two cards as one. 
6. Move any number of cards, one at a time, from the top of the packet to the 
bottom. 
7. Either turn over the entire stack or do not—your choice. 
8. Turn over the topmost card. 
9. Turn over the top two cards as one. 
10. Turn over the top three cards as one. 


At this point, the prestidigitator, while still blindfolded, correctly divined the 
state of the stack. The club was the only card facing the opposite way from the 
others! 

This trick takes advantage of the fact that the actions that seem like shuffling to 
the spectator are actually preserving all the properties the magician cares about. We 
will see how to verify this “invariance of properties” in Section 2. 


Example 1 Write down the initial state of the deck (from bottom to top) as
H C D S. Using uppercase letters to indicate the card is face up and lowercase for face
down, we see that after step 1 the deck would be H C D s. Starting from here, trace the 
state of the deck through step 10 if in steps 2, 4, and 6, we move exactly two cards from 
the top to the bottom, and in step 7 we turn over the packet. 


SOLUTION Using the notation described, we can show the state of the 
packet after each step. 


Step 1. H C D s     Step 6.  S d c h
Step 2. D s H C     Step 7.  H C D s
Step 3. D s c h     Step 8.  H C D S
Step 4. c h D s     Step 9.  H C s d
Step 5. c h S d     Step 10. H D S c
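The bookkeeping in Example 1 is mechanical enough to hand to a short program. The following Python sketch is only an illustration: the function names (turn_over_top, move_top_to_bottom, card_trick) are invented here, and it assumes, as the solution above does, that turning over several cards "as one" reverses their order as well as flipping each of them.

```python
def turn_over_top(packet, k):
    """Turn the top k cards over as one unit.

    The packet is a list written bottom to top. Uppercase means face up,
    lowercase means face down, so flipping a card is a change of case.
    """
    keep, top = packet[:len(packet) - k], packet[len(packet) - k:]
    return keep + [card.swapcase() for card in reversed(top)]


def move_top_to_bottom(packet, k):
    """Move k cards, one at a time, from the top of the packet to the bottom."""
    for _ in range(k):
        packet = [packet[-1]] + packet[:-1]
    return packet


def card_trick(moves=(2, 2, 2), turn_packet=True):
    """Run the ten steps of the trick and return the final state, bottom to top."""
    packet = turn_over_top(list("HCDS"), 1)          # step 1
    packet = move_top_to_bottom(packet, moves[0])    # step 2
    packet = turn_over_top(packet, 2)                # step 3
    packet = move_top_to_bottom(packet, moves[1])    # step 4
    packet = turn_over_top(packet, 2)                # step 5
    packet = move_top_to_bottom(packet, moves[2])    # step 6
    if turn_packet:
        packet = turn_over_top(packet, len(packet))  # step 7 (optional)
    for k in (1, 2, 3):                              # steps 8, 9, and 10
        packet = turn_over_top(packet, k)
    return "".join(packet)


print(card_trick((2, 2, 2), True))  # HDSc, the final state found in Example 1
```

Changing the two arguments lets you try other choices for steps 2, 4, 6, and 7 and see whether the club is always the odd card out.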
*The reader who is unfamiliar with the anatomy of a deck of playing cards should consult
Appendix A.


Practice Problem 1 (Note. Practice problems allow the reader to stop and try 
an idea before moving on. The answers to all practice problems are given at the 
end of the section.) The following questions refer to the magic trick described in 
Example 1. 


(a) Repeat Example 1 but this time in step 7, do not turn over the deck. 

(b) Repeat Example 1, but this time in steps 2, 4, and 6, move one, two, and three
cards (respectively) from the top to the bottom, and in step 7 do not turn over the 
deck. 


A Matter of Life and Death 


In [32], the following legend about the first-century historian Flavius Josephus is 
recounted: 


In the Jewish revolt against Rome, Josephus and 39 of his comrades were holding 
out against the Romans in a cave. With defeat imminent, they resolved that, like 
the rebels at Masada, they would rather die than be slaves to the Romans. They 
decided to arrange themselves in a circle. One man was designated as number one, 
and they proceeded clockwise killing every seventh man. ... Josephus (according
to the story) was among other things an accomplished mathematician; so he
instantly figured out where he ought to sit in order to be the last to go. But when
the time came, instead of killing himself he joined the Roman side.


The solution is for Josephus to stand in the 24th position in the circle. It is yet
another historical example of how those with a distaste for mathematics quickly 
become the chaff of evolution. That point aside, the problem rightfully raises the 
question of how Josephus could have quickly computed this position. 

We will refer to this scenario as the “Josephus game” even though it does not
sound like fun. We call it a game because it is very similar to methods that children use
to decide things on the playground. Perhaps we should imagine the soldiers saying
(in Latin, of course): 


One potato, Two potato, Three potato, Four; Five potato, Six potato, 
Seven potato, More 


On the “More” of course, someone would be killed, which would be a shame, 
but it would certainly eliminate that person. This is much like the Josephus game 
where one skips by eight people between those who are eliminated instead of seven. 
Another playground example is “eeny-meeny-miney-moe,” where the skip number 
is 16. 

Here are some samples of the many questions one can ask about this type of
game. We will begin an examination of the first question in the practice problems
and exercises of this section. The others are more difficult. In fact, no one knows the
answer to the last one.


1. If every second person is killed (instead of every seventh) when there are initially 
n people in the circle, where should Josephus stand? 

2. In some versions of the story, Josephus saves a friend by having the friend stand in
a position so that the two of them are the last two people left alive. If every second
person is killed (instead of every seventh) when there are initially n people in the
circle, where should Josephus's friend stand?

3. If Josephus is not allowed to move positions but is asked what the “skip number”
should be (say Monty Hall is a Roman soldier, for instance), can he always respond
in a way so that he will live? What about his friend?

4. Characterize all sets of people in the original circle who can be made to be the
last people living by naming an appropriate skip number. In other words, can
Josephus save any number of friends in any positions by naming an appropriate
skip number?*

Figure 1-1 The beginning of the Josephus “game.” (Panels: Person 2 eliminated; Person 4 eliminated; First five eliminated.)


The Josephus problem is the basis for introducing recursive modeling in [29] as
well as in [18]. In [32] the concept of Josephus permutation is introduced so that the
entire killing process is captured instead of just the grisly end result. Later in this
book, we will explore some aspects of Josephus permutations using recursion and
induction.

Example 2 If there are 10 people numbered 1 to 10 in a circle, and every
other person is eliminated starting with person number 2, which person is the last
standing?


SOLUTION In Figure 1-1, we use the letter x to represent dead people and 
a capital X to highlight the most recent person eliminated. 

In the interest of space, we will write the state of the game in a line rather
than draw the circle each time. For example, after the first five people have
been eliminated, we can write 1 x 3 x 5 x 7 x 9 X instead of drawing the last
picture above, and we can still tell from the placement of the X that the next
person to be eliminated is number 3. If we use this system, the remainder of
the game proceeds like this:

1 x X x 5 x 7 x 9 x
1 x x x 5 x X x 9 x
X x x x 5 x x x 9 x
x x x x 5 x x x X x

Hence, person number 5 is the last one standing in this game.
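The elimination process in Example 2 can also be simulated directly. The sketch below is an illustration rather than a method from the text: the name josephus_order is invented here, and it assumes that counting starts at person 1, so that with a skip number of 2 the first person eliminated is person 2.

```python
def josephus_order(n, skip):
    """Return the order in which people 1..n are eliminated.

    Counting starts at person 1 and every skip-th remaining person is
    removed, so the last entry of the result is the survivor.
    """
    people = list(range(1, n + 1))
    order = []
    index = 0  # position in the list where the next count begins
    while people:
        index = (index + skip - 1) % len(people)
        order.append(people.pop(index))
    return order


print(josephus_order(10, 2))      # [2, 4, 6, 8, 10, 3, 7, 1, 9, 5]
print(josephus_order(40, 7)[-1])  # 24, the position quoted in the legend
```

The first call reproduces the game traced in Example 2, and the second agrees with the 24th position mentioned for Josephus and his 39 comrades.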


* This is given as a “Research problem” in [29].

Practice Problem 2


(a) If there are nine people numbered 1 to 9 in a circle, and every other person is 
eliminated starting with person number 2, which person is left? 

(b) If there are eight people numbered 1 to 8 in a circle, and every other person is 
eliminated starting with person number 2, which person is left? 

(c) If there are sixteen people numbered 1 to 16 in a circle, and every other person is
eliminated starting with person number 2, which person is left?

(d) Explain why the answer to part (c) is the same as the answer to part (b).


It's Just a Game 


Sporting events present an interesting challenge for mathematical modeling since in 
sports lies a blend of strategy, skill, and luck that is hard to separate. We will simplify 
the analysis by using probabilities to simulate all three aspects. For example, if a
baseball player bats 0.300 for a season, we will use 3/10 as the probability of that player
getting a hit in any given at-bat. This is certainly an oversimplification: in a given
at-bat, the probability of a player getting a hit could be higher or lower depending
on who is pitching, how many outs there are, or what he had for breakfast. However,
using 3/10 as if it is the real probability makes our calculations feasible, and it is a
reasonable simplification.

We will look at two kinds of sports events in this course, each of which is
characterized by the following examples from tennis:


1. In a certain tennis league, the first player to win two sets wins the match. If Player
A has a 55% chance of winning a set against Player B, what is the probability that
Player A wins the match? If these two players played many matches, what would
you expect to be the average number of sets that determine a match?

2. In tennis, each player's score progresses from 0 to 15 to 30 to 40 to game with
the wrinkle that a final score of game to 40 is impossible. A score of 40-40 is
called “deuce,” and a game can only be won from deuce if one player wins two
consecutive points. (This is equivalent to the rule of “winning by two” in other
sports.) If Player A has a 60% chance of winning a point against Player B, what
is the probability that Player A wins the game? If these two players played many
games, what would you expect to be the average number of points that determine
a game?


The fundamental difference between these two examples is that the first one 
can be analyzed by exhaustively cataloging all possible matches, but the second one 
cannot because there are infinitely many possible ways that a game could progress. 
In Chapter 6, we will see how to handle each type of problem. 
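To make "exhaustively cataloging all possible matches" concrete, here is a small Python sketch for the first tennis question. It is an illustration under stated assumptions, not the method developed in Chapter 6: the function name best_of_three is invented, and each set is treated as an independent event that Player A wins with probability 55/100.

```python
from fractions import Fraction


def best_of_three(p):
    """Map every possible match (a string such as "ABA") to its probability.

    p is the probability that Player A wins any one set; sets are assumed
    to be independent of each other.
    """
    outcomes = {}

    def play(history, prob):
        # The match stops as soon as either player has won two sets.
        if history.count("A") == 2 or history.count("B") == 2:
            outcomes[history] = prob
            return
        play(history + "A", prob * p)
        play(history + "B", prob * (1 - p))

    play("", Fraction(1))
    return outcomes


matches = best_of_three(Fraction(55, 100))
p_a_wins = sum(pr for m, pr in matches.items() if m.count("A") == 2)
expected_sets = sum(len(m) * pr for m, pr in matches.items())

print(sorted(matches))  # ['AA', 'ABA', 'ABB', 'BAA', 'BAB', 'BB']
print(p_a_wins)         # 2299/4000, that is, 0.57475
print(expected_sets)    # 499/200, that is, 2.495 sets on average
```

The same recursion never terminates for the second question, because a game at deuce can go on indefinitely, which is exactly the difference the paragraph above describes.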


Practice Problem 3 Suppose that Player A and Player B play a tennis match, 
where Player A has a 55% chance of winning any given set and the first player to win 
two sets wins the match. (This is called a “best-of-three” match.) One example of how 
a match could go is, “A wins the first set, B wins the second, and A wins the third.” This 
sequence of events can be represented more briefly by the simple string ABA. Every 
outcome of a match has a similar representation. 


1. Suppose that Player A and Player B play a best-of-three match. List in an organized
manner (using the representation described above) all the different ways the match
could go.

2. Is every way you listed in Part 1 equally likely to occur? In particular, which do you
think is more likely, that A wins in two sets, or that B wins in two sets?

3. Suppose that Player A and Player B play a single game that lasts for eight points.
One way this could happen is for the points to be won in this order: ABABABAA.
Give two other ways this could happen.

4. Explain why a single game cannot last for exactly seven points.

5. In an organized manner, list all the different ways Player A can win in six or fewer
points.


An Elementary Puzzle


It is not too difficult to draw the picture in Figure 1-2 without lifting your pencil from
the paper and without retracing any lines. It is not as easy to decide whether the
picture in Figure 1-3 can be drawn in the same way, but it can.

We will see how to make this decision for any picture without resorting to hours
of trial and error. Surprisingly, this is related to an eighteenth-century puzzle that
was solved by Leonhard Euler. Perhaps even more unexpected is that there are
present-day applications of the same idea in many hard problems which have to do
with scheduling tasks and constructing networks.

Figure 1-2 A simple envelope.

Example 3 Starting at the lower left corner of the envelope (position 1 in
Figure 1-2), describe a way to draw the envelope of Figure 1-2 without lifting your
pencil and without retracing any lines.

SOLUTION One possibility is to go up, then across the top, do the “flap,”
go diagonally to the lower right corner, then up, diagonally to the lower left
corner, and finally go across the bottom. Using the numbering in Figure 1-2,
we can represent this path by the list of corners we pass through:


1, 3, 4, 5, 3, 2, 4, 1, 2
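A list of corners like the one in Example 3 is easy to check by machine. The Python sketch below is only an illustration: the names edges_of and draws_figure are invented, and because the figure itself is not reproduced here, the set of lines in the envelope is assumed to be exactly the eight segments traversed by the solution path.

```python
# The solution path from Example 3, as a list of corner numbers.
SOLUTION_PATH = [1, 3, 4, 5, 3, 2, 4, 1, 2]


def edges_of(path):
    """Return the line segments a path traverses, ignoring direction."""
    return [frozenset(pair) for pair in zip(path, path[1:])]


# Assumption: the envelope consists of exactly the segments used above.
ENVELOPE = set(edges_of(SOLUTION_PATH))


def draws_figure(path, figure=ENVELOPE):
    """True if the path traces every line of the figure exactly once."""
    used = edges_of(path)
    no_retracing = len(used) == len(set(used))
    return no_retracing and set(used) == figure


print(len(ENVELOPE))                # 8 line segments in the envelope
print(draws_figure(SOLUTION_PATH))  # True
```

Any candidate answer to the questions that follow can be tested by passing its list of corners to draws_figure.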
Figure 1-3

Practice Problem 4


(a) If you have not already done so, find a different solution to Example 3 than the
one given.
(b) Do you think it is possible to start at the upper left corner (position 3)?








A Game of Skill? 


Play this game with a friend. Start with a 4 x 4 grid as shown in Figure 1-4. Players
alternate turns where on each turn a player

1. Chooses a row or column that has an empty space, and
2. Places new X's anywhere in that row or column.

At least one new X must be placed on each turn, and the person to X the last
available space wins. Does either player have a foolproof strategy for winning this
game?

Figure 1-4 The grid game.

Figure 1-5 Grids for Practice Problem 5.

In Chapter 7, we will see why one player does always have a winning strategy
in a broad category of similar games. We will also see how this fact does not help in
most real-world examples, since the strategy is often too complicated to implement
effectively.
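For a board as small as 4 x 4, a computer can simply try every line of play. The following Python sketch is a brute-force illustration, not the theory promised for Chapter 7. The names mover_wins and board_from_rows and the bitmask representation are choices made here, and the rules are taken to be exactly those stated above: pick one row or column, add at least one X to it, and whoever fills the last space wins.

```python
from functools import lru_cache
from itertools import combinations

SIZE = 4                         # 4 x 4 grid, as in Figure 1-4
FULL = (1 << (SIZE * SIZE)) - 1  # bitmask with every cell filled

# Every row and every column, as tuples of cell numbers (row-major order).
LINES = [tuple(r * SIZE + c for c in range(SIZE)) for r in range(SIZE)] + \
        [tuple(r * SIZE + c for r in range(SIZE)) for c in range(SIZE)]


def board_from_rows(rows):
    """Build a bitmask from strings such as "XX.X" ('X' filled, '.' empty)."""
    board = 0
    for r, row in enumerate(rows):
        for c, mark in enumerate(row):
            if mark == "X":
                board |= 1 << (r * SIZE + c)
    return board


@lru_cache(maxsize=None)
def mover_wins(board):
    """True if the player about to move can force a win from this board."""
    for line in LINES:
        empty = [cell for cell in line if not (board >> cell) & 1]
        for count in range(1, len(empty) + 1):
            for chosen in combinations(empty, count):
                new_board = board
                for cell in chosen:
                    new_board |= 1 << cell
                # Filling the last space wins outright; otherwise a move is
                # good if it leaves the opponent in a losing position.
                if new_board == FULL or not mover_wins(new_board):
                    return True
    return False


# Two empty spaces that share no row or column: whoever must move loses.
print(mover_wins(board_from_rows(["XXXX", "XXX.", "XXXX", "X.XX"])))  # False
```

A position is winning exactly when some move leaves the opponent in a losing position, which is why leaving two spaces that cannot be filled in a single turn is a good move.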

Practice Problem 5

(a) It's your turn to move, and the board looks like the grid on the left in Figure 1-5.
Describe a winning strategy.

(b) It's your turn to move, and the board looks like the grid on the right in Figure 1-5.
Can you win if your opponent plays intelligently?


Solutions to Practice Problems 


1 We use the same notation as in Example 1, showing the state of the deck after each step.

(a) If in steps 2, 4, and 6, we move exactly two cards from the top to the bottom, and in step 7 we do not turn over the packet, the trick would proceed as follows:

Step 1. H C D s     Step 6.  S d c h
Step 2. D s H C     Step 7.  S d c h
Step 3. D s c h     Step 8.  S d c H
Step 4. c h D s     Step 9.  S d h C
Step 5. c h S d     Step 10. S c H D

(b) If in steps 2, 4, and 6, we move one, two, and three cards (respectively) from the top to the bottom, and in step 7 we do not turn over the deck, the trick would proceed as follows:

Step 1. H C D s     Step 6.  c h S d
Step 2. s H C D     Step 7.  c h S d
Step 3. s H d c     Step 8.  c h S D
Step 4. d c s H     Step 9.  c h d s
Step 5. d c h S     Step 10. c S D H

2 We will use the same notation as in Example 2.

(a) After the first four eliminations we have 1 x 3 x 5 x 7 X 9 and next in line is 1. The next four eliminations will go as follows:

X x 3 x 5 x 7 x 9
x x 3 x X x 7 x 9
x x 3 x x x 7 x X
x x 3 x x x X x x

(b) After the first four eliminations, we have 1 x 3 x 5 x 7 X, with 3 next in line. Then 3, 7, and 5 go, leaving 1.

(c) After the first eight eliminations, we have 1 x 3 x 5 x 7 x 9 x 11 x 13 x 15 X, with 3 next in line. After the next four eliminations (3, 7, 11, and 15), the people left are 1, 5, 9, and 13, with 5 next in line. Then 5, 13, and 9 go, leaving 1.

(d) After the first eight eliminations, there are eight people left (1, 3, 5, 7, 9, 11, 13, 15) with the second one next in line to be eliminated. This is the same problem as starting with eight people, with the second one eliminated first.

3 (a) You can write them out as AA, ABA, ABB, BAA, BAB, BB. Alternatively, the “game tree” in Figure 1-6 provides a handy visualization of the solutions.

Figure 1-6 A game tree for Practice Problem 3.

First Three Points | Conclusion
AAA                | A, BA, BBA
AAB                | AA, ABA, BAA
ABA                | AA, ABA, BAA
ABB                | AAA
BAA                | AA, ABA, BAA
BAB                | AAA
BBA                | AAA
BBB                | none
Table 1-1 Table for Practice Problem 3

(b) Because A is more likely to win each set, it seems
reasonable that A is more likely to win two sets in a
row than is B.


(c) ABBABAAA and BBBAAAAA are two of many possibilities.

(d) Here are the scores that total to seven points with the winner ahead by at least two points: 7-0, 6-1, 5-2. None of these are possible, since in each case the game would have ended when the winner reached four points.

(e) One way to organize is to think about the first three points, which can be either AAA, AAB, ABA, ABB, BAA, BAB, BBA, BBB. For each of these, we list in Table 1-1 all the possible conclusions of the game that result in a win by A.

4 (a) If we use the same notation as in Example 3, a different way to draw the envelope can be represented as

1, 2, 4, 5, 3, 1, 4, 3, 2

(b) In the graph theory chapter, we will see that this is not possible.

5 (a) By completely filling row 3, you leave two spaces, and your opponent cannot X out both at once.

(b) If you completely fill in any column or any row, your opponent can win. Likewise, if you fill in row 3, column 3 or row 4, column 2, then your opponent has a winning strategy. However, if you fill in row 3, column 2, you have the board in Figure 1-7 and you are guaranteed a win.

Figure 1-7 Grid game in Practice Problem 5.


Exercises for Section 1.1


Exercises numbered in bold face blue have answers or partial answers in the back of the book.

1. Suppose we use C(a, b, c, d) to mean the result of doing the card trick with a, b, c, and d describing what to do in steps 2, 4, 6, and 7, respectively. For example, C(2, 2, 2, no) would mean to move two cards to the bottom at each of steps 2, 4, and 6, and then to not turn over the deck in step 7. Find the final configuration of the deck for each of the following games:

(a) C(1, 1, 1, no)
(b) C(2, 1, 1, yes)
(c) C(3, 3, 3, no)
(d) C(3, 2, 1, yes)

2. Explain why in the card trick, after each of steps 2-7, the packet of cards contains exactly one card reversed from the others.

3. Suppose we use J(p, s) to mean the Josephus game with p people and a skip amount of s. For example, J(10, 2) means the game with 10 people in which every other person is eliminated, starting with person 2. Play the following Josephus games to determine the winner. Also decide where Josephus's friend should stand, that is,

who is the last person who would be eliminated before
the final survivor?

Figure 1-8 Grids for Exercise 12.

(9) 7015,3) 

(b) (2,2) 

(©) 7015,2) 

(d) J(5.4) 

4. Suppose a tennis match is a “best of five” rather than a “best of three.” Draw the complete game tree. If each of the outcomes is equally likely, what percentage of the time would you expect the match to last five sets?

5. We showed that a tennis game cannot consist of exactly seven points. What other numbers of total points are impossible?

6. Draw a game tree to show the fifteen ways in which Player A can win a tennis game in six or fewer points. (See Practice Problem 3.)

7. Suppose you toss three coins, a nickel, dime, and quarter, and record the results in that order. For example, HTH would mean heads on the nickel, tails on the dime, and heads on the quarter.

(a) In a systematic way, list all the different results you could record.

(b) Draw a game tree for the recording of the results.

(c) On the game tree, label each possible result either 0, 1, 2, or 3, indicating how many heads it has. Do you think a person who tosses three coins is more likely to get all three heads, or to get exactly two heads?

8. Suppose you take a true-false test with three questions, and you answer all the questions.

(a) In a systematic way, write all the different answers you could give. Use the representation where, for example, TFF means answering true for Question 1, false for Question 2, and false for Question 3.

(b) Draw a game tree for the different answers.

(c) Suppose the correct answers on the test are TFT. On your game tree, label each possible result either 0, 1, 2, or 3, indicating how many correct answers it has. Do you think a person who guesses is more likely to get exactly one right, or to get exactly two right?

9. Explain the similarity between your answers in Exercises 7 and 8.

10. Explain the similarity between the answers in Exercises 7 and 8 and the list of how the first three points could go in a tennis game between Player A and Player B.

11. In solutions to the envelope puzzle, are there any other places on the envelope where it would be possible to start? After you've investigated this, make a complete list of where you were able to start, and for each of these write down where you finished. What do you notice?

12. Consider the grid game from Practice Problem 5. For each of the grids in Figure 1-8, if it's your turn, can you make a play that guarantees you a win no matter what your opponent does, or can your opponent win no matter what you do? Explain.


1.2 Number Puzzles and Sequences 





Guess the Next Number 


One reason we focus on puzzles and games in this course is that much of mathematics 
seems like a puzzle or a game when viewed the right way. Most of you are familiar 
with the following type of mathematical puzzle. 


What is the next number in each of the following lists? 


1. 5, 7, 9, 11, 13, ___
2. 1, 9, 17, 25, 33, 41, ___
3. 1, 4, 9, 16, 25, 36, ___
4. 2, 4, 8, 16, 32, 64, ___
5. 1, 2, 6, 24, 120, 720, ___





The key to this type of puzzle is to find a pattern in the sequence of numbers. 
By “pattern,” we mean one of three things: 





(i) Each term is related (by arithmetical operations) to previous terms. 
(ii) Each term can be described relative to its position in the sequence. 


(iii) The sequence merely enumerates a set of integers that the reader must recognize
from the scant information given.


Occasionally, a sequence can be described in all three ways, as we see in our first 
example. 


Example 1 How can the sequence 1, 3, 5, 7, 9, ... be described in each of these
ways?

SOLUTION


(i) Each term is 2 more than the previous term and the first term is 1. When
we describe a sequence in this way, we have to describe not only how each
term is related to the previous term(s), but also how the sequence begins.
For example, if we simply say, “Each term is 2 more than the previous
term,” that could also describe the sequence 202, 204, 206, 208, .... The
stipulation that the first term is 1 guarantees that we have a complete
description.

(ii) The $n$th term is given by the formula $2n - 1$. So the first term is $2 \cdot 1 - 1 = 1$,
the second term is $2 \cdot 2 - 1 = 3$, and so on.

(iii) The sequence merely enumerates the positive odd integers.


In this course, we will be concerned primarily with characterizations of the first 
two types. One reason we are interested in both types of characterizations is that 
each has its own particular strengths. The first type of description is often easier to 
spot and describe, such as in the sequence 1, 1,2, 3,5, 8, ..., where each term is the 
sum of the two previous terms. 

On the other hand, the second type of description means having a formula for 
calculating each term relative to its position in the sequence, and this makes it easy 
to calculate a particular term. For example, if the $n$th term of some sequence is given
by the formula $3n - 7$, we can quickly determine that the 100th term in the list is
$3(100) - 7 = 293$.
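The trade-off between the two kinds of description shows up clearly in code. In this Python sketch the function names are invented, and the recursive description of the sequence (first term -4, each term 3 more than the previous one) is inferred here from the closed formula 3n - 7 rather than stated in the text.

```python
def closed_term(n):
    """The nth term from the closed formula 3n - 7: one calculation."""
    return 3 * n - 7


def recursive_term(n, first=-4, step=3):
    """The nth term from the recursive description: start at the first
    term and add the step once for each later position."""
    term = first
    for _ in range(n - 1):
        term += step
    return term


print(closed_term(100))     # 293
print(recursive_term(100))  # 293, reached only after 99 additions
```

Both functions agree on every term, but the closed formula jumps straight to the 100th term while the recursive rule has to pass through the 99 terms before it.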





Practice Problem 1 Consider the sequence 4, 6, 8, 10, 12, ...


(a) Describe the sequence in each of the three ways we have mentioned. 


(b) If you know a certain term of this sequence is 898, what would the next three terms 
be? Which type of description is most useful for answering this question? 


(c) What is the 1,000th term of the sequence? Which type of description is most useful
for answering this question?


Sequences and Sequence Notation


Before we examine some more complicated examples, we will introduce some
notation and terminology.





Definition A recursive formula for a sequence is a formula where each 
term is described in relation to a previous term (or terms) of the sequence. This 
type of description must include enough information on how the list begins for 
the recursive relationship to determine every subsequent term in the list. This 
is sometimes called a recurrence relation. 








Definition A closed formula for a sequence is a formula where each term
is described only in relation to its position in the list.








Definition [Sequence notation] We usually use lowercase letters (a, b, etc.)
to name sequences, and we use subscripting to indicate position in a sequence.
The notation $a_n$ indicates the $n$th term of the sequence we are writing as a. We
read $a_n$ as “a subscript n,” or more usually just “a sub n.”











Example 2 In the sequence of numbers 1, 3, 5, 7, 9, ...,

$a_1 = 1$ means “the first term in the sequence is 1”
$a_2 = 3$ means “the second term in the sequence is 3”
$a_3 = 5$ means “the third term in the sequence is 5”
$a_4 = 7$ means “the fourth term in the sequence is 7”

and so on. A closed formula for this sequence is

$a_n = 2n - 1$

In words, this says, “the $n$th term of the sequence is given by the formula $2n - 1$.”
A recursive formula for this sequence is

$a_1 = 1$ and $a_n = a_{n-1} + 2$

In words, this says, “the first term in the sequence is 1,” and “each term in the sequence
can be found by adding 2 to the previous term.”


Sequence notation is quite similar to the function notation you are familiar with
from algebra courses. When we write $a_n = 2n - 1$, it is similar to writing $f(x) = 2x - 1$.
In each case, the formula gives a rule for calculating the result for a particular
given number. Just as $f(15)$ would be calculated by substituting 15 for $x$ in the
formula $2x - 1$, so $a_{15}$ is found by substituting 15 for $n$ in the formula $2n - 1$. So
$a_{15} = 2 \cdot 15 - 1 = 30 - 1 = 29$.







In this notation, we think of the subscripts as ordinal numbers. The ordinal
numbers are first, second, third, fourth, ... one-hundred-twenty-first, and so on. They
indicate ordering within a list, hence the term ordinal. So when we see a subscript,
we can interpret it as a position within the sequence. For example, $a_{21}$ indicates the
21st term in the sequence. Likewise, $a_n$ is the $n$th term in the sequence, and $a_{n-1}$ would
be the $(n-1)$th term, that is, the term just before the $n$th term. So when we write
$a_n = a_{n-1} + 2$, we can read it as “the $n$th term in the sequence is equal to the $(n-1)$th
term plus 2.” Since the $(n-1)$th term is the one immediately before the $n$th term, we
sometimes just describe it as “the previous term.”

In the preceding discussion, it is implicit that the subscripts start at $n = 1$. We will
use this convention throughout our discussion of numerical sequences. For example,
when we write the closed formula

$a_n = 2n - 1$

it will mean the same as writing

$a_n = 2n - 1$, for integers $n \ge 1$

If for some reason we decide to begin with a subscript other than 1, we will carefully
indicate that fact.

On the other hand, for a recursive formula we explicitly give some initial terms
in the sequence, so the formula typically starts with the first term not explicitly given.
So when we write the recursive formula

$a_1 = 1$ and $a_n = a_{n-1} + 2$

we mean that the recursive relationship holds for all values of n beyond the explicitly
given term for $n = 1$. That is, this is shorthand for

$a_1 = 1$ and $a_n = a_{n-1} + 2$, for integers $n \ge 2$


The following examples illustrate more about the relationship between closed 
formulas and recursive formulas for sequences of numbers. 


Example 3 For the sequence whose closed formula is $a_n = 2^n - 1$, do the
following:
1. List the first five terms of the sequence.
2. Calculate the value of the 20th term.
3. Give a formula for the $(k+1)$th term.
4. Give a formula for $a_{2j-3}$.

SOLUTION
1. $a_1 = 2^1 - 1 = 1$, $a_2 = 2^2 - 1 = 3$, $a_3 = 2^3 - 1 = 7$, $a_4 = 2^4 - 1 = 15$, $a_5 = 2^5 - 1 = 31$.
2. $a_{20} = 2^{20} - 1 = 1{,}048{,}575$.
3. $a_{k+1} = 2^{k+1} - 1$. Just as we find $a_{20}$ by replacing n with 20 in the formula, we
find $a_{k+1}$ by replacing n with k + 1 in the formula.
4. We solve this by replacing n with 2j - 3 in the formula, so $a_{2j-3} = 2^{2j-3} - 1$.














Example 4 Consider the sequence whose recursive formula is $a_1 = 11$ and
$a_n = a_{n-1} + 5$.

1. Write the first five terms of the sequence.
2. Write a recursive formula for the 80th term.
3. Write a recursive formula for the $(k+1)$th term.
4. Write a recursive formula for $a_{3j-2}$.


SOLUTION
1. We can solve the problem by plugging into the recursive formula as follows:

$a_1 = 11$
$a_2 = a_{2-1} + 5 = a_1 + 5 = 11 + 5 = 16$
$a_3 = a_{3-1} + 5 = a_2 + 5 = 16 + 5 = 21$
$a_4 = a_{4-1} + 5 = a_3 + 5 = 21 + 5 = 26$
$a_5 = a_{5-1} + 5 = a_4 + 5 = 26 + 5 = 31$

Alternatively, once we realize that the recursive formula says, “Each term is
equal to the previous term plus 5,” we can simply write down the sequence
as 11, 16, 21, 26, 31.

2. In the formula $a_n = a_{n-1} + 5$, we replace n with 80, giving $a_{80} = a_{80-1} + 5$,
which simplifies to $a_{80} = a_{79} + 5$. In other words, the 80th term is 5 more than
the 79th term.

3. We replace n with k + 1, giving $a_{k+1} = a_{(k+1)-1} + 5$, which simplifies to
$a_{k+1} = a_k + 5$. Notice that $a_{k+1}$ is the next term in the sequence after $a_k$. We can
express this formula in words as “the next term is equal to this term plus 5,”
which is all the original description says.

4. We replace n with 3j - 2, giving $a_{3j-2} = a_{(3j-2)-1} + 5$, which simplifies to
$a_{3j-2} = a_{3j-3} + 5$.
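A recursive formula translates almost word for word into a recursive function. The Python sketch below is an illustration of the sequence in Example 4. The function name a mirrors the sequence name, and the use of lru_cache to remember terms already computed is a convenience added here, not something the text requires.

```python
from functools import lru_cache


@lru_cache(maxsize=None)
def a(n):
    """The sequence of Example 4: a_1 = 11 and a_n = a_(n-1) + 5."""
    if n == 1:
        return 11       # the initial term given explicitly
    return a(n - 1) + 5  # every later term is 5 more than the one before


print([a(n) for n in range(1, 6)])  # [11, 16, 21, 26, 31]
print(a(80) - a(79))                # 5, as part 2 of the solution says
```

Substituting an expression for the subscript works the same way in code: for any integer j of at least 2, a(3 * j - 2) equals a(3 * j - 3) + 5.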











Discovering Patterns in Sequences 


We will now return to the original problems in this section. We will try to do two
things with these puzzles. First, we will determine the next number in each sequence.
To do this, we will have to discover a pattern for the sequence. We will then try to
use the discovered pattern to describe the sequence with either a closed formula or
a recursive formula (or perhaps both). Here are the puzzles again:


1. 5, 7, 9, 11, 13, ___
2. 1, 9, 17, 25, 33, 41, ___
3. 1, 4, 9, 16, 25, 36, ___
4. 2, 4, 8, 16, 32, 64, ___
5. 1, 2, 6, 24, 120, 720, ___


Before reading on, try to discover the pattern and determine the next number 
in each sequence. Then we will discuss the answers. 


Example 5 Give both closed and recursive formulas for the sequence that
begins

5, 7, 9, 11, 13


SOLUTION It appears that the sequence lists odd numbers in order, so the
next number would be 15. We observe that each term is 2 more than the previous
term. It is not too hard to use this fact to develop the recursive formula. We
just have to remember that the “previous” number in the sequence can be
expressed as $a_{n-1}$. The recursive description is

$a_1 = 5$ and $a_n = a_{n-1} + 2$

The closed formula requires a bit more thought. One approach that works well is
to think of an easier sequence (i.e., one you know a closed formula for) with the
same recursive pattern, and compare this sequence to the original. In this case,
the sequence 2, 4, 6, 8, ... has the easy closed formula $b_n = 2n$ and the same
recursive pattern, “each term is 2 more than the previous term.” In Table 1-2 we
line up the sequence whose closed formula we know with the sequence whose
closed formula we wish to find. We notice that each term in the bottom row is
3 more than the corresponding term above it. That is, $a_n = b_n + 3$. Therefore,
the closed formula for $a_n$ is

$a_n = 2n + 3$

n   | 1  2  3  4  ...
b_n | 2  4  6  8  ...
a_n | 5  7  9  11 ...
Table 1-2 Table of Values for Example 5




Example 6 Give both closed and recursive formulas for these sequences.

1. The sequence that begins 1, 9, 17, 25, 33, 41.

2. The sequence that begins 1, 4, 9, 16, 25, 36.

3. The sequence that begins 2, 4, 8, 16, 32, 64.

4. The sequence that begins 1, 2, 6, 24, 120, 720.


SOLUTION


1. The obvious pattern here is that each term is 8 more than the previous term,
so the recursive formula is easy to write:

$a_1 = 1$ and $a_n = a_{n-1} + 8$

An easy sequence with this same pattern is $b_n = 8n$, and we can compare
this with the original as shown in Table 1-3. Each term in the bottom row is
7 less than the term above it, so the closed formula is $a_n = 8n - 7$.

n   | 1  2  3  4  ...
b_n | 8  16 24 32 ...
a_n | 1  9  17 25 ...
Table 1-3 Table of Values for Example 6

2. By now we may be in the habit of looking at differences between successive
terms. For the sequence 1, 4, 9, 16, 25, 36, however, those differences change.
But there is a pattern in the differences:

$a_2 = a_1 + \boxed{3}$
$a_3 = a_2 + \boxed{5}$
$a_4 = a_3 + \boxed{7}$
$a_5 = a_4 + \boxed{9}$
$a_6 = a_5 + \boxed{11}$
$a_7 = a_6 + \boxed{?}$

This might lead us to conclude that $a_7$ will be 13 more than $a_6$. Can we
express the difference in terms of n? For n = 2 (i.e., when the subscript on
the left-hand side of the equation is 2), the number in the box on the right-hand
side of the equation is 3. When n = 3, the boxed number is 5, and so
on. Following this pattern, we conclude that when the subscript on the left-hand
side is n, the number in the box has formula $2n - 1$. This leads to our
recursive formula for the sequence,

$a_1 = 1$ and $a_n = a_{n-1} + (2n - 1)$

For the closed formula, we need to find a pattern in the numbers themselves.
In this case it helps to recognize that the sequence can be described verbally
as “those positive integers that are perfect squares” ($1^2 = 1$, $2^2 = 4$, $3^2 = 9$, ...).
This allows us to write the closed formula

$a_n = n^2$





3. For the sequence that begins 2, 4, 8, 16, 32, 64, each successive term is twice
as large as the previous, so the next term will be 128. The recursive formula
is therefore

$a_1 = 2$ and $a_n = 2a_{n-1}$

For the closed formula, we recognize that the sequence consists of “integers
that are a power of 2,” beginning at $2 = 2^1$, $4 = 2^2$, $8 = 2^3$, and we write

$a_n = 2^n$
4. When we analyze the sequence that begins 1, 2, 6, 24, 120, 720, we may get
stuck for a while. The differences between the terms are 1, 4, 18, 96, 600, and
there is no obvious pattern here. Having just worked a problem where each
term is the previous term times some factor, we look at this to try to detect
a pattern:

$a_2 = a_1 \cdot \boxed{2}$
$a_3 = a_2 \cdot \boxed{3}$
$a_4 = a_3 \cdot \boxed{4}$
$a_5 = a_4 \cdot \boxed{5}$
$a_6 = a_5 \cdot \boxed{6}$
$a_7 = a_6 \cdot \boxed{?}$

It appears that $a_7$ will be $a_6$ times 7, or 5,040. The recursive formula is

$a_1 = 1$ and $a_n = a_{n-1} \cdot n$

For the closed formula, notice that $a_1 = 1$, $a_2 = 1 \cdot 2$, $a_3 = 1 \cdot 2 \cdot 3$,
$a_4 = 1 \cdot 2 \cdot 3 \cdot 4$, and so on. We write

$a_n = 1 \cdot 2 \cdot 3 \cdots n$





Table 1-4 Table of Values for Example 7

$b_n$ | 3  9  27  81
$a_n$ | 2  6  18  54

Note: This “product of the integers from 1 to n” occurs in a variety of
mathematical contexts. In fact, mathematicians have created a special notation
for it, namely n!, which is read as “n factorial.”
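The four answers in Example 6 can be spot-checked by generating terms both ways and comparing them. The following is only a sketch: the helper names `terms_from_recursion` and `terms_from_closed_formula` are invented for this illustration, and agreement on the first few terms is evidence for a formula, not a proof of it.

```python
from math import factorial

def terms_from_recursion(first, step, count):
    """Return the first `count` terms, where a_1 = first and a_n = step(a_{n-1}, n)."""
    terms = [first]
    for n in range(2, count + 1):
        terms.append(step(terms[-1], n))
    return terms

def terms_from_closed_formula(formula, count):
    """Return the terms a_1, ..., a_count produced by a closed formula."""
    return [formula(n) for n in range(1, count + 1)]

# Each entry: (first term, recursive step, closed formula), as found in Example 6.
EXAMPLE_6 = [
    (1, lambda prev, n: prev + 8, lambda n: 8 * n - 7),          # 1, 9, 17, ...
    (1, lambda prev, n: prev + (2 * n - 1), lambda n: n ** 2),   # 1, 4, 9, ...
    (2, lambda prev, n: 2 * prev, lambda n: 2 ** n),             # 2, 4, 8, ...
    (1, lambda prev, n: prev * n, factorial),                    # 1, 2, 6, ...
]

for first, step, formula in EXAMPLE_6:
    recursive_terms = terms_from_recursion(first, step, 6)
    closed_terms = terms_from_closed_formula(formula, 6)
    print(recursive_terms, recursive_terms == closed_terms)
```

Each printed line shows the six terms built from the recursive formula, followed by whether the closed formula produced the same list.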


Practice Problem 2 For each sequence, identify the next number in the sequence.
Then give either a closed formula, a recursive formula, or a verbal description. Give
more than one description when you can.

(a) 5, 10, 15, 20, 25, ____
(b) 5, 10, 20, 40, 80, ____
(c) 4, 9, 16, 25, 36, ____


Sometimes each term in a sequence is calculated from more than one of the
previous terms. A simple example occurs in the famous Fibonacci numbers. This
sequence was developed by Leonardo Pisano Fibonacci (1170-1250) as a recreational
problem about the growth of rabbit populations, but it turns out to have wide-ranging
applications. The sequence begins

1, 1, 2, 3, 5, 8, 13, 21, 34, ...

The sequence follows the rule that, starting with the third term, each term is the
sum of the two terms that precede it. For example, 3 = 2 + 1, 5 = 3 + 2, 8 = 5 + 3. If
we use $F_n$ to represent the $n$th term of this sequence, this relationship can be stated
succinctly as

$F_n = F_{n-1} + F_{n-2}$ for $n \geq 3$

where the initial conditions $F_1 = F_2 = 1$ must also be given to exactly describe the
sequence. We should note that there is a formula that produces the $n$th Fibonacci
number,

$F_n = \frac{1}{\sqrt{5}} \left[ \left( \frac{1 + \sqrt{5}}{2} \right)^n - \left( \frac{1 - \sqrt{5}}{2} \right)^n \right]$

but it is not at all obvious from a casual examination of the sequence.
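To see the recursive rule and a closed formula side by side, the sketch below builds Fibonacci numbers from the rule "each term is the sum of the two before it" and compares them with the standard closed form, often called Binet's formula, $F_n = (\varphi^n - \psi^n)/\sqrt{5}$ with $\varphi = (1 + \sqrt{5})/2$ and $\psi = (1 - \sqrt{5})/2$. The function names are illustrative, and because the closed form is evaluated in floating point the result is rounded, which is only reliable for moderately small n.

```python
from math import sqrt

def fibonacci_terms(count):
    """First `count` Fibonacci numbers from F_1 = F_2 = 1, F_n = F_{n-1} + F_{n-2}."""
    terms = [1, 1]
    while len(terms) < count:
        terms.append(terms[-1] + terms[-2])
    return terms[:count]

def fibonacci_closed(n):
    """Closed form (Binet's formula), rounded to absorb floating-point error."""
    root5 = sqrt(5)
    phi = (1 + root5) / 2
    psi = (1 - root5) / 2
    return round((phi ** n - psi ** n) / root5)

print(fibonacci_terms(9))
print([fibonacci_closed(n) for n in range(1, 10)])
```

Both lines print the same nine numbers that open the sequence in the text.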





Example 7 Discover a closed formula for the sequence whose recursive formula
is $a_1 = 2$, and $a_n = 3a_{n-1}$.
SOLUTION To discover the formula, we first write down the first few terms
of the sequence:

$a_1 = 2$, $a_2 = 6$, $a_3 = 18$, $a_4 = 54$

In Table 1-4, we compare this to the simpler sequence $b_n = 3^n$, which also has
the recursive pattern, “Each term is 3 times the previous term.” Observe that
each term in the bottom row is 2/3 times the corresponding term in the $b_n$ row.
Therefore,

$a_n = \frac{2}{3} \cdot 3^n$

Perhaps a more obvious connection between the two rows is that each
term in the $a_n$ row is twice the previous term in the $b_n$ row. This relationship can
be literally transcribed, $a_n = 2 \cdot b_{n-1}$, which results in the same closed formula
as above.


Discovering closed formulas is much more difficult than finding recursive formulas,
so in this course, we will concentrate on how to verify a proposed closed formula
for a sequence whose recursive description we already know. This will be a focus in the
next chapter. Before leaving the topic altogether, however, we will look at a classic
example of a closed formula that comes from an important type of recursive description.

Example 8 Discover a closed formula for the sequence whose recursive formula
is $s_1 = 1$, and $s_n = s_{n-1} + n$.
SOLUTION This recursive description is of the type, “Each term is the
previous term plus (something).” If we do not evaluate this sum at each stage, we
can see an alternate way to describe $s_n$:

$s_1 = 1$
$s_2 = 1 + 2$
$s_3 = (1 + 2) + 3$
$s_4 = (1 + 2 + 3) + 4$

Hence, $s_n$ is the sum of the first n positive integers, and so there is a nice
way to look at $s_n$ in order to discover a closed formula for it. We will start by
writing $s_n$ as a sum twice, once with terms in increasing order and once with
terms in decreasing order:

$s_n = 1 + 2 + \cdots + (n-1) + n$
$s_n = n + (n-1) + \cdots + 2 + 1$

Now by adding the terms aligned in “columns” above, we have

$2s_n = (n+1) + (n+1) + \cdots + (n+1) + (n+1)$

Since the right-hand side above consists of (n + 1) added to itself n times, we
can conclude that

$s_n = \frac{n(n+1)}{2}$
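The closed formula of Example 8 can be compared against the recursive description directly. This is a minimal sketch with illustrative function names; it checks finitely many values of n and so supports the formula without proving it.

```python
def s_recursive(n):
    """s_1 = 1 and s_n = s_{n-1} + n, computed by repeated addition."""
    total = 1
    for k in range(2, n + 1):
        total += k
    return total

def s_closed(n):
    """Closed formula n(n + 1)/2; the product n(n + 1) is always even."""
    return n * (n + 1) // 2

print([s_recursive(n) for n in range(1, 9)])
print([s_closed(n) for n in range(1, 9)])
```

The two printed lists agree term by term.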
Although discovering a closed formula can be difficult, even when we already 


know a recursive formula, the converse is rarely true. If we know a closed formula, 
then finding an alternate recursive description is usually straightforward. 


Example 9 Discover a recursive formula for the sequence whose closed formula
is $a_n = 3n + 5$.
SOLUTION To discover the formula, we first write down the first few terms of
the sequence. Using the formula, we get $a_1 = 3 \cdot 1 + 5 = 8$, $a_2 = 3 \cdot 2 + 5 = 11$,
$a_3 = 3 \cdot 3 + 5 = 14$, and so on. We summarize the results by just writing the
answers: 8, 11, 14, 17, 20, 23, .... Then we look for a pattern that relates each
term to the previous term, and we see that each one is 3 more than the previous.
This leads to the recursive formula

$a_1 = 8$ and for $n \geq 2$, $a_n = a_{n-1} + 3$

We use the subscript n − 1 to indicate “the previous term,” that is, “the term
just before the $n$th term.”


Actually, finding a recursive formula from a closed formula is often just a matter
of doing some algebra. In the previous example, since we are given the closed formula
$a_n = 3n + 5$, we know that $a_{n-1} = 3(n-1) + 5$, and so verifying

$a_n = a_{n-1} + 3$

is a simple matter of verifying algebraically that

$3n + 5 = (3(n-1) + 5) + 3$

Let’s look at an example with a more complicated formula to see this idea in action.


Example 10 Verify that the sequence given by the closed formula $a_n = 3^n - 2$
satisfies the recursive formula

$a_n = 3 \cdot a_{n-1} + 4$

SOLUTION Since $a_n = 3^n - 2$, we know that $a_{n-1} = 3^{n-1} - 2$, and so

$3 \cdot a_{n-1} + 4 = 3 \cdot (3^{n-1} - 2) + 4$
$= 3^n - 6 + 4$
$= 3^n - 2$
$= a_n$

as desired.
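The algebra in Example 10 settles the question for every n at once. As a complementary sanity check, one can also test the relationship numerically for a range of n, as in this sketch; `satisfies_recursion` is a hypothetical helper written for this illustration, and passing the check for finitely many n is not a substitute for the algebraic verification.

```python
def satisfies_recursion(closed, recursion, first_n, last_n):
    """Check closed(n) == recursion(closed(n - 1), n) for n = first_n, ..., last_n."""
    return all(
        closed(n) == recursion(closed(n - 1), n)
        for n in range(first_n, last_n + 1)
    )

def example_10(n):
    """Closed formula a_n = 3^n - 2 from Example 10."""
    return 3 ** n - 2

# The recursion a_n = 3 * a_{n-1} + 4 holds for every n that is tested.
print(satisfies_recursion(example_10, lambda prev, n: 3 * prev + 4, 2, 50))

# A slightly wrong recursion, a_n = 3 * a_{n-1} + 5, is rejected.
print(satisfies_recursion(example_10, lambda prev, n: 3 * prev + 5, 2, 50))
```

The same helper can be pointed at other closed formulas and candidate recursions to catch algebra slips before writing out a verification.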


Practice Problem 3 Verify that the sequence given by the closed formula
$a_n = n^2 + n$ satisfies the recursive formula

$a_n = a_{n-1} + 2n$


Our final examples illustrate some ways we can use what we know about a 
sequence to learn more about the sequence without “starting from scratch.” 


Example 11 For the sequence whose recursive formula is $a_1 = 11$, and
$a_n = a_{n-1} + 5$, suppose someone tells you they have already calculated the 213th term of the
sequence, and the answer is 1,071. What is $a_{214}$?
SOLUTION We do not need to start over from $a_1$ to solve the problem. We
simply calculate the 214th term from the 213th. Replacing n with 214 in the
recursive formula gives $a_{214} = a_{213} + 5$. Since we know $a_{213}$ is 1,071, we get

$a_{214} = a_{213} + 5 = 1{,}071 + 5 = 1{,}076$
Example 12 For the sequence whose closed formula is $a_n = 2^n - 1$, do the
following:

1. Calculate the value of the 20th term.

2. Someone tells you they have calculated the sum of the first 19 terms, and the answer
was 1,048,555. What is the sum of the first 20 terms?

SOLUTION

1. $a_{20} = 2^{20} - 1 = 1{,}048{,}575$.

2. This is similar to the previous example, except that it involves summing up
the terms. Again we don’t have to start over from the first term; we can
reason that the sum of the first 20 terms can be calculated by adding the 20th
term to the sum of the first 19 terms. So our answer is 1,048,555 + 1,048,575 =
2,097,130. We can write this in symbols as

$a_1 + a_2 + \cdots + a_{20} = (a_1 + a_2 + a_3 + \cdots + a_{19}) + a_{20}$
$= 1{,}048{,}555 + 1{,}048{,}575$
$= 2{,}097{,}130$


Does it strike you as odd that in this sequence the 20th term is exactly 20 more
than the sum of the first 19 terms? Do you think the 21st term will be exactly 21 more
than the sum of the first 20 terms? Look at Exercises 19 and 20 below to explore this
some more.
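The numbers quoted in Example 12 are easy to reproduce, which also makes it easy to experiment with the question just raised. The sketch below uses illustrative variable names and simply recomputes the 20th term and the two sums.

```python
def a(n):
    """Closed formula a_n = 2^n - 1 from Example 12."""
    return 2 ** n - 1

sum_first_19 = sum(a(k) for k in range(1, 20))   # a_1 + a_2 + ... + a_19
term_20 = a(20)
sum_first_20 = sum_first_19 + term_20            # reuse the known sum, as in the text

print(term_20)                 # the 20th term
print(sum_first_19)            # the sum someone "told" us
print(sum_first_20)            # the sum of the first 20 terms
print(term_20 - sum_first_19)  # how much larger the 20th term is
```

Changing 19 and 20 to other consecutive values is one way to gather evidence before attempting Exercises 19 and 20.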


Notation for Sums 


Example 12 demonstrates an important source of number patterns in mathematics.
Summing sequences of numbers is essential in mathematical applications ranging
from the analysis of algorithms to integral calculus. We will study sums in more detail 
in Chapter 2, but for now we will just get acquainted with the traditional notation 
for sums. 





Definition For a sequence of numbers $a_k$ with $k \geq 1$, we use the notation

$\sum_{k=1}^{n} a_k$

to denote the sum of the first n terms of the sequence. This is called sigma
notation for the sum. In informal situations, we will sometimes write
“$a_1 + a_2 + \cdots + a_n$” instead of using sigma notation.











A simple variation on this notation is to write $\sum_{k=m}^{n} a_k$ or $a_m + a_{m+1} + \cdots + a_n$
(where m < n) whenever we want to sum the numbers in the sequence starting with
the $m$th term and ending with the $n$th term. This notation is easier to read than it is to
write, since writing it requires that we know a closed form for the sequence $a_k$ at the
outset, and this is not always the case.


Example 13 Write each of the following sums using sigma notation:


L. The sum of the first 10 numbers in the sequence ay = y with k > 1. 
2. 2 + 4 + 8 + 16 + 32 + 64

3. 2 + 6 + 18 + 54 + 162

4. (−4) + (−1) + 2 + 5 + 8 + 11 + 14


SOLUTION In each case, we must first determine a closed form for the terms
$a_k$ to be summed, and then determine the correct indices of the first term and
the last term in the sum.


1. In this case, the form of ay is given, so we simply have 3g }. 


2. The terms being summed have closed formula $a_k = 2^k$, the first term in the
sum is $a_1 = 2$, and the last term in the sum is $a_6 = 64$. Hence, this sum is
represented by $\sum_{k=1}^{6} 2^k$.

3. The terms being summed have closed formula $a_k = 2 \cdot 3^{k-1}$, the first term in
the sum is $a_1 = 2$, and the last term in the sum is $a_5 = 162$. Hence, this sum
is represented by $\sum_{k=1}^{5} 2 \cdot 3^{k-1}$.

4. The terms being summed have closed formula $a_k = 3k - 7$, the first term in
the sum is $a_1 = -4$, and the last term in the sum is $a_7 = 14$. Hence, this sum
is represented by $\sum_{k=1}^{7} (3k - 7)$.
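Sigma notation translates almost word for word into a loop over the index k. As a sketch, the hypothetical helper `sigma` below takes a formula for the k-th term together with the first and last index, and is used to confirm that the sigma expressions for parts 2 through 4 of Example 13 add up to the same totals as the written-out sums.

```python
def sigma(term, first, last):
    """Sum term(k) for k = first, first + 1, ..., last (both ends included)."""
    return sum(term(k) for k in range(first, last + 1))

# Part 2: 2 + 4 + 8 + 16 + 32 + 64
print(sigma(lambda k: 2 ** k, 1, 6), 2 + 4 + 8 + 16 + 32 + 64)

# Part 3: 2 + 6 + 18 + 54 + 162
print(sigma(lambda k: 2 * 3 ** (k - 1), 1, 5), 2 + 6 + 18 + 54 + 162)

# Part 4: (-4) + (-1) + 2 + 5 + 8 + 11 + 14
print(sigma(lambda k: 3 * k - 7, 1, 7), (-4) + (-1) + 2 + 5 + 8 + 11 + 14)
```

Note that `range(first, last + 1)` is needed because Python ranges stop one short of their upper bound, while the upper index in sigma notation is included.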


Practice Problem 4 Evaluate each of the following sums: 


(@) Ya Qk- 1) 
(6) Eho 


© YR 
@ Tia gm 


Sigma notation will be used periodically throughout this book, so it is very 
important that you become comfortable using it. 


Solutions to Practice Problems

1 (a) Method 1: The first term is 4, and each term is 2 more
than the previous. Method 2: The $n$th term is given
by 2(n + 1). Method 3: These are the even integers
starting at 4.
(b) From Method 1 above, the next three terms are
900, 902, 904.
(c) From Method 2 above, the 1,000th term is 2(1,000 +
1) = 2,002.

2 (a) The next number is 30. A closed formula is $a_n = 5n$,
a recursive formula is $a_1 = 5$, $a_n = a_{n-1} + 5$, and a
description is “multiples of 5.”

(b) Next is 160. A closed formula is $a_n = 5 \cdot 2^{n-1}$, and a
recursive formula is $a_1 = 5$, $a_n = 2 \cdot a_{n-1}$.

(c) Next is 49. A closed formula is $a_n = (n+1)^2$, a
recursive formula is $a_1 = 4$, $a_n = a_{n-1} + (2n + 1)$, and
a description is “perfect squares starting at 4.”

3 Since $a_n = n^2 + n$, we know that $a_{n-1} = (n-1)^2 + (n-1)$, and so

$a_{n-1} + 2n = ((n-1)^2 + (n-1)) + 2n$
$= (n^2 - 2n + 1) + (n - 1) + 2n$
$= n^2 + n$
$= a_n$

as desired.

4 (a) 1 + 3 + 5 + 7 + 9 + 11 = 36
(b) 1 + 3 + 9 + 27 + 81 = 121
(c) (909
(d) Q»i-bhté

Exercises for Section 1.2

1. For each of these sequences, determine the next number
in the sequence. Be able to explain how you got it. Then
give either a closed formula or a recursive formula. If
you can give both, do so.

(a) 2, 4, 6, 8, 10, 12, ____

(b) 4, 9, 16, 25, 36, ____

(c) 2, 5, 10, 17, 26, 37, ____ (Hint: How does
this relate to the sequence 1, 4, 9, 16, 25, 36?)

(d) 2, 4, 8, 16, 32, 64, ____

(e) 1, 2, 4, 8, 16, 32, ____

(f) 1, 3, 7, 15, 31, 63, ____

(g) 2, 5, 8, 11, 14, 17, ____

(h) 4, 16, 64, 256, 1024, ____

(i) 5, 10, 20, 40, 80, 160, ____

(j) 5, 9, 17, 33, 65, 129, ____

(k) 2, 5, 10, 50, 500, ____

(l) 1, 5, 9, 13, 17, ____

(m) 3, 6, 9, 12, 15, ____

(n) 3, 5, 9, 17, 33, ____

2, For each of the following sequences, calculate a» and 
axs, and algebraically simplify the expressions: 

(a) an 

O) 2. 

(©) an 

(d) a, 

©) an 

0 a 

3. If you form the decreasing list n, n − 1, n − 2, ..., what
will be the $k$th number in the list? Your answer should
involve variables n and k.

4. Here are verbal descriptions of sequences. Use these
descriptions to write recursive formulas for the sequences.

(a) The sequence starts with 2, and each entry is 2 more
than the previous entry.

(b) The sequence starts with 1, and each entry is 6 more
than the previous entry.

(c) The sequence is obtained by starting with 2, and
each subsequent entry is 1 more than twice the
previous entry.

(d) The sequence is obtained by starting with 2, and
each subsequent entry is the square of the previous
entry.

5. For each of these sequences described in English, write
out the first five (or more if necessary) terms of the
sequence. Use your answer to discover a closed formula
for the sequence.

(a) The sequence starts with 2, and each entry is 2 more
than the previous entry.

(b) The sequence starts with 1, and each entry is 6 more
than the previous entry.

(c) The sequence is obtained by starting with 2, and each
subsequent entry is 1 more than twice the previous
entry.

(d) The sequence is obtained by starting with 2, and each
subsequent entry is the square of the previous entry.

6. For each of the given sequences, use the given closed 
formula and algebra to check whether the given recur- 
sive formula is true 








pou 
pear 





14. 




(a) Given that a, — 2" — 1 for all n > 1, is it true that 
iaa 

(b) Given that a, — for all n> 1, is it true that 
Pr 

(c) Given that a, 
a,25:a,4—3? 

(d) Given that a, 
y= tite? 





in +3 for all n > 1, is it true that 





— 4 for all n = 1, is it true that 


7. For each of these sequences given in closed form, write
out the first five (or more if necessary) terms of the
sequence. Use your answer to discover a recursive
formula for the sequence.

(a) $a_n = 3n + 1$
(b) $a_n = 5n - 2$
(c) $a_n = 2n + 7$
(a, =H +1 
(e) $a_n = 7n - 6$
[E 
(Ba 2-1 


8. For each of these sequences given recursively, write out
the first five (or more if necessary) terms of the
sequence. Use your answer to discover a closed formula
for the sequence.








(a) ay = Sia, 
() m = 2:0, 
(e) m Sun 
[TELA 
(©) a ia, 





Oa 


6a, 


9. For each of these sequences given recursively, write out
the first five (or more if necessary) terms of the
sequence. Use the idea developed in Example 8 to
discover a closed formula for each sequence.











(a) a1 = Sas = ana + (0-4) 
(b) $a_1 = 1$; $a_n = a_{n-1} + (2n - 1)$
(© a i-a. +4n 





10. Give an algebraic expression that describes the $n$th
whole number larger than 1,964.

11. Give an algebraic expression that describes the $n$th odd
three-digit number.

12. Give an algebraic expression that describes the $n$th even
two-digit number.

13. Give an algebraic expression that describes the $n$th odd
perfect square.

14. Form a sequence using the relationship, “$a_n$ is the ones’
digit of ($a_{n-1} + a_{n-2}$).” Thus all your terms will be from
the set {0, 1, 2, ..., 9}, which means that this sequence
will have to repeat itself at some point. Try different
pairs of starting numbers from {0, 1, 2, ..., 9} and see
how long it takes for the sequence to repeat itself. (For
example, the sequence 6, 8, 4, 2 follows this rule and
would repeat itself if continued.) What is the longest
possible such cycle?
15. What does the equation

$(a + b)^3 = a^3 + 3a^2 b + 3ab^2 + b^3$

say when a = 2 and b = −1?

16. What does the equation

$C(n, k) = C(n-1, k) + C(n-1, k-1)$

say when n = 2k?

17. Give an algebraic expression for the $n$th term in the
arithmetic progression

a, a + 3, a + 6, a + 9, ...

18. Give an algebraic expression for the $n$th term in the
arithmetic progression

a, a + d, a + 2·d, a + 3·d, ...


19. For the sequence whose closed formula is $a_n = 2^n$, form
the new sequence $s_n$ by the definition

$s_n = a_1 + a_2 + \cdots + a_n$

Fill in the blanks in Table 1-5, and make a conjecture
about the relationship between the bottom two rows.

20. For the sequence whose closed formula is $a_n = 2^n - 1$,
form the new sequence $s_n$ by the definition

$s_n = a_1 + a_2 + \cdots + a_n$

Fill in the blanks in Table 1-6, and make a conjecture
about the relationship between the bottom two rows.

21. Evaluate each of the following sums: 

(9) Ekik 

O) Dea Tk 

© YR 

® Shox 

© Eka 3-2k 


Table 1-5 Table for Exercise 19

$n$   | 1  2  3  4  5  6  7  8
$a_n$ | 2  4  8
$s_n$ |

Table 1-6 Table for Exercise 20

$a_n$ | 1  3
$s_n$ |

22. Consider the sequence in Practice Problem 3 with recursive
formula $a_n = a_{n-1} + 2n$ with $a_1 = 2$.
(a) Show that as = Ya 2k 
(b) Show that a; = Y, 2k 
(©) Show that a; = Sj, 2k 
23. Consider the sequence s, 
(a) Evaluate ss and sio. 
(b) Explain in words why s, 
(©) Give a closed formula for s, as an algebraic expres- 
sion using the variable n. 
24. Express each of the following sums using sigma notation:
(a) 4 + 8 + 12 + 16 + 20 + 24 + 28 + 32 + 36
(b) 24344416417418
(c) 5 + 5 + 5 + 5 + 5 + 5
(d) (@itititeta
(e) 1 + 9 + 17 + 25 + 33 + 41 + 49
25. Consider the Josephus game from the previous section.
(a) Suppose we use J(p, s) to mean the Josephus game
with p persons and a “skip” amount of s. For example,
J(10, 2) means the game with 10 people and
with the elimination starting out 2, 4, 6. For each
value of p from 2 to 12, determine the winner for
the J(p, 2) game, and fill in Table 1-7.
(b) If you did this correctly, there should be a discernible
pattern in the answers. Describe that
pattern in words. Then use the pattern to predict
the answer for 13, 14, 15, and 16 people.
(c) Predict the answer for 32 people; for 31 people; for
30 people.

Table 1-7 Table for Exercise 25

p  | Winner of Game J(p, 2)
2  |
3  |
4  |
5  |
6  |
7  |
8  |
9  |
10 |
11 |
12 |

26. These are somewhat more challenging number sequences
than those given in Exercise 1. For each of these
sequences, give a characterization of the sequence. For
each, we suggest the type that may be easiest to use, but
you may give characterizations of other types.

(a) 1, 9, 25, 49, 81, ____ (Formula for $n$th term)

(b) 1, 2, 2, 4, 8, 32, 256, ____ (Relate term to
previous terms)

(c) 1, 1, 4, 36, 1764, ____ (Relate term to
previous terms)

(d) 1, 4, 27, 256, 3125, ____ (Formula for $n$th term)

(e) 1, 5, 7, 17, 31, 65, ____ (Formula for $n$th term)

(f) 1, 3, 4, 7, 11, 18, ____ (Relate term to previous
terms)

(g) 2, 3, 5, 7, 11, 101, 131, 151, ____ (Recognizable
set of integers)

(h) 6, 8, 10, 14, 15, 21, 22, ____ (Recognizable
set of integers)

27. Refer to Exercise 25, where we introduced the notation
J(p, s) for the Josephus game with p persons and a skip
amount of s.

(a) Suppose five people wearing name tags labeled 1, 2,
3, 4, and 5 line up in that order, and play the game
with a skip amount of 4. It is easy to verify that
Player 1 wins the game. (Try it!) Then answer these
questions without actually playing the game.

i. If five people wearing name tags Anne, Sue,
Matt, Tom, and Linda line up in that order and
play the game with a skip amount of 4, who will
win?

ii. If five people wearing name tags labeled 5, 6, 1,
2, 3 line up in that order and play the game with
a skip amount of 4, who will win?

(b) Suppose six people wearing name tags labeled 1, 2,
3, 4, 5, 6 line up in that order and play the game with
a skip amount of 4.

i. Who is the first person eliminated?

ii. After the first person is eliminated, who is left
(and in what order)?

iii. Without finishing the game, use the answer to a
previous part of this question to tell who will win.

(c) The preceding exercise establishes that, knowing
that J(5, 4) is 1, we can easily determine that J(6, 4)
is 5. Use this fact, and similar reasoning, to
determine J(7, 4).

(d) Suppose you know that J(15, 4) is 13. Use this to
calculate J(16, 4).


28. In this exercise, the value of each term depends on a
term that appears further back in the sequence, and the
rule for $a_n$ depends on whether n is even or odd. Give
the first seven terms of each sequence.


() a= Landay = | age 12, ifnisodd 


29. In this exercise, we build a sequence whose values are
strings of A's and B's, similar to those used in Section
1.1 to represent tennis matches. We define the sequence
recursively by the following rule:

$a_1$ = B
$a_n$ = A $a_{n-1}$ BA

That is, $a_n$ consists of the letter A, followed by the string
for $a_{n-1}$, followed by the string BA. For example, $a_2$ is
the string consisting of A followed by $a_1$ (which is B)
followed by BA, making $a_2$ = ABBA.

(a) Calculate terms as through as of this sequence. 

(b) Modify the recursive definition to yield the sequence
AB, AABB, AAABBB, AAAABBBB, and
so on. That is, $a_n$ should consist of n occurrences of
A followed by n occurrences of B.

(c) Modify the recursive definition to yield the
sequence ABB, AABB, AAABB, AAAABB,
AAAAABB, and so on. That is, $a_n$ should consist
of n occurrences of A followed by BB.


30. This exercise combines the ideas in Exercises 28 and 29.
Define the sequence as $a_1$ = A and


a= [%2B, — ifniseven 
^7 Vaay2A, ifnis odd 
For example, a= aja B=aB=AB and as= 
aia A =a; A= ABA, 
(a) Write down the first ten strings in this sequence. 
(b) Use the results from part (a) to determine a and 
(©) Suppose you already know that as 
ABBAAABAA, Give the values of aes and ae. 













1.3 Truth-tellers, Liars, and Propositional Logic 





Raymond Smullyan (1919– ) has written several books of puzzles about conversations
held on a fictional island where inhabitants are either truth-tellers or liars. The point
of the puzzles is to discern which inhabitant is of which type, using only the statements
you hear them make, if possible.


Example 1 You meet two inhabitants of Smullyan's Island. A says, “Exactly
one of us is lying,” and B says, “At least one of us is telling the truth.” Who (if anyone)
is telling the truth?


To solve this puzzle, we will consider all possibilities for each person's status as 
a liar or truth-teller. In the table below, we will use p to stand for the phrase, “A is 
truthful,” and q to stand for the phrase, “B is truthful.” With this convention, we can 
organize our consideration of all the possibilities as follows: Either p is true or p is 
false, and either q is true or q is false. In each case, we can determine the truth or
falsity of what A and B said.

The secret to avoiding dizziness with these puzzles is to first consider each 
statement on its own instead of worrying about who said it. For example, we 
can tell that the phrase, "At least one of us is telling the truth" is true if either 
of p or q is true. We can make this conclusion without knowing who made the 
statement. 

In this same way, we can use a table to analyze the truth value of each spoken
statement for every possible combination of truth values of the simpler statements
p and q.

p  q  || Statement 1: Exactly One is Lying | Statement 2: At Least One is Truthful
T  T  || F | T
T  F  || T | T
F  T  || T | T
F  F  || F | F

We call this analytic device a truth table. It is a valuable tool for analyzing the 
truth value of any compound statement based on the truth values of its simpler parts, 
and we will see later in this section that truth tables are useful for far more than 
solving logic puzzles. Now that we have this representation of the problem, we just 
need to think about what a solution should look like. 

SOLUTION We can think about the meaning of each row in the truth table
given above.

• The first row of the table represents the hypothetical situation where both
A and B are truth-tellers (because both p and q are true). This cannot be
the real situation, however, because Statement 1 is false in this row, so a
truth-teller like A could not have said it.

• The second row of the table cannot represent the real situation, since it
cannot be that the liar B would have made a true statement.

• The third row is ruled out, because in this row the liar A has made a true
statement.

• In the final row, both A and B are liars and both statements are false.

We conclude that the fourth row of the truth table describes the only possible
real situation, and so the solution to the puzzle is: Both inhabitants are
lying.

In this example, we found rows where the truth value describing each inhabi- 
tant's type matches the truthfulness of his or her statement. Another way to describe 
this is to say that the part of the row to the left of the double vertical lines should 
match the part to the right of those lines. In this way, the use of a truth table helps 
organize our thinking about the problem, making the search for a solution more
routine.

Before we proceed to our next example, let us say a few words about build- 
ing truth tables. For the two variables p and q there are a total of four possible 
combinations. We could list them in some random order, but if we proceed system- 
atically, we can be sure we don't leave any out. Most people do this by thinking 
analogously to our counting system. When we count from 20 to 39, for example, we 
write 


20, 21, 22, 23, ...,29, 30, 31, 32, ...,39 





The tens’ digit stays fixed while we cycle through all the possible ones’ digits. Then
we go to the next tens’ digit and repeat the cycle of the ones’ digit. Likewise, we wrote
the truth table in the order TT, TF, FT, FF, with the right-hand value cycling through
its values for T, then again for F.
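The counting analogy above is the same order in which a nested loop visits its values: the rightmost position changes fastest. As a sketch, Python's standard `itertools.product` produces rows in exactly this order when each variable is offered True before False; the function name `truth_table_rows` is invented for this illustration.

```python
from itertools import product

def truth_table_rows(num_variables):
    """List the rows of a truth table as strings such as 'TF', rightmost value cycling fastest."""
    rows = []
    for row in product([True, False], repeat=num_variables):
        rows.append("".join("T" if value else "F" for value in row))
    return rows

print(truth_table_rows(2))
print(truth_table_rows(3))
```

For n variables the list always has $2^n$ rows, which is one reason a systematic order matters more as variables are added.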

Those who built “game trees” for exercises in Section 1.1 will be interested to
know that this method of listing the rows is actually the same thing, as we see in
Figure 1-9. For just two variables this is admittedly not that important, but as we add
more variables, it becomes more important to be consistent. For example, for four
variables this order would be TTTT, TTTF, TTFT, TTFF, TFTT, TFTF, TFFT, TFFF,
FTTT, FTTF, FTFT, FTFF, FFTT, FFTF, FFFT, FFFF.

Figure 1-9 The order of rows in a truth table

Example 2 Suppose you meet three inhabitants of Smullyan's Island and have
the following conversation. Can you tell which inhabitants (if any) are lying?

• A says, “Exactly one of us is telling the truth.”
• B says, “We are all lying.”
• C says, “The other two are lying.”



SOLUTION The analysis of this puzzle is the same once we add a symbol r 
to represent the statement "C is truthful.” The truth table below begins with 
TTT and follows the order we discussed earlier. In the table, we mark the row 
where the truth values of p, q, and r match the truth values of what A, B, and 
C say, respectively. 


   p  q  r  || Statement 1: Exactly One is Truthful | Statement 2: All are Lying | Statement 3: A and B are Lying
   T  T  T  || F | F | F
   T  T  F  || F | F | F
   T  F  T  || F | F | F
*  T  F  F  || T | F | F
   F  T  T  || F | F | F
   F  T  F  || T | F | F
   F  F  T  || T | F | T
   F  F  F  || F | T | T


We conclude that the only possible situation is where p is true while q and
r are false. Therefore, A is the only inhabitant telling the truth.


Practice Problem 1 You meet two inhabitants of Smullyan's Island. A says, “We 
are both telling the truth." B says, “A is lying.” Who (if anyone) is telling the truth? 


In each of the previous examples, exactly one row of the truth table has been 
consistent, and we can conclude that this one row gives the solution. However, it is 
possible to have situations where more than one row of the table is consistent, in 
which case we cannot determine the status of all the speakers. On the other hand, if 
no row is consistent, then we have arrived at a paradox of sorts. Here are the simplest 
examples of each of these situations. 


Example 3

1. You meet an inhabitant A, who says, “I am telling the truth.” Is she?
2. You meet an inhabitant A, who says, “I am lying.” Is he?

SOLUTION In each part we need only one variable p to represent the
statement “A is a truth-teller.”

1. The table for this statement looks like this:

p  || Statement 1: I am Telling the Truth
T  || T
F  || F

We cannot conclude anything, since both rows are consistent.

2. The table for this statement looks like this:

p  || Statement 1: I am Lying
T  || F
F  || T

Since neither row is consistent, we have a paradox.
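The row-by-row search used in Examples 1 through 3 is mechanical enough to hand to a short program. The sketch below is one possible encoding, not the only one: each speaker's statement is written as a function of the whole row (True meaning "truthful"), and a row is kept when every statement's truth value matches its speaker's type. The names `consistent_rows`, `example_1`, and so on are invented for this illustration.

```python
from itertools import product

def consistent_rows(statements):
    """Return the rows in which each statement is true exactly when its speaker is truthful.

    statements[i] takes a row (a tuple of booleans, True = truthful) and
    returns the truth value of what speaker i said in that situation.
    """
    rows = []
    for row in product([True, False], repeat=len(statements)):
        if all(said(row) == row[i] for i, said in enumerate(statements)):
            rows.append(row)
    return rows

example_1 = [
    lambda row: row.count(False) == 1,      # A: "Exactly one of us is lying."
    lambda row: row.count(True) >= 1,       # B: "At least one of us is telling the truth."
]
example_2 = [
    lambda row: row.count(True) == 1,       # A: "Exactly one of us is telling the truth."
    lambda row: row.count(True) == 0,       # B: "We are all lying."
    lambda row: not row[0] and not row[1],  # C: "The other two are lying."
]
example_3_part_1 = [lambda row: row[0]]      # A: "I am telling the truth."
example_3_part_2 = [lambda row: not row[0]]  # A: "I am lying."

for puzzle in (example_1, example_2, example_3_part_1, example_3_part_2):
    print(consistent_rows(puzzle))
```

One consistent row means a unique solution, several rows mean the speakers' types cannot all be determined, and an empty list corresponds to a paradox.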




The Logic of Propositions 


Before we look at more examples, let's agree on some notation to make writing these 
things a little easier. We will refer to this notation as propositional logic notation. 





Definition We call a sentence a proposition if it is unambiguously true
or false. A propositional variable is simply a variable name that stands for a
proposition.

A formal proposition will mean a proposition written using propositional
logic notation according to the following rules:

1. Any propositional variable alone is a formal proposition.

2. Given formal propositions p and q, the compound statement p ∧ q is a
formal proposition. The proposition p ∧ q stands for, “Both p and q are
true,” and we read this as “p and q.”

3. Given formal propositions p and q, the compound statement p ∨ q is a
formal proposition. The proposition* p ∨ q stands for, “Either p or q is
true,” and we read this as “p or q.” In mathematics, when we say that either
p or q is true, we mean that at least one is true. Saying “or” always allows
for the possibility that both are true.

4. Given a formal proposition p, the compound statement ¬p is a formal
proposition. The proposition ¬p stands for, “It is not the case that p is
true,” and we read this as “not p.” We refer to ¬p as the negation of p.














This system allows for the creation of complicated expressions by repeatedly
using these basic rules.

Example 4 Show that the expression (p ∨ q) ∧ ¬(p ∧ q) is a formal proposition
using the definition above.
SOLUTION As usual in mathematics, we perform our analysis from inside
the parentheses out.

• Since p and q are propositional variables, they are each formal
propositions by (1) in the definition.

• Since p and q are formal propositions, p ∨ q and p ∧ q are formal
propositions by (3) and (2) in the definition, respectively.

• Since p ∧ q is a formal proposition, ¬(p ∧ q) is a formal proposition
by (4) in the definition.

• Since p ∨ q and ¬(p ∧ q) are formal propositions, so is (p ∨ q) ∧ ¬(p ∧ q)
by (2) in the definition.


* One way to keep ∧ and ∨ straight in your head is that the word “AND” begins with the letter A, and
the ∧ symbol looks sort of like capital A.




It is customary to consider these operations as having precedence rules (much
as +, −, ·, and / in arithmetic). The “not” operation (¬) has highest precedence,
followed by “and” (∧) and then “or” (∨). For example, the expression ¬p ∧ ¬q ∨ p
is the same as ((¬p) ∧ (¬q)) ∨ p. As with arithmetic operations, use parentheses if
you are worried about ambiguity in your expressions.
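
Python's Boolean operators happen to follow the same precedence order (not, then and, then or), so the grouping described above can be checked mechanically. The following is only an illustrative sketch, not part of the formal system: the function names are made up for this example, and True and False stand in for the truth values T and F.

```python
from itertools import product

def unparenthesized(p, q):
    # Python groups this as ((not p) and (not q)) or p
    return not p and not q or p

def parenthesized(p, q):
    # the explicit grouping given in the text
    return ((not p) and (not q)) or p

def other_grouping(p, q):
    # a different grouping, shown only for contrast
    return (not p) and ((not q) or p)

rows = list(product([True, False], repeat=2))
same = all(unparenthesized(p, q) == parenthesized(p, q) for p, q in rows)
differs = any(unparenthesized(p, q) != other_grouping(p, q) for p, q in rows)
print(same, differs)
```

The first value reports that the precedence rules and the explicit parentheses agree on every truth assignment; the second reports that a different placement of parentheses really does change the meaning.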


Practice Problem 2 You meet two inhabitants of Smullyan's Island, A and B.
Using p to represent the proposition “A is truthful” and q to represent the proposition
“B is truthful,” how would you write each of these statements as formal propositions?


(a) A is lying.
(b) At least one of us is truthful.
(c) Either B is lying or A is.

(d) Exactly one of us is lying.


We should make a comment here about the meaning of the word “or” in mathematics.
This can be a slightly confusing issue since many times in English we use the
word “or” to imply a choice must be made. In conversational English, we must rely
on context to resolve the issue, and we often have to ask the speaker to clarify the
meaning. You may have had conversations like this:


Waiter: Do you want soup or salad?
You: Yes, I'll have both.
Waiter: No, which one do you want?


In this situation, the waiter meant for the word “or” to convey that a choice was to
be made, while perhaps you thought that neither soup nor salad was included in the
price of the meal and the waiter was simply asking if you would like an appetizer, in
which case it is reasonable to get both things. To avoid this ambiguity of the English
language, in mathematics when we say, “p OR q is true,” we will always mean that
either one or both statements are true.


Example 5 You meet two inhabitants of Smullyan's Island. A says, “Either B
is lying or I am,” and B says, “A is lying.” Who (if anyone) is telling the truth?
SOLUTION As before, we build a truth table to consider all possibilities for
the status of the two speakers. Notice that the statement “Either B is lying or
A is” is the same as saying, “At least one is lying.”


          Statement 1                  Statement 2
A  B  |  Either B is Lying or A is  |  A is Lying
T  T  |  F                          |  F
T  F  |  T                          |  F
F  T  |  T                          |  T
F  F  |  T                          |  T




Only row 2 of the table is consistent, so we can conclude that A is truthful and
B is lying.
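
The row-by-row consistency check used in this kind of puzzle can also be sketched in code. The snippet below is an illustration under the usual Smullyan's Island assumption that a truthful inhabitant's statement must be true and a liar's statement must be false; the function name `consistent` is hypothetical.

```python
from itertools import product

def consistent(a_truthful, b_truthful):
    # A says: "Either B is lying or I am."
    statement_a = (not b_truthful) or (not a_truthful)
    # B says: "A is lying."
    statement_b = not a_truthful
    # Each statement's truth value must match its speaker's status.
    return statement_a == a_truthful and statement_b == b_truthful

solutions = [(a, b) for a, b in product([True, False], repeat=2)
             if consistent(a, b)]
print(solutions)  # [(True, False)]
```

The single surviving pair says that A is truthful and B is lying, which agrees with the conclusion drawn from the truth table.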
Using the operations ∧, ∨, and ¬, we can write various combinations of propositions
in a concise manner. For example, suppose we use e to represent the statement
“Sue is an English major” and j to represent the statement “Sue is a junior.” Here
are several possible combinations we could write.


Example 6 How would you write each of these propositions using combinations
of e (meaning “Sue is an English major”) and j (meaning “Sue is a junior”) with
the operations ∧, ∨, and ¬?


1. Sue is a junior English major. 
2. Sue is either an English major or she is a junior. 

3. Sue is a junior, but she is not an English major. 

4. Sue is neither an English major nor a junior. 

5. Sue is exactly one of the following: an English major or a junior. 


SOLUTION 


1. The sentence implies that both conditions are true. That is, she is a junior
and she is an English major. So we write j ∧ e.

2. Although in English there might be some ambiguity about this, in mathematics
the word or always means either one or both. So we write j ∨ e.

3. In English when we say “but,” it means the same as saying “and,” so the
statement is j ∧ (¬e). Because of the precedence conventions, we do not
need the parentheses, and we could simply write j ∧ ¬e.

4. There are two ways to think of “neither-nor” in English. The first is that it
means the “opposite” (or negation) of either-or. This leads to the statement
¬(j ∨ e). Another choice is to reason that “neither this nor that” means
“not this and also not that.” This leads to the statement ¬j ∧ ¬e. These two
statements are equivalent.

5. This is sometimes called the exclusive or of the two conditions: one or the
other is true, but not both. Again, there are two equivalent solutions to the
problem. One solution comes from writing “one or the other is true,
but not both” in symbols, as (j ∨ e) ∧ ¬(j ∧ e). The other solution comes
from reasoning, “She could either be a junior but not an English major, or
she could be an English major but not a junior.” This gives the symbolic
statement (j ∧ ¬e) ∨ (e ∧ ¬j).
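
The claims in parts 4 and 5 that two different formulas say the same thing can be checked by brute force over all four truth assignments. This is a small sketch rather than a proof technique from the text; the helper `agree_everywhere` and the lambda names are invented for the illustration.

```python
from itertools import product

def agree_everywhere(f, g, n):
    """True if f and g give the same value for every assignment of n truth values."""
    return all(f(*v) == g(*v) for v in product([True, False], repeat=n))

# Part 4: two readings of "neither an English major nor a junior"
neither_1 = lambda j, e: not (j or e)
neither_2 = lambda j, e: (not j) and (not e)

# Part 5: two readings of "exactly one of the two"
xor_1 = lambda j, e: (j or e) and not (j and e)
xor_2 = lambda j, e: (j and not e) or (e and not j)

print(agree_everywhere(neither_1, neither_2, 2))  # True
print(agree_everywhere(xor_1, xor_2, 2))          # True
```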


Practice Problem 3 You meet three inhabitants, A, B, and C, of Smullyan's Island.
Using p to represent the statement “A is truthful,” q to represent the statement “B is
truthful,” and r to represent the statement “C is truthful,” how would you write each
of these phrases?


(a) A and B are lying.
(b) All three are lying.
(c) One of us is lying.
(d) Exactly one of the three is truthful.


Truth Tables for Formal Propositions 


In addition to saving time and space, another reason for creating propositional logic 
notation is so we can understand the relationship between the truth value of a com- 
plex statement and the truth values of its simpler component propositions. We can 
illustrate this relationship for each of the basic propositional connectives and, or, and 
not using truth tables. 




















p  q | p ∧ q        p  q | p ∨ q        p | ¬p
T  T |   T          T  T |   T          T |  F
T  F |   F          T  F |   T          F |  T
F  T |   F          F  T |   T
F  F |   F          F  F |   F


As in the solutions to the logic puzzles, a truth table shows us the truth value of
a compound statement for every possible combination of truth values of its simple
components.
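
Because a truth table simply lists every combination of truth values, it can be generated mechanically. The sketch below is one way to do that; the function names `truth_table` and `show` are hypothetical, and the rows are produced in the same T-before-F order used in the tables of this section.

```python
from itertools import product

def truth_table(variables, formula):
    """Return one tuple (values..., result) per truth assignment."""
    rows = []
    for values in product([True, False], repeat=len(variables)):
        rows.append(values + (formula(*values),))
    return rows

def show(variables, label, formula):
    fmt = lambda b: "T" if b else "F"
    print(" ".join(variables), "|", label)
    for row in truth_table(variables, formula):
        print(" ".join(fmt(v) for v in row[:-1]), "|", fmt(row[-1]))

show(["p", "q"], "p and q", lambda p, q: p and q)
show(["p", "q"], "p or q", lambda p, q: p or q)
show(["p"], "not p", lambda p: not p)
```

Passing a different lambda to `show` gives the table for any other compound statement built from the same connectives.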


Example 7 Give the truth table for the formal proposition p ∧ ¬q.

SOLUTION
p  q | ¬q | p ∧ ¬q
T  T | F  |   F
T  F | T  |   T
F  T | F  |   F
F  F | T  |   F














In the solution above, we created a column in the table for ¬q in order to get
the column for p ∧ ¬q by simply applying the truth table rule for ∧ to the p column
and ¬q column. Specifically, since A ∧ B is true only when both propositions A and
B are true, p ∧ ¬q is true only when both propositions p and ¬q are true, so
the p and ¬q columns in the truth table are all we need in order to form the column
for p ∧ ¬q.

By making the construction of a truth table a formal process, we can easily build 
tables for complicated expressions without becoming mired in their grammatical or 
logical muck. 


Practice Problem 4 Complete the following truth table:
[Truth table with one row for each of the four truth assignments to p and q; the column headings are not legible in the source.]


As the statements get complicated, it is necessary to have a step-by-step proce- 
dure for building a truth table. Once again, we will work from inside the parentheses 
out. 


Example 8 Give the truth table for the formal proposition (p ∨ q) ∧ ¬(p ∧ q).
SOLUTION This is the same proposition we looked closely at in Example 4.
We can use the order in which the formation rules were used to get the order
in which we should build the truth table. The final column is formed from the
two columns that precede it using the truth table rule for ∧.


p  q | p ∧ q | ¬(p ∧ q) | p ∨ q | (p ∨ q) ∧ ¬(p ∧ q)
T  T |   T   |    F     |   T   |         F
T  F |   F   |    T     |   T   |         T
F  T |   F   |    T     |   T   |         T
F  F |   F   |    T     |   F   |         F


Negation and Logical Equivalence 


As we indicated earlier, we call ¬p the negation of p. We now examine this concept
in a little more detail for two situations: (1) propositions involving comparisons, and
(2) propositions involving the ∧ and ∨ connectives.


Example 9 Let p stand for the proposition “Tammy has more than two children.”
Express the negated proposition as an English-language sentence.
SOLUTION A common mistake is to use “less than” as the negation of “more
than.” The correct negation is “less than or equal to.” We write, “Tammy has
less than or equal to 2 children,” or “The number of children Tammy has is less
than or equal to 2.”

If we use the symbol c to indicate how many children Tammy has, then we
may write the original proposition p mathematically as “c > 2.” The negation
¬p can be written as “c ≤ 2.”


Example 10 In Example 9, if the value of c
is actually 3, what does this mean? Which is true, the original proposition p or its
negation ¬p?
SOLUTION This means that Tammy has three children, and we conclude
that p is a true statement.





Practice Problem 5 Let g stand for John's current grade point average. To be
admitted to graduate school, a person needs at least a 3.0 grade point average. Use g to
express the proposition p, “John's grade point average is high enough to be admitted
to graduate school.” Then use g to express the negated proposition ¬p.


We now address the issue of forming the negation of propositions involving
the ∧ and ∨ connectives. In Example 6 we found two different ways to express the
proposition “Sue is neither an English major nor a junior,” using the symbols e for
“Sue is an English major” and j for “Sue is a junior.” We stated that the two solutions
were “equivalent.” To explore exactly what we mean by this, we build a truth table
for our first statement “¬(j ∨ e).” In the truth table that follows, we calculate the
final result, ¬(j ∨ e), by first calculating j ∨ e and then negating that column.


j  e | j ∨ e | ¬(j ∨ e)
T  T |   T   |    F
T  F |   T   |    F
F  T |   T   |    F
F  F |   F   |    T


We see that the statement “¬(j ∨ e)” is true precisely when both j and e are false.
Next we build a truth table for the second statement “¬j ∧ ¬e.” We calculate the
final result by first calculating the intermediate values ¬j and ¬e.


j  e | ¬j  ¬e | ¬j ∧ ¬e
T  T | F   F  |    F
T  F | F   T  |    F
F  T | T   F  |    F
F  F | T   T  |    T


We see that the statement “¬j ∧ ¬e” is true precisely when both j and e are
false. The two statements (“¬(j ∨ e)” and “¬j ∧ ¬e”) are true under exactly the
same circumstances; that is, they have the same truth table. This is what we mean when we
say the statements are equivalent.





Definition Two statements are said to be logically equivalent if they have
the same truth value for every row of the truth table.











There is another important point to be made: the logical equivalence does not
depend on the particular meaning of the symbols j and e in this example. The fact that
these two statements are equivalent is just a special instance of a general property.
For any statements p and q, it is always true that ¬(p ∨ q) is equivalent to ¬p ∧ ¬q.
This is one of the two facts that have come to be known as DeMorgan's laws.


Proposition 1 (DeMorgan's laws) Let p and q be any propositions. Then


1. ¬(p ∨ q) is logically equivalent to ¬p ∧ ¬q.
2. ¬(p ∧ q) is logically equivalent to ¬p ∨ ¬q.





In words, to negate a condition containing and (∧) or or (∨), negate each part and
change the and to or or vice versa.
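
The rule “negate each part and swap and with or” can be tested exhaustively. The sketch below checks both laws for two propositions, as stated in Proposition 1; the check for three propositions is an extension assumed here for illustration, not something Proposition 1 itself states. The function name `de_morgan_holds` is hypothetical.

```python
from itertools import product

def de_morgan_holds(n):
    """Check both laws for every assignment of n truth values."""
    for values in product([True, False], repeat=n):
        # not (p or q or ...)  versus  (not p) and (not q) and ...
        law_or = (not any(values)) == all(not v for v in values)
        # not (p and q and ...) versus  (not p) or (not q) or ...
        law_and = (not all(values)) == any(not v for v in values)
        if not (law_or and law_and):
            return False
    return True

print(de_morgan_holds(2), de_morgan_holds(3))  # True True
```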




PROOF The first of these is established in the previous discussion (for proposi- 
tional variables j and e), and the second is left for Exercise 16. 


Practice Problem 6 One way to write, “Sue is not both a junior and an English
major” is to write the negation of “Sue is both a junior and an English major.” This
gives the statement “¬(j ∧ e).” Use DeMorgan's laws to write an equivalent statement,
and explain what this statement says in words.


There are several other situations where we can easily see that two statements
are logically equivalent. As a simple example, for any statements p and q, it is easy
to see that p ∧ q is equivalent to q ∧ p. Similarly, we know from our use of ordinary
English that any statement p is logically equivalent to its “double negative” ¬(¬p).
To see this formally, we just calculate the truth table:


p | ¬p | ¬(¬p)
T | F  |   T
F | T  |   F

The fact that the first and third columns are identical establishes the equivalence.


Example 11 Write the negation of the phrase “Sue is a junior, but she is
not an English major” two different ways. What does it mean for this phrase to be
false?
SOLUTION In an earlier example, we wrote, “Sue is a junior, but she is
not an English major,” as j ∧ ¬e. The simplest way to write the negation is
to enclose this in parentheses, preceded by the not operation: “¬(j ∧ ¬e).”
By DeMorgan's laws and the double negative property, this is equivalent to
“¬j ∨ e” (change and to or and negate each part). In words, “Either Sue is not
a junior, or she is an English major.”


Example 12 Using g to indicate John's score on the recent math test, we can
express the statement “John got a B on the test” as “(g ≥ 80) ∧ (g < 90).” Write the
negation as an expression in symbolic logic notation.
SOLUTION This example uses both ideas we have considered in the section.
To negate the statement, we must change the ∧ to ∨ and negate each part.
To negate each part, we must take care to properly negate the comparison
operations. The solution is “(g < 80) ∨ (g ≥ 90).” In words, “John either got
below a B or above a B.”
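
Negating a compound comparison is a common source of off-by-one mistakes, so it is worth checking at the boundary values. The sketch below assumes the B range is 80 ≤ g < 90 and that scores are whole numbers from 0 to 100 (both are assumptions made for the illustration); the function names are hypothetical.

```python
def got_b(g):
    # "John got a B": 80 <= g < 90 (assumed cutoffs)
    return (g >= 80) and (g < 90)

def negation(g):
    # DeMorgan: swap "and" for "or" and negate each comparison
    return (g < 80) or (g >= 90)

def careless_negation(g):
    # a tempting mistake: flip each comparison but mishandle equality
    return (g <= 80) or (g > 90)

scores = range(0, 101)
ok = all(negation(g) == (not got_b(g)) for g in scores)
bad = [g for g in scores if careless_negation(g) != (not got_b(g))]
print(ok, bad)  # True [80, 90]
```

The careless version goes wrong exactly at the two boundary scores, which is where “less than” and “less than or equal to” differ.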


Practice Problem 7 


(a) Using y to indicate yesterday's high temperature and t for today's high temperature,
negate the proposition p written as “(y < 0) ∨ (t < 0).” Express both p and ¬p
as English-language sentences.

(b) You meet three inhabitants of Smullyan's Island, A, B, and C. Using p to represent
the statement “A is truthful,” q to represent the statement “B is truthful,” and r to
represent the statement “C is truthful,” how would you write each of these phrases
in symbolic logic notation? Use the double negative property and DeMorgan's
laws to make your answer as simple as possible.

i. Not all of us are lying. This is the negation of “All of us are lying.”
ii. Not one of us is lying. This is the negation of “At least one of us is lying.”





There are many other examples of the logical equivalence of statements that can
be established using truth tables. To make these easy to state, we will use the notation
p ≡ q to represent the statement “Propositions p and q are logically equivalent.”





Example 13 Use a truth table to establish the following logical equivalences:


1. ¬(¬p ∨ q) ≡ p ∧ ¬q
2. p ∧ (p ∨ q) ≡ p
SOLUTION


1. In the table below, because the columns for ¬(¬p ∨ q) and p ∧ ¬q are identical, we
see that ¬(¬p ∨ q) is logically equivalent to p ∧ ¬q.


p  q | ¬p | ¬p ∨ q | ¬(¬p ∨ q) | ¬q | p ∧ ¬q
T  T | F  |   T    |     F     | F  |   F
T  F | F  |   F    |     T     | T  |   T
F  T | T  |   T    |     F     | F  |   F
F  F | T  |   T    |     F     | T  |   F


2. The equivalence follows from the fact that, in the table below, the fourth
column is the same as the first.


p  q | p ∨ q | p ∧ (p ∨ q)
T  T |   T   |     T
T  F |   T   |     T
F  T |   T   |     F
F  F |   F   |     F


Practice Problem 8 Use a truth table to show that p ∨ (q ∧ r) is logically
equivalent to (p ∨ q) ∧ (p ∨ r).


Example 14 Examine the statement (p ∨ ¬q) ∨ (¬p ∨ q) with a truth table.
SOLUTION Here is the truth table:


p  q | p ∨ ¬q | ¬p ∨ q | (p ∨ ¬q) ∨ (¬p ∨ q)
T  T |   T    |   T    |          T
T  F |   T    |   F    |          T
F  T |   F    |   T    |          T
F  F |   T    |   T    |          T


In this example, the column containing the final result (for the complete state- 
ment) consists of all T's. This means that no matter what truth values we assign to 
the variables p and q, the overall statement is true. It is also possible to have just the 
opposite situation, where all combinations of truth values yield a false result. This is 
significant enough to warrant special names for these kinds of propositions. 





Definition 





1. A tautology is a proposition whose value is True for all possible combinations
of the truth values of the propositional variables.

2. A contradiction is a proposition whose value is False for all possible com- 
binations of the truth values of the propositional variables. 
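
Whether a proposition is a tautology, a contradiction, or neither depends only on the final column of its truth table, so the classification is easy to automate. The following is a minimal sketch; the function name `classify` and the sample formulas are chosen for illustration.

```python
from itertools import product

def classify(formula, n):
    """Classify a formula of n variables by the final column of its truth table."""
    column = [formula(*v) for v in product([True, False], repeat=n)]
    if all(column):
        return "tautology"
    if not any(column):
        return "contradiction"
    return "neither"

print(classify(lambda p, q: (p or not q) or (not p or q), 2))  # tautology
print(classify(lambda p: p and not p, 1))                      # contradiction
print(classify(lambda p, q: p or q, 2))                        # neither
```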








DeMorgan's laws and the double negation property were our first examples of
logical equivalence, but there are many more. The following theorem summarizes
several that will be important later in this course. The proof of each of these statements
consists of simply creating a truth table for the two given propositions and
verifying they are the same.


Theorem 2 Let p, q, and r stand for any propositions. Let t indicate a tautology,
and c indicate a contradiction. Then all the logical equivalences shown in Table 1-8
hold.


PROOF DeMorgan's laws, the double negative property, one of the absorption
properties, and one of the distributive properties are established throughout this
section. See the exercises at the end of this section for the rest.


(a) Commutative           p ∧ q ≡ q ∧ p                      p ∨ q ≡ q ∨ p
(b) Associative           (p ∧ q) ∧ r ≡ p ∧ (q ∧ r)          (p ∨ q) ∨ r ≡ p ∨ (q ∨ r)
(c) Distributive          p ∧ (q ∨ r) ≡ (p ∧ q) ∨ (p ∧ r)    p ∨ (q ∧ r) ≡ (p ∨ q) ∧ (p ∨ r)
(d) Identity              p ∧ t ≡ p                          p ∨ c ≡ p
(e) Negation              p ∨ ¬p ≡ t                         p ∧ ¬p ≡ c
(f) Double negative       ¬(¬p) ≡ p
(g) Idempotent            p ∧ p ≡ p                          p ∨ p ≡ p
(h) DeMorgan's laws       ¬(p ∨ q) ≡ ¬p ∧ ¬q                 ¬(p ∧ q) ≡ ¬p ∨ ¬q
(i) Universal bound       p ∨ t ≡ t                          p ∧ c ≡ c
(j) Absorption            p ∧ (p ∨ q) ≡ p                    p ∨ (p ∧ q) ≡ p
(k) Negations of t and c  ¬t ≡ c                             ¬c ≡ t


Table 1-8 Logical Equivalences


Observe that each property (except (f)) actually consists of two properties. For
example, the “commutative” property has two versions, one for the ∧ operation and
one for the ∨ operation.

Since each of these statements is true under all truth assignments, we can think
of these logical equivalences as providing us with a substitution rule of sorts. For
example, since (p ∧ q) ∧ r ≡ p ∧ (q ∧ r), we can replace any expression of the form
(p ∧ q) ∧ r with the expression p ∧ (q ∧ r) without changing the truth value. This
is an instance of the substitution rule for propositional logic. We can use this rule
along with the facts in Theorem 2 to demonstrate the logical equivalence of two
propositions through what appears to be sheer algebraic manipulation.





Example 15 Verify the logical equivalence p ∨ (¬p ∧ q) ≡ p ∨ q using the
substitution rule and quoting the appropriate parts of Theorem 2.
SOLUTION Each proposition below is equivalent to the previous proposition
by the cited part of Theorem 2 and the substitution rule when necessary:


p ∨ (¬p ∧ q) ≡ (p ∨ ¬p) ∧ (p ∨ q)    part (c), distributive
             ≡ t ∧ (p ∨ q)           part (e), negation
             ≡ (p ∨ q) ∧ t           part (a), commutative
             ≡ p ∨ q                 part (d), identity


In each step, we have replaced part of the expression with an equivalent expression.
For example, in the first step we use the second of the two distributive
properties to replace the entire expression by the equivalent expression. In the
second step, we replace (p ∨ ¬p) with t by quoting the first of the two negation
properties.
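
A chain of substitutions like this one can be sanity-checked by confirming that every line of the derivation has the same truth table. The sketch below does that for the four steps of this example; it is a check on the result, not a replacement for citing Theorem 2, and the constant `T` is an assumed stand-in for the tautology t.

```python
from itertools import product

T = True  # stands in for the tautology t

steps = [
    lambda p, q: p or ((not p) and q),         # starting expression
    lambda p, q: (p or (not p)) and (p or q),  # after (c), distributive
    lambda p, q: T and (p or q),               # after (e), negation
    lambda p, q: (p or q) and T,               # after (a), commutative
    lambda p, q: p or q,                       # after (d), identity
]

# Every step must give the same value on every truth assignment.
all_equal = all(
    len({step(p, q) for step in steps}) == 1
    for p, q in product([True, False], repeat=2)
)
print(all_equal)  # True
```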


Practice Problem 9 Verify p ∧ (¬p ∨ q) ≡ p ∧ q using the substitution rule and
quoting the appropriate parts of Theorem 2.





Solutions to Practice Problems 


1 Here is the truth table:

          Statement 1                        Statement 2
A  B  |  We are Both Telling the Truth  |  A is Lying
T  T  |  T                              |  F
T  F  |  F                              |  F
F  T  |  F                              |  T
F  F  |  F                              |  T

The third row is the only one that is consistent, so we
conclude that A is lying and B is telling the truth.

2 (a) “A is lying” is represented by ¬p.
(b) “At least one of us is truthful” is represented by
p ∨ q.
(c) “Either B is lying or A is” is represented by ¬q ∨ ¬p.
(d) “Exactly one of A and B is lying” is represented by
(p ∧ ¬q) ∨ (q ∧ ¬p).

3 (a) “A and B are lying” is represented by ¬p ∧ ¬q.
(b) “All are lying” is represented by ¬p ∧ ¬q ∧ ¬r.
(c) “One of us is lying” (take this to mean “at least one
of us is lying”) is represented by ¬p ∨ ¬q ∨ ¬r.
(d) “Exactly one is truthful” is represented by
(p ∧ ¬q ∧ ¬r) ∨ (q ∧ ¬p ∧ ¬r) ∨ (r ∧ ¬p ∧ ¬q).

4 The complete truth table is shown below.
[Truth table not legible in the source.]

5 We write p as “g ≥ 3.0” and ¬p as “g < 3.0.”

6 By the second of DeMorgan's laws, “¬(j ∧ e)” is equivalent
to “¬j ∨ ¬e.” This says that (since she isn't both)
it must be true either that Sue is not a junior or that she
is not an English major.

7 (a) We write ¬p as “(y ≥ 0) ∧ (t ≥ 0).” In English,
p says that the temperature was below 0 at least
one of the two days, and ¬p says it was at or above
0 both days.

(b) (i) “All are lying” is represented by ¬p ∧ ¬q ∧ ¬r.
The negation is ¬(¬p ∧ ¬q ∧ ¬r). By
DeMorgan's laws, to negate we change and to
or and negate each part, giving p ∨ q ∨ r. Observe
that the negation of “All are lying” is “At
least one is telling the truth.”
(ii) “At least one of us is lying” is represented by
¬p ∨ ¬q ∨ ¬r. The negation is ¬(¬p ∨ ¬q ∨ ¬r),
which simplifies to p ∧ q ∧ r. The negation
of “At least one of us is lying” is “All are telling
the truth.”

8 One way to lay out the truth table is given below.
Because the fifth and eighth columns are identical, the
two expressions at the top of those columns are logically
equivalent.

p  q  r | q ∧ r | p ∨ (q ∧ r) | p ∨ q | p ∨ r | (p ∨ q) ∧ (p ∨ r)
T  T  T |   T   |      T      |   T   |   T   |         T
T  T  F |   F   |      T      |   T   |   T   |         T
T  F  T |   F   |      T      |   T   |   T   |         T
T  F  F |   F   |      T      |   T   |   T   |         T
F  T  T |   T   |      T      |   T   |   T   |         T
F  T  F |   F   |      F      |   T   |   F   |         F
F  F  T |   F   |      F      |   F   |   T   |         F
F  F  F |   F   |      F      |   F   |   F   |         F

Truth Table for Practice Problem 8.

9 This problem is the “dual” of the previous example:
each ∧ has been changed to ∨ and vice versa. It is not
surprising that the solution mimics the solution to the
example. The duality principle will be addressed further
in Section 3.4.

p ∧ (¬p ∨ q) ≡ (p ∧ ¬p) ∨ (p ∧ q)    part (c), distributive
             ≡ c ∨ (p ∧ q)           part (e), negation
             ≡ (p ∧ q) ∨ c           part (a), commutative
             ≡ p ∧ q                 part (d), identity


Exercises for Section 1.3 


1. Solve each of these logic puzzles by using truth tables.

(a) You come across two inhabitants of Smullyan's Island.
A says, “We are both telling the truth,” and
B says, “A is lying.” Who, if anyone, is telling the
truth?

(b) You come across three inhabitants of Smullyan's Island.
A says, “B or C is lying,” B says, “C is lying,”
and C says, “A and I are both telling the truth.” Who,
if anyone, is telling the truth?

(c) You come across three inhabitants of Smullyan's
Island. A says, “B and C are both lying,” B says,
“Only one of the other two is lying,” and C says,
“At least one of us is lying.” Who, if anyone, is telling
the truth?

2. Give an example of what two people might say to create
a paradox, where each person's individual statement
does not on its own create a paradox.


3. Suppose you meet two inhabitants A and B of
Smullyan's Island. Let p represent the statement “A is
truthful” and q represent the statement “B is truthful.”
Write each of the following in symbolic logic notation:

(a) A is lying or B is telling the truth.

(b) Neither A nor B is lying.

(c) A is truthful but B is not.

4. Upon meeting a third island inhabitant C, you can continue
the previous exercise, adding r to represent the
statement “C is truthful.” Use these names along with
the basic logic operations to write each of the following
English sentences in symbolic logic notation:

(a) A is lying and B or C is truthful.

(b) A and B are lying, or A and C are truthful.

(c) At least two people are telling the truth.

(d) Exactly two people are telling the truth.


5. Use the letter f to represent the statement “this person
is female,” a for “this person is over age 30,” and m for
“this person is a math major.” Use these names along
with the basic operations and (∧), or (∨), and not (¬)
to write each of the following English sentences with
symbolic logic:

(a) This person is female but not a math major.

(b) This person is a female math major over age 30.

(c) This person is neither female nor a math major.

(d) Either this person is not female or this person is
over age 30.

6. Use the phrases in the previous exercise to write an
English statement equivalent to each of the following
propositions:
(9) of ^m 
©) -fava 
© f^(avm) 

@ (faa) vf ^m) 

7. Use the letter t to represent the statement “Bill is tall,”
d for “Bill is dark,” and h for “Bill is handsome.” As in
Exercise 5, use these names along with the basic logic
operations to write each of the following English sentences
in symbolic logic notation:

(a) Bill is tall, dark, and handsome.

(b) Bill is tall and dark, but not handsome.

(c) Either Bill is tall or he is handsome, but not both.
(d) Bill is neither tall nor handsome.

8. Use the phrases in the previous exercise to write an
English statement equivalent to each of the following
propositions:

(a) (Ev dv) ^E dA T) 

b) ^od 

(e) d (t ^d) 

(^d) v Ct ^d) 

9. Use the letter s to represent the statement “Chris likes
to play soccer,” r for “Chris likes to read,” and p for
“Chris likes to eat pizza.” As before, use these names
along with the basic logic operations to write each of the
following English sentences in symbolic logic notation:

(a) Chris likes pizza but he does not like soccer.

(b) Chris likes to read and eat pizza, or he likes to play
soccer.

(c) Chris does not like to eat pizza, but he likes to play
soccer or read.

(d) Chris likes to do two of these things but not all three.

10. Suppose x and y indicate particular real numbers. Write
conditions that express the following, by using comparisons
(such as x > 0) and the basic operations of logic.
(a) Both x and y are positive.
(b) At least one of x and y is positive.
(c) Exactly one of x and y is positive.
(d) Neither x nor y is positive.

11. Complete the following truth tables for the given compound
expressions:





p al|-»|-»va | »^t-»vo 
T 
@T F 
F T 
F F 


“Pya | EYDACPYD 




















p a | avr | prav» 
TOT T 
TOT F 
TOF T 
QT F F 
FOT T 
FOTO OF 
FF T 
FOFOF 
p_a rj pag | eror 
T T T 
T T F 
T Fo T 
eT F F 
POT OT 
F T F 
FE FEOT 
F F F 


12. Rewrite each of the following statements in propositional
logic notation, making the meaning of your
propositional variables clear, and then create a truth
table for the sentence. The first one is done for you as
an example.

(ex) Either the food is terrific, or everyone is tired and
hungry at mealtimes. Let f represent “the food is
terrific,” t represent “everyone is tired,” and h represent
“everyone is hungry.” Then the given statement
can be written as f ∨ (t ∧ h), and it has the
following truth table:

f  t  h | t ∧ h | f ∨ (t ∧ h)
T  T  T |   T   |      T
T  T  F |   F   |      T
T  F  T |   F   |      T
T  F  F |   F   |      T
F  T  T |   T   |      T
F  T  F |   F   |      F
F  F  T |   F   |      F
F  F  F |   F   |      F


(a) Either everyone is not hungry at mealtime, or everyone
is tired and the snackbar makes a profit.

(b) The staff is friendly, or else they are not friendly but
very well paid.

(c) The staff is not very well paid, and they are friendly.

(d) You play sports or you play mahjong or nobody
knows your name.

(e) You do not play mahjong or you do not play sports,
and nobody knows your name.

13. For each of the statements in Exercise 12, obtain a simple
symbolic logic expression for the negation of that
statement, and then rewrite this as a proper English
sentence.

14. Use the number variable b to represent Trina's board
exam math score and m to represent Trina's math placement
test score. To take calculus the first semester, a
student must have a board exam math score of at least
600 or a math placement test score of at least 25. Use the
variables b and m to express the statement “Trina may
take calculus in her first semester.” Use the variables
b and m to express the negation of this statement as
well.

15. Use the number variable a to represent Fred's age and
v to represent the number of years Fred has worked
at his job. To be eligible for the company's retirement
plan, an employee must be at least 62 years of age and
have worked for at least 15 years. Use the variables a
and v to express the statement “Fred is eligible for the
company's retirement plan.” Use the variables a and v
to express the negation of this statement as well.


16. Use truth tables to verify that ¬(p ∧ q) is logically
equivalent to ¬p ∨ ¬q, establishing the second of
DeMorgan's laws in Theorem 2.

17. Use truth tables to verify that p ∨ (p ∧ q) is logically
equivalent to p, establishing the second absorption
property in Theorem 2.

18. Use truth tables to verify that p ∧ (q ∨ r) is logically
equivalent to (p ∧ q) ∨ (p ∧ r), establishing the first distributive
property in Theorem 2.

19. Use truth tables to verify the first versions of the commutative
and associative properties from Theorem 2.

20. Use truth tables to verify the first versions of the identity,
idempotent, and universal bound properties from
Theorem 2.

21. Use truth tables to check if each of the given pairs of
symbolic logic statements are equivalent.

(a) PACPYa)and p^(pvq) 

b) (pv q) ^ Cp v d) and 

(©) Ca^ p) v Cp Ag) and pv oq 

(d) p ∧ (q ∨ r) and (p ∧ q) ∨ r

(e) (p ∨ q) ∧ (q ∨ r) and (p ∧ r) ∨ q

22. For each of the following statements, rewrite them in
propositional logic notation, making the meaning of
your propositional variables clear. Use truth tables to
find any pairs of logically equivalent statements.

(a) Jillian likes playing in the sand or volleyball, but she
does not like sailing.

(b) Jillian likes playing in the sand, and she likes sailing
or volleyball.

(c) Jillian likes playing in the sand and volleyball, or
she likes sailing.

(d) Jillian likes playing in the sand and sailing, or she
likes volleyball and sailing.

23. Use the double negative property and DeMorgan's laws
to rewrite each of the following as an equivalent statement
that never has the not symbol (¬) outside of a
parenthesized expression. The first one is done for you
as an example.

(ex) ¬(p ∧ ¬q) is equivalent to ¬p ∨ ¬(¬q) by
DeMorgan's law, and this is equivalent to ¬p ∨ q
by the double negative property.
(a) cp v ^9) 
t) AP) 
© “CP Aa) 
@) —(p^(a v ^p) 

24. By quoting the parts of Theorem 2, verify the following
logical equivalences. In each case, start from the left
side and use parts of the theorem to change the problem,
ending up with the right side. (See Example 15 and
the solution to Practice Problem 9.)
(a) (p^ v pup
(b) (pv) ^(pvo)sp
(c) q ∧ (p ∨ r) ≡ (p ∧ q) ∨ (q ∧ r)

25. By using parts of Theorem 2 as algebraic rules, simplify
the expression (p ∨ q) ∧ ¬(¬p ∧ r) as much as you can.


1.4 Predicates 





In Section 1.3, we worked with propositions such as “Sue is an English major” and
“Person A is lying.” In each case, the subject is a particular person or other entity.
However, in many circumstances the subject is not a particular fixed entity. For
example, consider the question “How many students in my discrete math class are
English majors?” To answer the question, you would check the truth value of the
sentence “____ is an English major,” filling in the blank with the name of each
person in the class in turn.

In mathematics, we call a statement of this form a predicate. One way to relate
to the concept is to think of the meaning of the word predicate in an English class. In
the English sentence “Mr. Morton wrote Pearl a poem,” the predicate is “wrote Pearl
a poem,” and it tells what Mr. Morton did.* Since predicates do not have subjects,
they are not complete sentences. However, with the addition of any subject, you will
get some sentence. For example, the predicate “____ is an English major” can
become the propositions “Tom Myers is an English major” or “Mary Johnson is an
English major” by adding the appropriate subjects.

In mathematics, it is customary to use a variable rather than an empty blank to
represent the missing subject. If we use the variable s (short for student), the sentence
above would become “s is an English major.” Each time we replace s by the name
of a student in the class, we get a statement that is either true or false. Sentences of
this form, containing one or more variables, are discussed in this section.


Simple Predicates and Their Negations 


‘The preceding discussion leads us to our formal definition, 





Definition A predicate P(x) is a statement that incorporates a variable
x, such that whenever x is replaced by a value, the resulting proposition is
unambiguously true or false. If the predicate uses several variables x1, x2, ...,
then we will extend the notation to P(x1, x2, ...) accordingly.

















Example 1 If P(n) is the predicate “n is even,” write the proposition that
results when the variable n is replaced by each of the values 2, 17, and 240. Which of
the resulting statements are true?


* Those of you who are of a certain age will recognize this character from Saturday morning's Schoolhouse
Rock.


SOLUTION We will write P(2) to denote the proposition resulting from replacing
n with 2 in the predicate P(n). In this case, P(2) is the true proposition “2
is even.” Similarly, P(17) denotes replacing n by 17, giving us the false statement
“17 is even.” Finally, P(240) is the true statement “240 is even.”
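
For readers who like to experiment, the substitution in Example 1 can be sketched in a few lines of Python. This is only an illustration, not part of the text's formal development: the function name P and the dictionary name results are our own choices.

```python
def P(n: int) -> bool:
    """Predicate P(n): "n is even"."""
    return n % 2 == 0

# Substituting a value for n turns the predicate into a proposition,
# which is either true or false.
results = {n: P(n) for n in (2, 17, 240)}
print(results)
```

A predicate behaves like a function that returns a truth value. It has no truth value of its own until a value is supplied for its variable.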


Practice Problem 1 In each of the predicates below, replace x with each of the
values 2, 23, −5, and 15. Decide if each resulting proposition is true or false.

(a) P(x) is the predicate “x > 15.”
(b) Q(x) is the predicate “x ≤ 15.”
(c) R(x) is the predicate “(x > 5) ∧ (x < 20).”





Most of what you have learned about propositions has an obvious analogue for
predicates. For example, we can combine predicates using the operation symbols
∧ (and), ∨ (or), ¬ (not) to create new predicates. We can talk about the negation
of a predicate, and we can make sense of the notion of equivalent predicates. For
example, here are two equivalent ways to express the negation of the predicate
“x < 0”:

• ¬(x < 0): in English, “x is not less than 0,”
• x ≥ 0: in English, “x is greater than or equal to 0.”

These statements are equivalent because, no matter what number is substituted for
the variable x, the resulting propositions have the same truth value. In addition, the
“double negative” property and DeMorgan's laws apply to predicates just as they do
to propositions.


Example 2 Table 1-9 shows some predicates and their negations. The final
example shows a predicate with more than one variable.

Predicate | Negation
x = 5 | x ≠ 5
(x > 0) ∧ (x < 10) | (x ≤ 0) ∨ (x ≥ 10)
¬(x = 8) | x = 8
(x ≥ 0) ∨ (y ≥ 0) | (x < 0) ∧ (y < 0)

Table 1-9 Table for Example 2

In our next example, we use a simple notation for describing a collection of
numbers, written as what we call a set. We will study sets in detail in Chapter 3. For
now, all you need to know is that our sets are collections of objects, and that the
objects in that collection are called elements of the set or members of the set.

Example 3 Consider the set D = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10}. For each predicate
below, make a list of the members of the set D that make the statement true. Also,
form the negation of the predicate and list the members of D that make the negation
true.

1. x ≥ 8
2. (x > 5) ∧ (x is even)
3. x ≥ x²
4. x ≥ 0
5. x > x²


SOLUTION We present the solution in Table 1-10. The negation is true for
precisely those numbers in the set D for which the original statement is not
true.
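
The bookkeeping in this kind of problem can be checked mechanically. The following Python sketch is an informal aside: it uses the second predicate of Example 3, and the names p, not_p, true_for, and negation_true_for are invented for illustration. The negation is written in its DeMorgan form.

```python
D = range(1, 11)  # the set D = {1, 2, ..., 10}

def p(x: int) -> bool:
    # (x > 5) ∧ (x is even)
    return x > 5 and x % 2 == 0

def not_p(x: int) -> bool:
    # Negation by DeMorgan's law: (x ≤ 5) ∨ (x is odd)
    return x <= 5 or x % 2 != 0

true_for = [x for x in D if p(x)]
negation_true_for = [x for x in D if not_p(x)]
print(true_for, negation_true_for)
```

Every member of D lands in exactly one of the two lists, which is what it means for not_p to be the negation of p.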


Truth and Quantifiers 


Unlike propositions, we almost never talk about predicates being simply true or
false. What is more important is the truth or falsity of the proposition we get after
substituting a value for the variable in a predicate. Hence, our discussion of the
“truth” of predicates must include the set that these values come from. We will call
this set the domain of the predicate, and we will never discuss the truth of predicates
without making the domain clear.

Given a predicate and domain, the natural question to ask is, “Does the predicate
become a true proposition when its variable is replaced by members of the domain?”
The three most common answers to this question are always, sometimes, or never.





1. In Example 3, the answer is “always” for the fourth predicate but not for the others.
When the answer is “always,” we might use phrases such as “for all members of
the set, the statement is true,” or “for every element of the set, the statement is
true” in describing the situation.

2. In Example 3, the answer is “sometimes” for all but the fifth predicate. When we
answer “sometimes” to this question, we are likely to use phrases similar to “there
is an element of the set that makes the statement true” or “there exists a member
of the set such that the predicate is satisfied.”

3. In Example 3, the answer is “never” for the fifth predicate. This is not really
necessary to treat separately since “never” is simply the negation of “sometimes.”
We will address this case in the exercises at the end of this section.


   | True for... | Negation | True for...
1. | 8, 9, 10 | x < 8 | 1, 2, 3, 4, 5, 6, 7
2. | 6, 8, 10 | (x ≤ 5) ∨ (x is odd) | 1, 2, 3, 4, 5, 7, 9
3. | 1 | x < x² | 2, 3, 4, 5, 6, 7, 8, 9, 10
4. | All the members | x < 0 | None of the members
5. | None of the members | x ≤ x² | All the members

Table 1-10 Table for Example 3

Example 4

1. “For every k that is a member of the set {1, 2, 3, 4, 5}, it is true that k < 20.”

2. “There exists a member m of the set {−1, 0, 1} with the property that m² = m.”
Notice that “there exists” always means “there exists (at least one).”

Explore more on the Web.

Practice Problem 2 In Table 1-11, list the members of the set D = {−1, 0, 1, 2}
that make each predicate true. In addition, answer the two questions: (1) Is the predicate
true for at least one member of the set? and (2) Is the predicate true for all the members
of the set? When the answer to either question is “yes,” write the answer as a complete
sentence using phrases similar to “there exists” or “for all.”

Predicate | True for These Members of D... | True for At Least One? | True for All?
x < 0 | | |
x > −3 | | |
x² < x | | |
x² ≥ x | | |

Table 1-11 Table for Practice Problem 2


The phrases “for all,” “there exists,” “in the set,” and so on are used so often
that special notation has been developed to make them shorter to write.





Definition

1. The symbol ∈ indicates membership in a set. For example, “k ∈ D” means
that k is a member of the set D.

2. The symbol ∀ means “for all” or “for every.” Notice that this is an upside-down
letter A, and that the letter A is the first letter in the word “all.”

3. The symbol ∃ means “there is (at least one)” or “there exists (at least one).”
Notice that this is a backwards letter E, and that the letter E is the first letter
in the word “exists.”

4. The symbols ∀ and ∃ (and the corresponding phrases such as “for all” and
“there exists”) are called quantifiers. When we use quantifiers with a predicate,
we refer to the resulting statement as a quantified predicate.

5. A counterexample is an example illustrating that a “for all” statement is
false.








A quantified predicate is unambiguously either true or false. If the domain is
small, we can usually determine whether the sentence is true or false by simply
substituting each domain element for the variable(s) in the predicate. For larger
domains, including infinite sets, it can be more difficult to determine.
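
For a small finite domain, the substitution check just described is easy to automate. The sketch below is an illustration only. Python's built-in all and any stand in for ∀ and ∃, the two sets come from Example 4, and the statement “k < 4” is a made-up example added to show a counterexample search.

```python
D = [1, 2, 3, 4, 5]

# ∀k ∈ D, k < 20: all() checks every member of the domain.
for_all_small = all(k < 20 for k in D)

# ∀k ∈ D, k < 4 (hypothetical statement): false, so list the counterexamples.
for_all_lt_4 = all(k < 4 for k in D)
counterexamples = [k for k in D if not k < 4]

# ∃m ∈ {−1, 0, 1}, m² = m: any() needs only one member that works.
exists_fixed_square = any(m * m == m for m in [-1, 0, 1])

print(for_all_small, for_all_lt_4, counterexamples, exists_fixed_square)
```

This approach works only because the domains are finite. For an infinite domain such as the integers, no amount of substitution can establish a “for all” statement, although a single counterexample still refutes one.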


Example 5 Let D = {3, 4, 5, 10, 20, 25}. Express each sentence using the special
symbols ∈, ∀, and ∃, and decide whether or not it is true.

1. For every n that is a member of D, n < 20.
2. For all n in the set D, n < 5 or n is a multiple of 5.
3. There is (at least one) k in the set D with the property that k² is also in the set D.
4. There exists m a member of the set D such that m > 3.


SOLUTION

1. ∀n ∈ D, n < 20. This statement is false, since 20 and 25 are counterexamples.

2. ∀n ∈ D, n < 5 or n is a multiple of 5. This statement is true.

3. ∃k ∈ D such that k² ∈ D. This statement is true, since 5² ∈ D. Sometimes we
replace the words “such that” with a comma, as in ∃k ∈ D, k² ∈ D. When
this is read aloud, words similar to “such that” or “with the property that”
are usually added to make it read more like ordinary English.

4. ∃m ∈ D such that m > 3, or more simply, just ∃m ∈ D, m > 3. This statement
is true.

Practice Problem 3 Translate each symbolic quantified predicate into an English-language
sentence about the set D = {−2, −1, 0, 1, 2}, and then decide whether or not
it is true.

(a) ∀d ∈ D, d > −2
(b) ∃d ∈ D, d > −2
(c) ∀n ∈ D, (n > −3) ∧ (n < 3)
(d) ∀x ∈ D, x² ≤ 4
(e) ∃m ∈ D, m > 10


When we write a quantified predicate, there is always a set D, called the domain,
that we have in mind. If you are being informal, you might not specify the set, or you
might specify the set without using the set notation. A useful practice, when faced
with a quantified statement written in this less formal style, is to identify the set and
rewrite the statement in terms of that set.

The clear specification of the domain is very important in determining whether
a quantified statement is true. The next example illustrates this issue as well as the
practical skill of making informal quantified statements more formal.


Example 6 For each statement, identify the domain D and rewrite
the predicate formally in terms of that set D. If the domain is ambiguous, give examples
of how different domains can change the truth value of the statement.

1. For all x, x² ≥ x.
2. ∀ even integer m, m ends in the digit 0, 2, 4, 6, or 8.
3. There is an integer n whose square root is also an integer.
4. Every integer larger than 0 has a square that is larger than 0.


SOLUTION 


1. There are several possibilities. One possibility is that the person writing the
predicate was intending to describe a property of the set of all real numbers,
frequently written as R. If we use this notation for the domain, we can write,
“∀x ∈ R, x² ≥ x.” This statement is false since x = 0.5 is a counterexample.
If instead we take the domain to be the set of integers (commonly written
Z after the German word for numbers, Zahlen), then we have the true
statement, “∀x ∈ Z, x² ≥ x.”

2. Here D is the set of even integers, and we write
∀m ∈ D, m ends in the digit 0, 2, 4, 6, or 8.

3. Using Z once again to indicate the set of all integers, we can write, “∃k ∈
Z, √k ∈ Z.”

4. Here the domain D is the set of positive integers, and we write, “∀n ∈ D, n² >
0.”
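
The effect of the domain on the first statement can be explored numerically. The sketch below is illustrative only. It tests x² ≥ x on two finite samples, which we chose ourselves: the integers from −50 to 50, and the multiples of 0.1 in the same range as a stand-in for the real numbers.

```python
integers = range(-50, 51)                  # finite sample of Z (an assumption)
reals = [k / 10 for k in range(-500, 501)] # finite sample of R (an assumption)

# No counterexample appears among the sampled integers.
holds_on_integers = all(x * x >= x for x in integers)

# Among the sampled reals, the statement fails for some values.
real_counterexamples = [x for x in reals if not x * x >= x]

print(holds_on_integers)
print(real_counterexamples)
```

The sample of reals turns up counterexamples strictly between 0 and 1, including x = 0.5. The integer check only fails to find a counterexample, which is evidence for the statement over Z but not a proof, since Z is infinite.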








Negating Quantified Statements 


In Practice Problem 3, two of the statements are false. As before, when a statement
is false, the negation of that statement is true. Looking more closely at these two
statements in this Practice Problem will help us determine the formal meaning of
“negation” for a quantified statement.


Example 7 For the set D = {−2, −1, 0, 1, 2}, explain why each of these quantified
predicates is false. Use your explanation to write the negation of the statement,
first in English and then with more formal notation.

1. ∀d ∈ D, d > −2
2. ∃m ∈ D, m > 10


SOLUTION 


1. If we write P(d) for the predicate “d > −2,” then P(−1), P(0), P(1), and P(2)
are all true. However, P(−2) is false, so we cannot say that all the elements of
D make P(d) true. That is, there exists (at least one) element d of D for which
P(d) is false. This negation can be formally written as “∃d ∈ D, ¬P(d)” or
“∃d ∈ D, d ≤ −2.”

2. If we write Q(m) for the predicate “m > 10,” then Q(−2), Q(−1), Q(0),
Q(1), and Q(2) are all false, so we cannot find an element of D that makes
Q(m) true. That is, for all elements m of D, Q(m) is false. This negation can
be formally written as “∀m ∈ D, ¬Q(m)” or “∀m ∈ D, m ≤ 10.”


In words, we can say that the opposite of “true for all” is “false for at least one.” 
Likewise, the opposite of “true for at least one” is “false for all.” Using the notation 
of this section, we can summarize the relationship between a quantified statement 
and its negation in the following proposition. 


Proposition 1 For any predicates P and Q over a domain D,

• The negation of ∀x ∈ D, P(x) is ∃x ∈ D, ¬P(x).
• The negation of ∃x ∈ D, Q(x) is ∀x ∈ D, ¬Q(x).






Of course, when we negate the predicates themselves, we use the same ideas we
learned about earlier. In particular, there are two important points to remember:

1. Techniques such as the double negative property and DeMorgan's laws apply to
predicates.

2. Particular care must be taken when negating comparisons. For example, the negation
of a comparison involving < will involve ≥.


Practice Problem 4 Let D = {−1, 0, 1, 2}. For each statement, write the negation,
and then decide which is true, the original statement or the negation.

(a) ∀x ∈ D, (x ≤ 0) ∨ (x ≥ 2)
(b) ∃x ∈ D, (x < 0) ∧ (x² > 0)
(c) For all x ∈ D, x² < x.
(d) There exists x ∈ D such that x² < x.





Multiple Quantifiers and Their Negations 


We have said that a predicate can have more than one variable. For example, consider
the predicate P(x, y) given as x + 2y = 3, where x and y stand for integers. That is,
x and y are to be taken from the set Z = {0, ±1, ±2, ±3, ...}. If we wish to substitute
particular values (say, 10 and 4) for the variables, we use the notation P(10, 4). This
notation indicates that we replace x by 10 and y by 4 in the predicate, obtaining the
(false) proposition 10 + 2 · 4 = 3. As usual, when we use two variables, we do not
imply that the variables must indicate different quantities, only that they can indicate
different quantities. For example, it is perfectly correct to write P(−5, −5) to indicate
the statement (−5) + 2 · (−5) = 3.

Example 8 Let P(x, y) be the predicate x · y = 36.

1. Identify which of the following statements are true: P(3, 4), P(−9, −4), P(12, −1).
2. Assuming the domain is the set of integers, find other values for x and y that make
the statement P(x, y) true.


SOLUTION 


1. The statements are 3 · 4 = 36, (−9) · (−4) = 36, and 12 · (−1) = 36. Only the
second statement is true; the first and third are false.

2. Here are some additional true statements: P(36, 1), P(6, 6), and P(−2, −18).
There are several more that are true. Notice that if the variables x and y are
taken from the domain of all real numbers rather than all integers, then there
are an infinite number of choices for x and y.
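
To see exactly how many integer pairs make P(x, y) true, one can search a bounded range. The sketch below is an illustration; the search window −36 to 36 is our choice, and it is enough because any integer solution of x · y = 36 must have x and y dividing 36.

```python
def P(x: int, y: int) -> bool:
    """Predicate P(x, y): x · y = 36."""
    return x * y == 36

# Every integer solution has |x| ≤ 36 and |y| ≤ 36, so this window finds them all.
window = range(-36, 37)
pairs = [(x, y) for x in window for y in window if P(x, y)]

print(len(pairs))
print(pairs)
```

The search confirms the three pairs listed in the solution and shows that the integer solutions form a finite list, in contrast to the real-number case.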


When a predicate contains more than one variable, the question of quantifiers 
becomes more complicated since each variable can be quantified separately. Fortu- 
nately, some situations are relatively simple. If all the quantifiers are the same, there 
is no difficulty in making the proper interpretation. 


Example 9

1. Consider the quantified statement “There exist integers x and y such that x · y = 36.”
Write this using symbolic logic notation, and decide if it is true or not.

2. Do the same for the quantified statement “For all integers x and y, it is true that
x · y = 36.”


SOLUTION 


1. We can write this in two equivalent ways:
• ∃x ∈ Z, ∃y ∈ Z, x · y = 36
• ∃x, y ∈ Z, x · y = 36
The latter is a shortcut that means the same as the former. In either case, we
can use P(x, y) to indicate the predicate x · y = 36, and write ∃x ∈ Z, ∃y ∈
Z, P(x, y), or ∃x, y ∈ Z, P(x, y). The quantified predicate is true since, for
example, P(−2, −18) is true.

2. Similarly, we can write the “for all” quantifiers in two equivalent ways:
• ∀x ∈ Z, ∀y ∈ Z, x · y = 36
• ∀x, y ∈ Z, x · y = 36
Since P(10, 2) is the false statement “10 · 2 = 36,” we see that the quantified
predicate is false, and we say that x = 10, y = 2 is a counterexample to the
statement ∀x, y ∈ Z, x · y = 36.


Practice Problem 5 Write the following as quantified statements using the symbols
∃ and ∀, and decide whether the statement is true:

(a) There are odd integers m and n whose product is 35.
(b) There are even integers m and n whose product is 35.
(c) For every choice of integers s and t, it is true that s² + t² ≥ 0.
(d) For every choice of real numbers x, y, and z, x + y + z > 1.


The example and practice problem illustrate two important points. First, the
domain of the variables is very important. If the domain D is the set of odd integers,
the quantified statement ∃m, n ∈ D, m · n = 35 is true. However, the same statement
is false if the domain D is the set of even integers. Second, there are some situations
where establishing the truth of a quantified predicate is easy, and others where it
takes more work.

A more difficult situation occurs when a quantified predicate contains both ∃ and
∀ quantifiers in the same statement. Even with only two variables, there are several
different possibilities for the arrangement of the quantifiers and variables. The key
issues can be understood by thinking carefully about the difference between these
two quantified statements:

• ∀x, ∃y, P(x, y)
• ∃y, ∀x, P(x, y)


The first thing to note is that, in mathematics, we always interpret these from left
to right, while ordinary English is often ambiguous. For example, “For every problem 
there is a solution,” and “There is a solution for every problem,” are generally taken 
to mean the same thing. The listener forms the meaning from the context and, to 
a certain extent, from the speaker's intonations. In a mathematical setting, we are 
always more precise in our speaking, writing, and interpretation. 


Example 10 Let Z indicate the set of all integers. Which of the following
quantified statements are true? Explain.

1. ∀x ∈ Z, ∃y ∈ Z, x + 2y = 3
2. ∀x ∈ Z, ∃y ∈ Z, x + y = 15
3. ∃y ∈ Z, ∀x ∈ Z, x + y = 15





SOLUTION 


1. For every possible value of x, you must be able to find a corresponding value
for y that makes the predicate “x + 2y = 3” true. To see if this can be done,
think of it as a game: Your opponent chooses x, then you try to find a y that
makes P(x, y) true.

(a) The opponent chooses x = 1, and you choose y = 1, making P(x, y) the
true statement 1 + 2 · 1 = 3.

(b) The opponent chooses x = −3, and you choose y = 3, making P(x, y)
the true statement −3 + 2 · 3 = 3.

(c) The opponent chooses x = 1,337, and you choose y = −667, making
P(x, y) the true statement 1,337 + 2 · (−667) = 3.

(d) The opponent chooses x = 0, and you give up. By algebra you can see
that you would have to choose y = 3/2, but this is not a member of the set
Z.

The quantified statement is false, and x = 0 is a counterexample. If x has the
value 0, no possible choice for y can make “x + 2y = 3” true.

2. Again, think of this as a game. Your opponent chooses a value for x, and
you must find a corresponding value for y. If you play the game with an
opponent, you may discover a pattern for how you can choose your value
for y, once your opponent has chosen his or her value for x. The strategy is
“Always choose y = 15 − x.” This quantified predicate is a true statement.

3. To the casual reader, this may appear the same as the previous example.
However, because we read left to right, the rules of the game have changed
in a subtle but crucial way. This time, you must go first, so you must try to
find a y value that will work no matter how your opponent chooses his or her
value for x. Clearly, this is impossible to do: no matter what y you choose,
your opponent will have an infinite number of x values to choose from that
make the predicate x + y = 15 false. For example, if you choose y = 73, your
opponent only has to make the predicate x + 73 = 15 false, and that is easy
to do. The quantified predicate is false.
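
The difference between the two orders of quantifiers shows up clearly in a brute-force check. Because Z is infinite, the sketch below substitutes a small finite domain, {0, 1, ..., 15}, which is purely our assumption for illustration. Nested all and any calls mirror the left-to-right reading of the quantifiers.

```python
D = range(0, 16)  # finite stand-in for Z (an assumption): {0, 1, ..., 15}

# ∀x ∈ D, ∃y ∈ D, x + y = 15: the opponent picks x first, then we answer with y.
forall_exists = all(any(x + y == 15 for y in D) for x in D)

# ∃y ∈ D, ∀x ∈ D, x + y = 15: we must commit to a single y before x is chosen.
exists_forall = any(all(x + y == 15 for x in D) for y in D)

# The strategy "always choose y = 15 − x" from part 2 of the solution.
strategy = {x: 15 - x for x in D}

print(forall_exists, exists_forall)
```

The outer function call corresponds to the player who moves first. Swapping the nesting changes who moves first, and with it the truth value.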




















Practice Problem 6 Let Z indicate the set of all integers and R the set of real
numbers. Which of the following quantified predicates are true? Explain.

(a) ∀y ∈ Z, ∃x ∈ Z, x + 2y = 3
(b) ∀x ∈ R, ∃y ∈ R, x · y = 1
(c) ∃x ∈ Z, ∀y ∈ Z, x · y = x
(d) ∃x ∈ Z, ∀y ∈ Z, x · y = y





We now turn to the question of negating predicates that have multiple quantifiers.
Actually, we already know how to do this, if we just keep in mind that we read
the statements left to right. Recall from Proposition 1 that

The negation of ∀x ∈ D, P(x) is ∃x ∈ D, ¬P(x).
The negation of ∃x ∈ D, Q(x) is ∀x ∈ D, ¬Q(x).

When there is more than one quantifier, we apply this same process to each
quantifier in turn, proceeding from left to right. The following example and practice
problem illustrate the process.


Example 11 Write the negation of each of these statements, simplified so as
not to require the ¬ symbol to the left of any quantifier.

1. ∀x ∈ Z, ∃y ∈ Z, x + 2y = 3
2. ∃x > 0, ∀y > 0, x · y < x
3. ∃x ∈ Z, ∃y ∈ Z, x + y = 13 and x · y = 36








SOLUTION In each part, we proceed in several steps. First we simply
put a negation symbol (¬) in front of the entire statement. Then we apply
the negation process to each quantifier in turn and finally to the predicate
itself.

1. ¬(∀x ∈ Z, ∃y ∈ Z, x + 2y = 3)      negation of original statement
   ∃x ∈ Z, ¬(∃y ∈ Z, x + 2y = 3)      by Proposition 1
   ∃x ∈ Z, ∀y ∈ Z, ¬(x + 2y = 3)      by Proposition 1
   ∃x ∈ Z, ∀y ∈ Z, x + 2y ≠ 3         equivalent form of “not equal”

2. ¬(∃x > 0, ∀y > 0, x · y < x)       negation of original statement
   ∀x > 0, ¬(∀y > 0, x · y < x)       by Proposition 1
   ∀x > 0, ∃y > 0, ¬(x · y < x)       by Proposition 1
   ∀x > 0, ∃y > 0, x · y ≥ x          equivalent form of “not less than”

3. ¬(∃x ∈ Z, ∃y ∈ Z, (x + y = 13) ∧ (x · y = 36))        negation of original statement
   ∀x ∈ Z, ¬(∃y ∈ Z, (x + y = 13) ∧ (x · y = 36))        by Proposition 1
   ∀x ∈ Z, ∀y ∈ Z, ¬((x + y = 13) ∧ (x · y = 36))        by Proposition 1
   ∀x ∈ Z, ∀y ∈ Z, (¬(x + y = 13)) ∨ (¬(x · y = 36))     by DeMorgan's laws
   ∀x ∈ Z, ∀y ∈ Z, (x + y ≠ 13) ∨ (x · y ≠ 36)           equivalent form of “not equal”
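
As a sanity check on the third derivation, the original statement and its simplified negation should always have opposite truth values. The Python sketch below tests this on a finite window of integers, −50 to 50, which is our own choice. The window is large enough to contain a witness for the original statement, but the check remains an illustration and not a proof about all of Z.

```python
Z_sample = range(-50, 51)  # finite window of Z (an assumption)

# Original: ∃x, ∃y, (x + y = 13) ∧ (x · y = 36)
original = any(x + y == 13 and x * y == 36 for x in Z_sample for y in Z_sample)

# Simplified negation: ∀x, ∀y, (x + y ≠ 13) ∨ (x · y ≠ 36)
negation = all(x + y != 13 or x * y != 36 for x in Z_sample for y in Z_sample)

# Pairs that make the original predicate true.
witnesses = [(x, y) for x in Z_sample for y in Z_sample
             if x + y == 13 and x * y == 36]

print(original, negation, witnesses)
```

The witnesses show why the original statement is the true one of the pair: 4 and 9 have sum 13 and product 36.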


Practice Problem 7 Negate each quantified predicate. Which is true, the original
statement or its negation? The domain for the first statement is R, the set of real
numbers.

(a) ∀x > 0, ∃y ∈ R, (y > x) ∧ (x + y = 2x)
(b) ∃x ∈ Z, ∀y ∈ Z, x · y < 0
(c) ∀x, y, z ∈ Z, x² + y² + z² ≥ 0


Solutions to Practice Problems 


1 (a) When x is replaced by 2, the statement becomes
“2 > 15,” a false statement. When x is replaced by
23, it becomes “23 > 15,” which is true. Similarly,
“−5 > 15” and “15 > 15” are both false. Thus, only
P(23) is true.

(b) We see that Q(2), Q(−5), and Q(15) are true. Observe
that “x ≤ 15” is true for precisely the values
of x for which the predicate “x > 15” is false.

(c) For the listed numbers, only x = 15 makes this predicate
true. (Only R(15) is true.)

2 The completed table is given as Table 1-12. Here are
some sentences that express the true cases shown in this
table.
• There exists an element x of D such that x < 0.
• There is an element x in D with the property that x > −3.
• There is a number in the set D whose square is bigger than or equal to itself.
• For every element x of the set D, it is true that x > −3.
• For all x in D, x² ≥ x.

3 (a) For every d in the set D, d > −2. This is false since
−2 is a counterexample.

(b) There is an element d in D such that d > −2. This is
true.

(c) For every element n of the set D, n > −3 and n < 3.
This is true.

(d) For all x in D, x² ≤ 4. This is true.

(e) There exists an m in D with the property that m > 10.
This is false.


4 Table 1-13 shows each original statement along with its
negation.

5 (a) ∃m, n odd integers, such that m · n = 35. This is true,
since 5 · 7 = 35.

(b) ∃m, n even integers, such that m · n = 35. This is
false. It is possible to prove (we will learn how in
Chapter 2) that whenever two even integers are multiplied,
the result is even.

(c) ∀s, t ∈ Z, s² + t² ≥ 0. This is true. One can prove this
by showing that when an integer is squared, the result
cannot be negative, and by showing that the sum
of two nonnegative numbers is never negative.

(d) ∀x, y, z ∈ R, x + y + z > 1. This is false. The choice
x = −2, y = 0, z = 1 is a counterexample.

6 (a) This is true. If your opponent chooses a value for y,
the pattern x = 3 − 2y can be used to find a corresponding
value for x.

(b) This is false, since x = 0 is a counterexample; that
is, there is no choice of y that makes 0 · y = 1 true.
However, if we change the domain for x to the set of
nonzero real numbers, the statement would be true.
If it is clear we are talking about real numbers, this
might be written as ∀x ≠ 0, ∃y ∈ R, x · y = 1.

(c) This is true, since we can choose x = 0.

(d) This is also true, since we can choose x = 1.

7 (a) The negation is the true statement ∃x > 0, ∀y ∈
R, (y ≤ x) ∨ (x + y ≠ 2x).

(b) The negation is the true statement ∀x ∈ Z, ∃y ∈
Z, x · y ≥ 0.

(c) The negation is the false statement ∃x, y, z ∈ Z, x² +
y² + z² < 0. The original statement is true.














Predicate | True for These Members of D... | True for at Least One? | True for All?
x < 0 | −1 | T | F
x > −3 | −1, 0, 1, 2 | T | T
x² < x | None | F | F
x² ≥ x | −1, 0, 1, 2 | T | T

Table 1-12 Solution for Practice Problem 2















Statement | Negation | Which Is True?
1. ∀x ∈ D, (x ≤ 0) ∨ (x ≥ 2) | ∃x ∈ D, (x > 0) ∧ (x < 2) | Negation
2. ∃x ∈ D, (x < 0) ∧ (x² > 0) | ∀x ∈ D, (x ≥ 0) ∨ (x² ≤ 0) | Statement
3. For all x ∈ D, x² < x. | ∃x ∈ D such that x² ≥ x | Negation
4. There exists x ∈ D such that x² < x. | ∀x ∈ D, x² ≥ x | Negation

Table 1-13 Table for Practice Problem 4

Exercises for Section 1.4

1. Write each of the following predicates using the simple
predicates x > 0 and y > 0 along with the propositional
connectives ∧, ∨, and ¬:
(a) Both x and y are positive.
(b) At least one of x and y is positive.
(c) Exactly one of x and y is positive.
(d) Neither x nor y is positive.

2. For each of the given values of x and y, determine which
predicates from Exercise 1 become true statements.
(a) If x = 5 and y = 3, which of (a-d) are true?
(b) If x = −5 and y = 0, which of (a-d) are true?
(c) If x = 7 and y = −7, which of (a-d) are true?
(d) If x = 0 and y = 0, which of (a-d) are true?
(e) If x = −3 and y = −10, which of (a-d) are true?
(f) If x = −8 and y = 2, which of (a-d) are true?

3. For each predicate given in the first column of Table
1-14, list the members of the set S = {1, 2, 3, 4, 5, 6, 7, 8,
9} that make the statement true. (Recall that Z is the set
of all integers.)

Predicate | True for These Members of S
x is even |
B RUE. |
m is even and m > 5 |
10 «3^1 «25 |
BeZ |
lojx eZ |

Table 1-14 Table for Exercise 3

4. Which elements of the set D = {2, 4, 6, 8, 10, 12} make
each of the predicates from Exercise 3 true?

5. Which elements of the set D = {2, 4, 6, 8, 10, 12} make
the negation of each of these predicates true?
(a) Q(n) is the predicate, “n > 10.”
(b) R(n) is the predicate, “n is even.”
(c) S(k) is the predicate, “I! < 1.”
(d) T(m) is the predicate, “m − 2 is an element of D.”

6. Based on your answers to Exercise 5, circle the true
statements in the list given below.

∀n ∈ D, Q(n)    ∃n ∈ D, Q(n)    ∀n ∈ D, ¬Q(n)    ∃n ∈ D, ¬Q(n)
∀n ∈ D, R(n)    ∃n ∈ D, R(n)    ∀n ∈ D, ¬R(n)    ∃n ∈ D, ¬R(n)
∀k ∈ D, S(k)    ∃k ∈ D, S(k)    ∀k ∈ D, ¬S(k)    ∃k ∈ D, ¬S(k)
∀m ∈ D, T(m)    ∃m ∈ D, T(m)    ∀m ∈ D, ¬T(m)    ∃m ∈ D, ¬T(m)

List for Exercise 6

7. Let D = {1, 3, 5, 7, 8, 9}. Decide whether each of the following
statements is true for all the elements of D. For
each that is not, give a counterexample. That is, provide
a number in D for which the statement is not true.
(a) x is even and x > 7.
(b) x is odd or x > 7.
(c) x is not odd and x < 7.
(d) x is odd.


8. Write each of the following statements using quantifiers
and predicates. In each case, you must specify the
domain and define the predicates you use.

(a) Every biology major is required to take geometry.

(b) There are computer science majors who do not minor
in mathematics.

(c) There is no math major who is required to take a
business course.

(d) There are puzzles that have no solution.

9. Write each of the following statements using quantifiers
and predicates. In each case, you must specify the
domain and define the predicates you use.

(a) For every integer n, 2n ≠ 9.

(b) There exists a triangle T that is equilateral and has
perimeter 10.

(c) Every circle has an integer diameter or an integer

(d) Every two real numbers has an integer in between.

10. Let B be the set of all biology majors and let G(x) be the
predicate “x is required to take geometry.” Write each
statement below using quantifiers over the domain B
and the predicate G(x), and then match any statements
that are equivalent in meaning.

(a) There is no biology major who is required to take
geometry.

(b) There is a biology major who is not required to take
geometry.

(c) There is no biology major who is not required to
take geometry.

(d) Every biology major is not required to take geometry.

11. The lesson of Exercise 10 is that a statement of the form
“No member of D makes P(x) true” can be formally
written as either “¬∃x ∈ D, P(x)” or “∀x ∈ D, ¬P(x).”
For each of the following English sentences, specify a
domain and a predicate, and write the statement symbolically
using both of these forms.

(a) Friends of Alaina never get tired of playing at the
beach.

(b) No friend of Alaina dislikes doing cartwheels.

(c) No math course is too hard for Jennica.

(d) The meals at the camp are never too bad.

12. For each of the following English sentences, specify a
domain and a predicate, and write the quantified statement
symbolically using either form discussed in Exercise 11.

(a) Even numbers are never prime.
(b) Triangles never have four sides.
(c) There are no integers a and b for which a? /2?


(d) No square number immediately follows a prime
number.

13. Recall that Z denotes the set of all integers.

(a) Consider the statement “∀x ∈ Z, ∃y ∈ Z, x + 2y = 3.”

i. If x is 27, what value can be chosen for y to make
the equation true?

ii. Find two values of x for which it is impossible
to find a corresponding y value that makes the
equation true.

iii. Is there a pattern for the x values that serve as
counterexamples to the given statement?

(b) Consider the statement “∀y ∈ Z, ∃x ∈ Z, x + 2y = 3.”

i. Identify the x value that makes the equation true
for each of the following values of y: 3, −10,
0, 17.

ii. Describe a general strategy for choosing x once
your opponent has chosen the value for y.

14. Write the negation of each of the following statements,
using Proposition 1 and the rules for negating propositions
to simplify each to the point that no ¬ symbol
occurs to the left of a quantifier. (Recall that Z denotes
the set of all integers, and R denotes the set of all real
numbers.)

(a) Va eR, YbeZ,a’+beZ 

() 3yeR Yr e Rie yx 

(© Yr eZ, 3y eR x 2y 

(@ YxeZ,3yeR i2 











. For each statement in Exercise 14, is the original state- 


ment or its negation true? 

Write the negation of cach of the following statements. 

asan English sentence. You might find it helpful to write 

a symbolic expression as an intermediate step. 

(a) Every time you roll a "6," you have to take a card. 

(b) There is a day in your life that is better than every 
other day. 

(©) Inevery good book, there isa plot twist or a surprise 
ending. 

(d) Every math course has a topic that everyone finds 
easy to do. 

Write the negation of each of the following statements. 

as an English sentence: 

(a) For every integer x, there is an integer y that is big- 
ger than x. 

(b) In every set of integers, there is a smallest number. 

(c) For every positive integer x, there is a positive integer
y such that y is smaller than x and y is a factor
of x.







18. For each statement in Exercise 17, is the original statement
or its negation true?

19. Write the negation of each of the following statements
as an English sentence:

(a) For all odd integers b, there is no real number x such
that x² + bx + 15 = 0.

(b) For every two real numbers x and y, there is an
integer n such that x < n < y.

(c) For every pair of integers that sum to 5, at least one
of the numbers must be bigger than 2.

20. For each statement in Exercise 19, is the original statement
or its negation true?


1.5 Implications 





Many statements in mathematics, as well as in ordinary conversation, refer to the 
logical connection between two simpler statements rather than to the actual truth of 
either one. For example, 


(i) If Bob has an 8:00 class today, then it is a Tuesday. 

(ii) If it is raining, then the street is wet. 

(iii) If you are a computer science major, then you must take discrete mathematics.

(iv) If a real number x satisfies x² > 4, then x > 2.





Definition 

1. A statement of the form “if p is true, then q is true” is called an implication. 

2. We write an implication as p → q, which is read, “p implies q.” The →
operator is taken as having lower precedence than ∧, ∨, and ¬. For example,
the proposition ¬p ∧ q → r means ((¬p) ∧ q) → r.

3. In the statement “if p, then q,” we call p the hypothesis and q the conclusion.











In this definition, p and q can indicate either propositions or predicates. Of our 
four examples, the first two involve propositions and the last two involve predicates. 

When an implication involves predicates, there might be some parts of the sen- 
tence that are left unsaid. For example, when the speaker says, “If you are a computer 
science major, then you must take discrete mathematics,” she no doubt has in mind 
all the students at her particular school. So even though she didn’t say so, there is 
an unspoken domain the speaker has in mind, and there is an unspoken quantifier. 
Here are more precise versions of examples (iii) and (iv) that make the domain and 
the quantifier more explicit: 





(iii') For all students s at this school, if s is a computer science major, then s must
take discrete mathematics.

(iv') For all real numbers x, if x² > 4, then x > 2.


Example 1 In each of the following implicational statements, identify the
domain D and the predicates P(x) and Q(x) so that the implication is of the form “For
all x ∈ D, if P(x), then Q(x).” (You may use variables other than x if you wish.)




1. If a triangle has three equal sides, then it has three equal angles. 

2. If an integer ends with a “2,” then it is a multiple* of 2.

3. If a quadrilateral can be inscribed in a circle, then the opposite angles in the
quadrilateral sum to 180°.


SOLUTION 

1. D is the set of all triangles, P(t) is “t has three equal sides,” and Q(t) is “t
has three equal angles.”

2. D is the set of integers, P(n) is “n ends with 2,” and Q(n) is “n is a multiple
of 2.”

3. D is the set of quadrilaterals, P(d) is “d can be inscribed in a circle,” and
Q(d) is “d has opposite angles summing to 180°.”







Practice Problem 1 As in the preceding example, identify the domain D and the
predicates P(x) and Q(x) so that the implication is of the form “For all x ∈ D, if P(x),
then Q(x).”

(a) If a real number x has a real square root, then x is not negative.
(b) If a real number x satisfies x² − x = 6, then x = 3.
(c) If an integer n is even, then 2^n − 1 is a multiple of 3.
(d) If an integer n ends with a 3, then n is a multiple of 3.





The Logic of Implications 


To help us understand the logic of implicational statements, we begin with a concrete 
example. 


Example 2† Trooper Jones walks into the Goldilocks Pub and sees four
Boatsville College students (Al, Betty, Cindy, and Dan) enjoying various beverages.
She asks the bartender, “Is anyone breaking the drinking law?” The bartender replies,
“Everyone in here is obeying the law.”

In front of each person, there is a card which has the person's age on one side and
what he or she is drinking on the other side. Trooper Jones sees that the face-up sides
of the cards look like Figure 1-10.

The drinking age law states in effect

If you are drinking beer, then you are at least 21 years of age. (1.1)

Al | Betty | Cindy | Dan
19 | Coke | Beer | 25

Figure 1-10 Age-drink cards.


* The integers resulting from the product of 2 and an integer are the multiples of 2.
† This example is based on a psychology experiment carried out by Griggs and Cox [32] in 1982.




1. Identify a set D and predicates P(x) and Q(x) so that the bartender's statement is
of the form “For all x ∈ D, if P(x), then Q(x).”

2. Whose cards does Trooper Jones need to turn over to check that everyone is obeying 
the law? 


SOLUTION 


1. D is the set of four Boatsville College students currently in the bar, P(x) is 
the predicate “x is drinking beer,” and Q (x) is the predicate “x is at least 21 
years of age.” 

2. She should turn over the cards for Al and Cindy. She is looking for a coun- 
terexample to the bartender's claim that everyone is obeying the law, and 
these are the only two students who could possibly be counterexamples. 
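
The trooper's reasoning can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the dictionary `cards` and the helper `must_flip` are hypothetical names, and each card is modeled by its face-up side alone.

```python
# Face-up side of each card: either an age (int) or a drink (str).
cards = {"Al": 19, "Betty": "Coke", "Cindy": "Beer", "Dan": 25}

def must_flip(face):
    """Return True if the hidden side could reveal a counterexample to
    'if you are drinking beer, then you are at least 21 years of age'."""
    if isinstance(face, int):
        # An age is showing: only someone under 21 could be breaking the law.
        return face < 21
    # A drink is showing: only a beer drinker could be breaking the law.
    return face == "Beer"

to_check = [name for name, face in cards.items() if must_flip(face)]
print(to_check)
```

A card needs to be turned over only when its visible side leaves open the possibility "hypothesis true, conclusion false."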





This example illustrates the basic logic behind an implication or “if..., then...”
statement. It will help to analyze why Trooper Jones does not turn over Betty's card
and Dan's.


• P(Betty) is the statement “Betty is drinking beer” and Q(Betty) is “Betty is at least
21 years of age.” Because P(Betty) is false, Trooper Jones doesn't care whether
Q(Betty) is true or not. In ordinary English, Trooper Jones knows that Betty is
not drinking beer, so Betty is obeying the law no matter what her age is.

• P(Dan) is the statement “Dan is drinking beer” and Q(Dan) is “Dan is at least
21 years of age.” Because Q(Dan) is true, Trooper Jones doesn't care whether
P(Dan) is true or not. In ordinary English, Trooper Jones knows that Dan is of
legal age, so Dan is obeying the law no matter what he is drinking.


If the law is being broken, it must be because someone is drinking beer and is
not at least 21 years old. That is, the only time that a statement of the form “If p, then
q” is false is when the hypothesis (statement p) is true, but the conclusion (statement
q) is false. At all other times, we would have to say that the whole statement is true
since the law is not being broken. The truth table shown in Table 1-15 sums up our
analysis of an implicational statement.

p | q | p → q
T | T | T
T | F | F
F | T | T
F | F | T

Table 1-15 Truth Table for Implication

The last two rows of this table are the hardest for most people to swallow. How
can a statement of the form “If p, then q” be true when the statement p is false, and
especially when both statements p and q are false? For our concrete example, we
can see that the last two rows correspond to the situation for Betty, for whom the
hypothesis “x is drinking beer” is false. Betty would match the third row if she is 23,
and the fourth row if she is 18. In either case, though, she is obeying the law, so the
implication p → q is true.

As we will see, this same logic applies to the more abstract setting of mathematical
implications.
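
As a quick check of the truth table for implication, the following Python sketch rebuilds it from the rule that an implication is false only when the hypothesis is true and the conclusion is false. The function name `implies` is an illustrative choice, not notation from the text.

```python
from itertools import product

def implies(p, q):
    """p -> q is false only when p is true and q is false."""
    return not (p and not q)

# Rows in the same order as the table: TT, TF, FT, FF.
rows = [(p, q, implies(p, q)) for p, q in product([True, False], repeat=2)]
for p, q, r in rows:
    print("T" if p else "F", "T" if q else "F", "T" if r else "F")
```

The two rows in which p is false both come out true, matching the situation for Betty.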


Example 3 Identify the hypothesis and conclusion for the following statement:
For every positive integer n, if n is odd, then n³ − n is divisible by 4.


Do you think the statement is true or false? 































n | n³ − n | Divisible by 4?
1 | 0 | Yes, since 0 = (4)(0).
3 | 24 | Yes, since 24 = (4)(6).
5 | 120 | Yes, since 120 = (4)(30).
7 | 336 | Yes, since 336 = (4)(84).

Table 1-16 Analysis of the Statement in Example 3


SOLUTION The domain is the set of positive integers. The hypothesis is the
predicate “n is odd,” and the conclusion is the predicate “n³ − n is divisible by
4.” To explore the truth of the implication, it is natural to think about examples
first, so we do this in Table 1-16. When considering examples, we only listed
values for n for which the hypothesis is true. We never considered the values of
2, 4, 6, or 8 for n. This omission is analogous to the Trooper's behavior toward
the people whose cards indicated that they were not drinking beer. Whether
n³ − n is divisible by 4 for even values of n has no bearing on the truth of the
implication, just as people who are not drinking beer are not breaking the law
regardless of their age.

If we do look at these other (even) values of n, we see that sometimes they
make the conclusion true and sometimes they make the conclusion false, just
as some people who are not drinking beer might be under 21 and some might
be 21 or older.

Table 1-17 shows some examples of this. Since the values of n in Table 1-17
never make the hypothesis true, they cannot be counterexamples to the implication.
It turns out that there are no counterexamples, and that the statement
is therefore true.*
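
The exploration in this example can be repeated for many more values with a short Python sketch. It is only numerical evidence over a finite range, not a proof; the helper name `odd_counterexamples` and the bound 999 are arbitrary choices made for illustration.

```python
def odd_counterexamples(limit):
    """Odd n in 1..limit for which n**3 - n is NOT divisible by 4."""
    return [n for n in range(1, limit + 1, 2) if (n**3 - n) % 4 != 0]

# Hypothesis true (n odd): look for a failing conclusion.
print(odd_counterexamples(999))

# Hypothesis false (n even): the conclusion may be true or false,
# and neither outcome affects the truth of the implication.
even_rows = [(n, n**3 - n, (n**3 - n) % 4 == 0) for n in (2, 4, 6, 8)]
print(even_rows)
```

An empty first list means that no odd n in the tested range is a counterexample.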





Example 4 Why do we consider the following statement to be true?

For all integers n, if 3n = 9, then n² = 9.

SOLUTION As in the previous example, each of the component predicates
“3n = 9” and “n² = 9” can be true or false depending on the value of n, as Table
1-18 illustrates. Once again, when the hypothesis is false, we really don't care
about the conclusion. To establish whether the implicational statement is true,
we must ask, “Could there possibly be a counterexample, that is, is there a
value of n for which the hypothesis is true but the conclusion is false?” Since
the only value of n that makes the hypothesis true in this example is n = 3,
and since this value of n also makes the conclusion true, we conclude that the
implicational statement is true.





* We will learn how to prove this type of statement in the next chapter. 

















































n | n³ − n | Divisible by 4?
2 | 6 | Not divisible by 4.
4 | 60 | Yes, since 60 = (4)(15).
6 | 210 | Not divisible by 4.
8 | 504 | Yes, since 504 = (4)(126).

Table 1-17 More Analysis of the Statement in Example 3

n | Hypothesis (3n = 9) | Conclusion (n² = 9)
−3 | False | True
0 | False | False
3 | True | True
10 | False | False

Table 1-18 Analysis of the Statement in Example 4


Example 5 Why do we consider the following statement to be false?
For all integers n, if n² > 9, then n > 3.

SOLUTION It is possible to find a counterexample, if we remember that
“integers” includes both positive and negative numbers. For example, the value
n = −4 makes the hypothesis (n² > 9) true while making the conclusion (n > 3)
false.

It is only the presence of values of n that make the hypothesis true and the
conclusion false that causes the implicational statement to be false.





Summary For a statement of the form “if hypothesis, then conclusion” to
be FALSE, it must be the case that the hypothesis is true while the conclusion
is false. Otherwise, the statement is TRUE.

For a quantified statement “∀x, if P(x), then Q(x)” to be FALSE, it must
be the case that at least one value of x is a counterexample; that is, there
is at least one value of x that makes the hypothesis P(x) true but makes the
conclusion Q(x) false. Otherwise, the quantified statement is TRUE.
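
The summary translates directly into a counterexample search. The sketch below is a hedged illustration: `counterexamples` is a hypothetical helper, and the range −10..10 is a finite stand-in for the integers, so an empty result is evidence rather than proof.

```python
def counterexamples(domain, P, Q):
    """Values x in domain that make P(x) true and Q(x) false."""
    return [x for x in domain if P(x) and not Q(x)]

sample = range(-10, 11)  # finite stand-in for the integers (an assumption)

# Example 4: if 3n = 9, then n^2 = 9.
ex4 = counterexamples(sample, lambda n: 3 * n == 9, lambda n: n**2 == 9)

# Example 5: if n^2 > 9, then n > 3.
ex5 = counterexamples(sample, lambda n: n**2 > 9, lambda n: n > 3)

print(ex4)
print(ex5)
```

For Example 5 the search returns the negative integers in the sample whose squares exceed 9, including the value n = −4 used in the solution.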











Practice Problem 2 Decide whether each of the following quantified statements
is true or false. For each that is false, give a counterexample. Remember that R is the
set of real numbers, and Z is the set of integers.




(a) ∀x ∈ R, if x² − 5x + 4 = 0, then x > 0.
(b) ∀n ∈ Z, if n² = 1, then n = 1.
(c) ∀ positive integers a and b, if a and b are both odd, then a + b is also odd.
(d) ∀ positive integers a and b, if a and b are both odd, then ab is also odd.





Negating Implications 


Since the negation of a statement captures what it means for the statement to be 
false, we can take our understanding of when an implication is false and turn it into 
a formal rule for forming the negation of an implicational statement. 


Proposition 1 The negation of the implication p → q is the statement p ∧ (¬q).

PROOF The proof consists of showing that the propositions ¬(p → q) and p ∧
¬q are logically equivalent using truth tables.

p | q | p → q | ¬(p → q) | ¬q | p ∧ ¬q
T | T | T | F | F | F
T | F | F | T | T | T
F | T | T | F | F | F
F | F | T | F | T | F

Since the columns for ¬(p → q) and p ∧ ¬q are identical, we conclude that these
statements are logically equivalent. Notice that the negation of an implication is
not an implication!
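
The column comparison in this proof can be mechanized. In the sketch below, the dictionary `IMPLIES` is a hypothetical encoding of the truth table for p → q, entered by hand so that the check does not assume the result it is verifying.

```python
from itertools import product

# Truth table for p -> q, keyed by (p, q).
IMPLIES = {
    (True, True): True,
    (True, False): False,
    (False, True): True,
    (False, False): True,
}

# Compare the column for not(p -> q) with the column for p and (not q).
equivalent = all(
    (not IMPLIES[(p, q)]) == (p and not q)
    for p, q in product([True, False], repeat=2)
)
print(equivalent)
```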


If a quantified implicational statement has the form ∀x ∈ D, P(x) → Q(x) for
some domain D, then the statement is false if there is a value for x in D that makes
the hypothesis P(x) true but the conclusion Q(x) false. Hence, the negation of a
quantified implicational statement can itself be expressed as a quantified statement.

Proposition 2 The negation of the implication ∀x ∈ D, P(x) → Q(x) is the statement
∃x ∈ D, P(x) ∧ (¬Q(x)).

PROOF This follows from Proposition 1 of this section and Proposition 1 of
Section 1.4.


Example 6 Write the negation of each of the following statements:

1. If Bob has an 8:00 class today, then it is a Tuesday.

2. If Jessica gets chocolate, then she has a happy birthday.

3. For all real numbers x, if x > 2, then x² > 4.

4. ∀ real numbers x > 0, if x² = 1, then x = 1.







SOLUTION Each negation is given.

1. Bob has an 8:00 class today, and it is not Tuesday.

2. Jessica gets chocolate, but she doesn't have a happy birthday.

3. There exists a real number x such that x > 2 but x² ≤ 4.

4. ∃ real number x > 0, (x² = 1) ∧ (x ≠ 1). Observe that the domain is still
the set of real numbers that are greater than 0.
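
On a finite domain, the relationship between a quantified implication and its negation can be checked with Python's `all` and `any`. This is a sketch under assumptions: the helper `forall_and_exists` is a hypothetical name, and the integer range below stands in for a domain that is really infinite.

```python
def forall_and_exists(domain, P, Q):
    """Return (truth of 'for all x, P(x) -> Q(x)',
               truth of 'there exists x with P(x) and not Q(x)')."""
    forall = all((not P(x)) or Q(x) for x in domain)
    exists = any(P(x) and not Q(x) for x in domain)
    return forall, exists

domain = range(-5, 6)  # finite sample domain (an assumption)

# "If x > 2, then x^2 > 4" has no counterexample in the sample.
a = forall_and_exists(domain, lambda x: x > 2, lambda x: x * x > 4)

# "If x^2 > 4, then x > 2" fails, for instance at x = -3.
b = forall_and_exists(domain, lambda x: x * x > 4, lambda x: x > 2)

print(a, b)
```

In both cases exactly one of the two values is true, as expected of a statement and its negation.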


Practice Problem 3 Write the negation of each of the following statements: 


1. If you buy the extended warranty, then nothing will go wrong with your television. 
2. If Christopher gets a flu shot, then he will not get the flu. 

3. For all triangles t, if t has three equal sides, then t has three equal angles.

4. ∀x ∈ {2, 3, 4, 5}, x² is positive.


Contrapositives, Converses, and Inverses 

We have determined how the truth value of an implicational statement depends on 
the truth value of its component parts. However, there are many ways to form an 
implication with a given pair of component statements. Trying different combinations 
for hypothesis and conclusion is a fairly natural thing to do when considering a new 
mathematics problem. It is often the case that there are important properties to be 
studied, and you are first trying to figure out how the properties are related to each 
other. 


Example 7 Let P(n) stand for the predicate “n ends in a digit 2,” and Q(n) for
the predicate “n is divisible by 2.” We can use these predicates to form many different
implications.

1. P(n) → Q(n), that is, “If n ends in a digit 2, then n is divisible by 2.”
2. Q(n) → P(n), that is, “If n is divisible by 2, then n ends in a digit 2.”
3. ¬P(n) → ¬Q(n), that is, “If n does not end in a digit 2, then n is not divisible by 2.”
4. ¬Q(n) → ¬P(n), that is, “If n is not divisible by 2, then n does not end in a digit 2.”

As is often the case, each statement has an implied domain and quantifier, in this case
“For all integers n, ....” Decide if each of the quantified statements is true or false. For
each that is false, give a counterexample.


SOLUTION 


1. This statement is true. 


2. The value n = 14 is a counterexample since it makes the hypothesis true (14
is divisible by 2) and the conclusion false (14 does not end in 2).


3. The value n = 14 is a counterexample since it makes the hypothesis true (14 
does not end in 2) and the conclusion false (14 is divisible by 2). 


4. This statement is true. 




You might have noticed that we used the same counterexample for the second 
and third statements above. If you think about it, any counterexample to the second 
statement must be a counterexample to the third statement and vice versa. The 
same relationship holds between the first and fourth statements as well, so since
the first statement has no counterexamples, neither does the fourth statement.
Because of these connections, the relationships illustrated by these four statements 
have special names in the mathematics literature. 
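
The observation about shared counterexamples can be tested on a sample of integers. The sketch below uses hypothetical helper names (`counterexamples`, `negate`) and checks only the range 0..99, so it illustrates the pattern rather than proving it.

```python
def counterexamples(domain, hyp, concl):
    """Values n with the hypothesis true and the conclusion false."""
    return [n for n in domain if hyp(n) and not concl(n)]

def negate(pred):
    return lambda n: not pred(n)

P = lambda n: n % 10 == 2   # n ends in a digit 2 (for n >= 0)
Q = lambda n: n % 2 == 0    # n is divisible by 2

sample = range(0, 100)      # finite sample of the integers (an assumption)
first = counterexamples(sample, P, Q)                     # P -> Q
second = counterexamples(sample, Q, P)                    # Q -> P
third = counterexamples(sample, negate(P), negate(Q))     # not P -> not Q
fourth = counterexamples(sample, negate(Q), negate(P))    # not Q -> not P

print(first == fourth, second == third, 14 in second)
```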





Definition Consider the implication ∀x ∈ D, P(x) → Q(x).

1. The converse of the implication is ∀x ∈ D, Q(x) → P(x).
2. The inverse of the implication is ∀x ∈ D, ¬P(x) → ¬Q(x).
3. The contrapositive of the implication is ∀x ∈ D, ¬Q(x) → ¬P(x).











We have defined the terms in the context of quantified statements, but the same 
terms can be applied in an obvious way to propositions: 


• The converse of p → q is q → p.
• The inverse of p → q is ¬p → ¬q.
• The contrapositive of p → q is ¬q → ¬p.


Since we have a mechanism for proving propositions to be logically equivalent, 
we can formally establish the relationships mentioned in the discussion following 
Example 7. 


Proposition 3 


1. An implication and its contrapositive are logically equivalent. 
2. The converse and inverse of an implication are logically equivalent. 
3. An implication is not logically equivalent to its converse (and thus not to its inverse).

PROOF We will prove the first statement by comparing the truth tables of p → q
and ¬q → ¬p.

p | q | p → q | ¬q | ¬p | ¬q → ¬p
T | T | T | F | F | T
T | F | F | T | F | F
F | T | T | F | T | T
F | F | T | T | T | T

Since the columns for p → q and ¬q → ¬p are identical, we conclude that these
statements are logically equivalent. The other two parts of this proposition are
addressed in Exercise 5 at the end of this section.
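
All three parts of Proposition 3 can be checked by brute force over the four truth assignments. The following Python sketch is one way to do so; the names `implies` and `equivalent` are illustrative helpers, not notation from the text.

```python
from itertools import product

def implies(p, q):
    """p -> q is false only when p is true and q is false."""
    return not (p and not q)

def equivalent(f, g):
    """True if f and g agree on every truth assignment to (p, q)."""
    return all(f(p, q) == g(p, q) for p, q in product([True, False], repeat=2))

implication = lambda p, q: implies(p, q)
converse = lambda p, q: implies(q, p)
inverse = lambda p, q: implies(not p, not q)
contrapositive = lambda p, q: implies(not q, not p)

print(equivalent(implication, contrapositive))
print(equivalent(converse, inverse))
print(equivalent(implication, converse))
```

The last comparison fails on the row where p is true and q is false, which is why an implication and its converse must be treated as different statements.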


Example 8


1. Give an example of a true implication whose converse is false. 
2. Give an example of a true implication whose converse is also true. 


SOLUTION 


1. The implication “For all integers a and b, if a and b are odd, then a + b is 
even" is true.* Its converse “For all integers a and b, if a + b is even, then a 
and b are odd" is false. One counterexample is a = 2, b = 14. 
2. “If n is even, then n² is even” is true and has a true converse.

Since it is possible for a true implicational statement to have a false converse,
but it is also possible for a true statement to have a true converse, we must always
treat implications and their converses as two entirely different statements that must
each be analyzed on its own merits.

The Language of Implication

One of the frustrations of the English language is that there are many ways to state
the same thought. This phenomenon occurs not only in ordinary conversation, but
also in mathematical expression. This is one reason mathematicians are so fond of
symbolism: a completely symbolic expression of a theorem is less likely to contain
the ambiguity inherent in spoken and written language. In this short section, we
will give you a few examples of some common sources of confusion in the English
language.

One possible source of confusion is our tendency in English to rearrange the
parts of an implication without changing its meaning. For example, these sentences
convey exactly the same idea:


• I will pass the course if I ace the final.
• If I ace the final, then I will pass the course.


Switching the location of the hypothesis and conclusion within the sentence does not 
change the meaning. The hypothesis is the part that goes with the word if, and in 
each sentence that is the phrase “I ace the final.” Do not confuse this with forming 
the converse, which involves interchanging the hypothesis and conclusion. Here is a 
mathematical example: 


Statement: If an integer m ends in the digit 0, then m is a multiple of 5.
Same statement: An integer m is a multiple of 5 if it ends in the digit 0. 
Converse: If an integer m is a multiple of 5, then m ends in the digit 0. 


Another possible source of confusion is the variety of ways in which we can 
express quantification and implication. We have discussed earlier our tendency to 


* In the next chapter, we will learn how to prove statements of the type in this example. 




leave out domains and quantifiers when we speak. Another issue when formalizing 
statements is the choice of writing a “for all” statement as an implication or not. 
Given a statement of the form “∀x ∈ D, Q(x)” where D is a subset of a natural larger
set U, we can equivalently write the statement as “∀x ∈ U, (x ∈ D) → Q(x).” For
now, the choice is a matter of taste, but we will learn to prefer the latter in the next 
chapter when we discuss formal proofs of this type of statement. 
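
The equivalence of the two forms can be illustrated on small finite sets. In this sketch the sets `U` and `D` and the predicates are hypothetical examples chosen only to exercise both a true and a false statement.

```python
U = range(-5, 6)               # a larger "natural" set (an assumption)
D = [x for x in U if x > 0]    # a subset of U: the positive elements

def restricted(Q):
    """For all x in D, Q(x)."""
    return all(Q(x) for x in D)

def as_implication(Q):
    """For all x in U, (x in D) -> Q(x)."""
    return all((x not in D) or Q(x) for x in U)

square_at_least_x = lambda x: x * x >= x   # true for every positive integer
greater_than_one = lambda x: x > 1         # fails at x = 1

print(restricted(square_at_least_x), as_implication(square_at_least_x))
print(restricted(greater_than_one), as_implication(greater_than_one))
```

Elements of U outside D make the hypothesis false, so they can never be counterexamples to the implicational form.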


Example 9 Here is one way to express the idea that every computer science
major at this university must take discrete mathematics: “Let D be the set of computer
science majors at this university. For all s ∈ D, s must take discrete mathematics.”


1. Rewrite this as an implication. 


2. Form the negation of the original statement, and the statement written as an impli- 
cation. 


SOLUTION 


1. Here is one possible solution: The set U of all students at this university is a
natural set that includes all of D. So an equivalent implicational statement
is “For all s ∈ U, if s is a computer science major, then s must take discrete
mathematics.”

2. The negation of the original statement is “There exists a computer science
major at this university who does not have to take discrete mathematics.”
For the implicational form, the negation is “There exists a student at this
university who is a computer science major but does not have to take discrete
mathematics.” Both say the same thing, but in a slightly different way.


Practice Problem 4 Rewrite each quantified predicate as an implication. Unless 
otherwise indicated, assume that variables stand for real numbers. Use R and Z to 
indicate the sets of reals and integers, if you wish. 


(a) ∀ even integer m, m ends in the digit 0, 2, 4, 6, or 8.
(b) ∀x > 0, x² > x.
(c) For every positive odd integer n, n³ − n is divisible by 4.


Finally, it is sometimes the case that both an implication and its converse are 
true. This is the strongest possible relationship between the properties that make 
up the hypothesis and conclusion of the implication. Because of the importance of 
this situation, mathematicians have some common ways to convey the idea that an 
implication and its converse are both true. For example, the statement “If n is even, 
then n² is even” is true, and it has a true converse. A mathematician might say, “If n
is even, then n² is even, and conversely,” or “n is even if and only if n² is even.”

In general, for any statements p and q (with or without variables), the phrase 
“p if and only if q” means that both p → q and q → p are true. For this reason, we
use the notation p ↔ q to express this biconditional statement. When considering a
statement of this form, it is often easiest to consider each of these two statements 
separately. 
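
The definition of the biconditional as a pair of implications can be sketched as follows. The helper names are illustrative, and the check of “n is even if and only if n² is even” covers only a finite sample of integers.

```python
from itertools import product

def implies(p, q):
    """p -> q is false only when p is true and q is false."""
    return not (p and not q)

def iff(p, q):
    """p <-> q holds when both p -> q and q -> p hold."""
    return implies(p, q) and implies(q, p)

# The biconditional is true exactly when p and q have the same truth value.
same_value = all(iff(p, q) == (p == q) for p, q in product([True, False], repeat=2))

is_even = lambda n: n % 2 == 0
sample = range(-50, 51)  # finite sample of the integers (an assumption)
holds_on_sample = all(iff(is_even(n), is_even(n * n)) for n in sample)

print(same_value, holds_on_sample)
```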




Practice Problem 5 What are the two implicational statements expressed by the 
statement "The integer n is a multiple of 10 if and only if n is even"? Is each statement 
true? If not, give a counterexample. 


Logic Puzzles Revisited 

Since now we know how to make truth tables for implicational statements, we end this
section with some more logic puzzles, involving the use of implication. Remember 
to make your truth tables by first considering the statements themselves without 
worrying about who said them. 


Example 10 You meet two inhabitants. A says, “If B is truthful, then so am I,”
and B says, “At least one of us is lying.” Who (if anyone) is telling the truth?

SOLUTION Here is the truth table. Notice that A's statement is q → p, which
is false only when q is true but p is false.

  | p | q | Statement 1: If B is truthful, then so is A | Statement 2: At least one of us is lying
  | T | T | T | F
  | T | F | T | T
* | F | T | F | T
  | F | F | T | T

We conclude that only B is a truth-teller.


Example 11 You meet two inhabitants. A says, “If B is truthful, then so am I,”
and B says, “A is lying.” Who (if anyone) is telling the truth?

SOLUTION This is similar to the previous example. A's statement (if q, then
p) is false only in the situation where q is true but p is false.

  | p | q | Statement 1: If B is truthful, then so is A | Statement 2: A is lying
  | T | T | T | F
* | T | F | T | F
* | F | T | F | T
  | F | F | T | T

In this case there is an ambiguous answer since either of the middle two rows
could be solutions. We can conclude that exactly one of A and B is a truth-teller,
but we cannot determine which one from what they said.


Example 12 You meet three inhabitants. A says, “If B is lying, then so is C,”
B says, “C is truthful,” and C says, “At least one of us is lying.” Who (if anyone) is
telling the truth?

SOLUTION To build the table for A's statement, we analyze it as being
¬q → ¬r. The only time it is false is when ¬q is true but ¬r is false. Put another
way, the only time Statement 1 is false is when q is false but r is true.

p | q | r | Statement 1: If B is lying, then so is C | Statement 2: C is truthful | Statement 3: At least one of us is lying
T | T | T | T | T | F
T | T | F | T | F | T
T | F | T | F | T | T
T | F | F | T | F | T
F | T | T | T | T | T
F | T | F | T | F | T
F | F | T | F | T | T
F | F | F | T | F | T

In this example there are no solutions, so this is a paradox. In other words, given
the rules of the problem, no three people could have uttered those particular
phrases.
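
The row-by-row analysis used in Examples 10 through 12 can be automated. The sketch below assumes the usual rules of these puzzles, namely that each inhabitant is truthful exactly when his or her statement is true; the function `solve` and the lambda encodings of the statements are hypothetical names chosen for illustration.

```python
from itertools import product

def implies(p, q):
    """p -> q is false only when p is true and q is false."""
    return not (p and not q)

def solve(statements):
    """Return every truth assignment in which each speaker is truthful
    exactly when that speaker's statement is true."""
    n = len(statements)
    return [w for w in product([True, False], repeat=n)
            if all(w[i] == statements[i](*w) for i in range(n))]

# Example 10: A says q -> p; B says "at least one of us is lying."
example10 = solve([lambda p, q: implies(q, p),
                   lambda p, q: not (p and q)])

# Example 11: A says q -> p; B says "A is lying."
example11 = solve([lambda p, q: implies(q, p),
                   lambda p, q: not p])

# Example 12: A says (not q) -> (not r); B says r; C says "at least one of us is lying."
example12 = solve([lambda p, q, r: implies(not q, not r),
                   lambda p, q, r: r,
                   lambda p, q, r: not (p and q and r)])

print(example10, example11, example12)
```

One consistent row means a unique answer, several rows mean the puzzle is ambiguous, and an empty list corresponds to a paradox.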


Solutions to Practice Problems 


1. (a) D is the set of real numbers, P(x) is “x has a real
square root,” and Q(x) is “x is not negative.”

(b) D is the set of real numbers, P(x) is “x satisfies the
equation x² − x = 6,” and Q(x) is “x = 3.”

(c) D is the set of integers, P(n) is “n is even,” and Q(n)
is “2^n − 1 is divisible by 3.”

(d) D is the set of integers, P(n) is “n ends with a 3,” and
Q(n) is “n is divisible by 3.”

2. (a) There are only two values for x that make the hypothesis
true (x = 1 and x = 4), and for both these
values the conclusion is also true. Therefore, the “if,
then” statement is true.

(b) Although there is a value of n (namely n = 1) that
makes both the hypothesis and conclusion true,
there is also a value (n = −1) where the hypothesis
is true and the conclusion is false. So the “if, then”
statement is false.

(c) This statement is false, and it is easy to find a
counterexample: just pick any odd integers for a
and b. For example, with a = 1 and b = 7, we have
a + b = 8. Since for this choice of a and b the hypothesis
is true but the conclusion is false, the original “if,
then” statement is false.

(d) Table 1-19 shows some possible values of a and b.
The fact that some entries make the conclusion false
is irrelevant, since these same values make the hypothesis
false as well. It turns out that any time we
make the hypothesis true, the conclusion is also true,
and so the “if, then” statement is true. In the next
chapter we will see how to prove this fact.

3. (a) You buy the extended warranty, and something does
go wrong with your television.

(b) Christopher gets the flu shot, but he does get the flu.

(c) There is a triangle t where t has three equal sides
but t does not have three equal angles.

(d) ∃x ∈ {1, 2, 3, 4, 5} such that x² ≤ 0.

4. (a) ∀m ∈ Z, if m is even, then m ends in the digit 0, 2, 4,
6, or 8.

(b) ∀x ∈ R, if x > 0 then x² > x.

(c) ∀n ∈ Z, (n > 0) ∧ (n odd) → n³ − n is divisible by 4.

5. The statement “If n is a multiple of 10, then n is even”
is a true statement, but the converse “If n is even, then
n is a multiple of 10” is false, having n = 2 as a counterexample.
Hence, the given “if and only if” statement
is false.





Table 1-19 Table for Practice Problem 2(d)

Exercises for Section 1.5

1. Write each statement as a statement of formal propositional
logic. That is, assign variable names to the simple
phrases, and write the statement using those variables
along with the logical connectives ¬, ∧, ∨, and →.

(a) If you don't attend the concert, you will get an F for
the course.

(b) We will go if you go.

(c) I ate my lunch but I did not eat breakfast.

(d) If you don't eat your breakfast, you will be hungry.

(e) It is false that this triangle has both a 30° angle and
a 60° angle.

(f) If a quadrilateral is a square, it has four equal sides
and four equal angles.

(g) If a triangle has either two equal sides or two equal
angles, then it is an isosceles triangle.

2. For each statement in Exercise 1, give a truth table for
the statement, and use it to explain what conditions
would make the statement true and what would make
it false.

3. In a certain board game played with a pair of dice, if
you roll “doubles” three times in a row, you must place
your piece on the board square marked “Jail.” After
playing the game for two grueling hours, no one ever
rolled “doubles” three times in a row. Can we conclude
that no piece was ever placed on the “Jail” square during
the game? Refer specifically to the hypothesis and
conclusion of the rule above in your explanation.

4. Complete the following truth tables for the given compound
expressions:


^ ao 
pag | 0^0 














4 
maan |a 
byt | 
mA A 
mamamana 






















p q 7 \ PAG) PYF | ^g (vn 
TORT 

T TF 

T FT 
(T F F 

FTT 

FTF 

FFT 

FFF 

















5. Use truth tables to check if each of the given pairs of
symbolic logic statements are equivalent.

(a) p → q and q → p (NOTE: This is a generic implication
and its converse.)

(b) ¬p → ¬q and q → p (NOTE: This is the inverse
and converse of a generic implication p → q.)

(c) p ∧ (p → q) and p ∧ q

(d) p → (q → r) and (p ∧ q) → r

(e) (p ∨ q) → r and (p → r) ∨ (q → r)

6. It is sometimes useful to realize that the implication
p → q is logically equivalent to ¬p ∨ q. This exercise
explores this equivalence.

(a) What combination of truth values for p and q makes
the implication p → q false?

(b) Explain how you know that the negation of ¬p ∨ q
is p ∧ ¬q. (Hint: Refer to Theorem 2 of Section
1.3.)

(c) What combination of truth values makes p ∧ ¬q
true? (This same combination would make ¬p ∨ q
false.)

(d) Use a truth table to formally demonstrate that
p → q is logically equivalent to ¬p ∨ q.
7. For each of the following statements, rewrite them in
propositional logic notation, making the meaning of
your propositional variables clear. Use truth tables to
find any pairs of logically equivalent statements.

(a) If Alaina likes basketball, then she likes swimming
and gymnastics.

(b) If Alaina likes gymnastics, then she likes swimming
and basketball.

(c) If Alaina dislikes gymnastics or dislikes swimming,
then she dislikes basketball.

(d) Alaina dislikes basketball or she likes both swimming
and gymnastics.

3. Write each of the following predicates using the simple 
predicates > 0and y > Üalong with the propositional 
connectives A, V, =, and =: 
(a) If x is positive, then y is positive. 
(6) It xis positive, then y is not positive 


We 


= 


E 


I 


Jey9,2005 — 2050 


(©) If x is not positive, then y is positive. 
(d) If x is not positive, then y is not positive. 


9. For each of the given values of x and y, determine which
predicates from Exercise 8 become true statements.
(a) If x = 8 and y = 3, which of (a-d) are true?
(b) If x [...] 0, which of (a-d) are true?
(c) If x [...] −1, which of (a-d) are true?
(d) If x [...] 0, which of (a-d) are true?
(e) If x [...] −10, which of (a-d) are true?
(f) If x [...] 2, which of (a-d) are true?


10. Let D be the set {1, 3, 5, 7, 8, 10, 11, 12}. For each of the
following, decide whether it is true for all the elements
of D. If it is not, give a counterexample.

(a) If x is even, then x > 7.

(b) If x is odd, then x > 7.

(c) If x is even, then x < 12.

(d) If x is odd, then x < 12.

(e) If x > 12, then x is even.

11. Let D be the set {1, 3, 5, 7, 8, 10, 11, 12}. For each of the
following, decide whether it is true for all the elements
of D. If it is not, give a counterexample.
(a) If x is even and x > 7, then x < 20.
(b) If x is odd, then x < 10.
(c) If x < 10 and x ≠ 8, then x is odd.
(d) If x is odd or x < 5, then x − 1 is even.
(e) If x > 5 and x < 7, then x is negative.
12. For each of the following statements, describe the predicates
P and Q that make the formal statement

For every positive integer n, $P(n) \to Q(n)$

correctly represent the statement.
(a) If n is even, then $n^2 + n$ is even.

(b) If n is a multiple of 5, then n has ones' digit of 5.
(c) If n is prime, then $2^n - 1$ is prime.


13. For each of the following statements, describe the predicates
P and Q that make the formal statement

There exists an integer n, $P(n) \wedge Q(n)$

correctly represent the statement.
(a) Some odd numbers n make $2^n - 1$ a multiple of 7.

(b) At least one multiple of 5 does not have a ones' digit
of 0.

(c) It is possible for a multiple of 3 to have a ones' digit
of 7.


14. Express each of the following statements using predicates
and the quantifier $\forall$:

(a) If n ends in 5, then n is a multiple of 5.

(b) If m ends in 3, then m is a multiple of 3.




(c) If n is a multiple of 5, then $n^2 - 1$ is a multiple of 3.
(d) For every positive real number x, if x < V7, then 
2Jx>V2. 
15. Which of the statements in Exercise 14 are true?
16. For each of the statements in Exercise 14, express the
negation using predicates and the quantifier $\exists$.
17. Express each of the following statements using predicates
and the quantifiers $\forall$ and $\exists$:
(a) If n is a multiple of 5, then n ends in 5 or n ends in
0.
(b) If n is not a multiple of 3, then $n^2 - 1$ is a multiple
of [...].
(c) For all odd integers a and b, there is no real number
x such that $x^2 + ax + b = 0$.
(d) For every real number y, if y > 0, then there exists
$x \in \mathbb{R}$ such that $x^2 = y$.
18. Which of the statements in Exercise 17 are true?

19. For each of the statements in Exercise 17, express the
negation using predicates and the quantifiers $\exists$ and $\forall$.
20. Give a counterexample to each of the following to show
that it is a false implicational statement. If you think the
statement is actually true, write one sentence explaining
why you think no counterexample exists.

(a) If $x^2 = 4$, then $x^3 = 8$.
(b) If sin(x) = 0, then cos(x) = 1.
(c) If cos(x) = 0, then sin(x) = 1.
(@ Ix =x, then? = 





21. Consider the statement “If n is even, then $2^n - 1$ is a
multiple of 3.” If we let P(x) be the predicate “x is even,”
and we let Q(x) be the predicate “$2^x - 1$ is a multiple
of 3,” then we can represent the original statement as


For every natural number n, $P(n) \to Q(n)$.


Create an organized table of your attempts to find a 
counterexample to this statement. Why do you think 
that no counterexample exists? 

22. A consequence of Proposition 2 is that the negation
of the statement $\exists x \in D,\ P(x) \wedge Q(x)$ is the statement
$\forall x \in D,\ P(x) \to \neg Q(x)$. Use this fact to write the negation
of each of the following statements, using careful
wording and an “if, then” structure when possible:

(a) There exists a positive integer n such that n is even 
and} >1. 

(b) There exist positive integers a and bsuch that a — b 
is odd and a? = 29. 

(c) There exist integers a and b such that a and b are
positive and a/b = 1 + b/a.

(d) There exists a right triangle with perimeter equal to
three times the length of one leg.




23. Use the idea in Exercise 22 to write the negation of each 
of the following statements, using careful wording and 
an “if, then” structure when possible: 

(a) There exist integers m and n such that m > n and 


m+ = 11. 
(b) There exists a real number z such that z > 0 and 
z+lal 


(c) If x is a positive real number, then there is a positive
integer n with nx > 1.

(d) If S is a set of three people, then there exist two
people in S with the same sex.

24. Which of the original statements in the previous exercise
are true?
25. Form the contrapositive of each of these statements:
(a) If you don't attend the concert, you will get an F for
the course.

(b) We will go if you go.

(c) If you don't eat your breakfast, you will be hungry.

(d) If a quadrilateral is a square, it has four equal sides
and four equal angles.

(e) If a triangle has either two equal sides or two equal
angles, then it is an isosceles triangle.

26. Form the converse of each of the statements in Exercise
25.

27. Form the inverse of each of the statements in Exercise
25.

28. Solve each of these logic puzzles by using truth tables.
(a) You come across two inhabitants. A says, “I am lying
if B is,” and B says, “A is lying if I am.” Can you
tell who, if anyone, is telling the truth?

(b) You come across three inhabitants. A says, “If B is
lying, then so is C,” B says, “If C is lying, then so is
A,” and C says, “If A is lying, then so is B.” Who, if
anyone, is telling the truth?

29. If there was a third person in Example 11, what could 
she have said that would have determined everyone's 
truthfulness? 

30. Express each of the following using quantified statements
over the domain S of all college students and
the predicates C(x) meaning “x is a computer science
major,” and D(x) meaning “x takes discrete mathematics.”
Which statements are equivalent to one another?
Which statements are negations of one another?

(a) Every computer science major takes discrete mathematics.

(b) Some computer science majors take discrete mathematics.

(c) No computer science major takes discrete mathematics.

(d) Not every computer science major takes discrete
mathematics.

(e) All computer science majors do not take discrete
mathematics.

(f) You must take discrete mathematics if you are a
computer science major.

(g) Some computer science majors do not take discrete
mathematics.


1.6 Excursion: Validity of Arguments





Charles Sanders Peirce wrote [41] 


... Bad reasoning as well as good reasoning is possible; and this fact is the
foundation of the practical side of logic.


Logic has been an important part of intellectual endeavors since the time of the 
ancient Greeks. Aristotle (384-322 B.C.E.) wrote about the importance of precise
reasoning in discourse, and Euclid's famous book The Elements has been used in 
teaching the art of rigorous thought and the skill of persuasive argument for nearly 
2,000 years. Since so many decisions in our lives are based on the persuasive use 
of language and logic, the field of formal logic is an active, important discipline 
of philosophy. In the next chapter, we will consider the question of mathematical 
theorems and their proofs. These mathematical proofs are one example of a general 
concept—using valid arguments to persuade others that something is true. In the 
current section, we will examine the general notion of arguments in everyday life, 
and how to detect some common misuses of logic. We begin with an example of a 
phenomenon that we sometimes see in advertising: What you think you hear may not 
be what the advertisement really said. 


Example 1 A car company advertises, “If you didn't buy from us, you paid
too much.” Write this as a statement of formal propositional logic, and examine it with
a truth table.


SOLUTION Let p stand for “You bought a car from us,” and q stand for
“You paid too much.” The statement in the ad is represented by $\neg p \to q$, and
its truth table looks like this:

 p | q | $\neg p$ | $\neg p \to q$
 T | T |    F     |       T
 T | F |    F     |       T
 F | T |    T     |       T
 F | F |    T     |       F

Hence, the only circumstances under which the statement is false are if you do
not buy your car from this company and you do not pay too much for it. Notice
that if you do buy your car from this company, the advertised statement is true
regardless of the truth value of statement q.
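The truth table in Example 1 can also be produced mechanically. The following Python sketch is an illustration only and is not part of the original text; the helper names `implies` and `ad_claim` are our own, and `True`/`False` stand in for T/F.

```python
from itertools import product

def implies(a: bool, b: bool) -> bool:
    """Truth value of a -> b: false only when a is true and b is false."""
    return (not a) or b

def ad_claim(p: bool, q: bool) -> bool:
    """The ad's statement (not p) -> q.

    p: "You bought a car from us"; q: "You paid too much".
    """
    return implies(not p, q)

# Rows appear in the same order as the table: TT, TF, FT, FF.
rows = [(p, q, ad_claim(p, q)) for p, q in product([True, False], repeat=2)]

for p, q, value in rows:
    print(p, q, value)

# Collect the truth assignments under which the ad's statement is false.
false_rows = [(p, q) for p, q, value in rows if not value]
```

Under these assumptions, `false_rows` contains only the assignment in which both p and q are false, which matches the observation that the ad is false only for someone who bought elsewhere and did not pay too much.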


The car company that sponsored this ad is probably hoping you hear something
different from what they actually say. Two possible misinterpretations are:


• If you paid too much for your car, you didn't buy from us. This is the converse of
the advertiser's claim.


• If you bought a car from us, you did not pay too much. This is the inverse of the
advertiser's claim.


As we indicated in the previous section, a statement and its converse (or inverse)
do not necessarily have the same truth value. Truth in advertising may force the car
company to make true statements in its advertisements, but it is up to the consumer
to resist inferring that the inverse or converse is also true.
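To see concretely which of these related statements agree, one can compare their truth-table columns. The sketch below is an illustration, not part of the original text; the names `implies`, `table`, and the four column variables are our own. It uses p for “You bought a car from us” and q for “You paid too much,” as in Example 1.

```python
from itertools import product

def implies(a: bool, b: bool) -> bool:
    """Truth value of a -> b."""
    return (not a) or b

# All truth assignments for (p, q) in the order TT, TF, FT, FF.
ASSIGNMENTS = list(product([True, False], repeat=2))

def table(statement):
    """Return the column of truth values of `statement` over all assignments."""
    return [statement(p, q) for p, q in ASSIGNMENTS]

claim = table(lambda p, q: implies(not p, q))           # if not p, then q
converse = table(lambda p, q: implies(q, not p))        # if q, then not p
inverse = table(lambda p, q: implies(p, not q))         # if p, then not q
contrapositive = table(lambda p, q: implies(not q, p))  # if not q, then p

print("claim == contrapositive:", claim == contrapositive)
print("converse == inverse:    ", converse == inverse)
print("claim == converse:      ", claim == converse)
```

The columns for the claim and its contrapositive coincide, as do those for the converse and the inverse, while the claim and its converse differ.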

Why are we so likely to misinterpret the advertiser's claim? One reason is that, 
in ordinary conversation, we frequently make statements of the form “if p, then q” 
when our true meaning does include both the statement and its converse (or inverse). 


Example 2 When you were a child, your parents said, “If you don't eat your
peas, you can't have dessert.” You promptly ate your peas and asked for dessert. What
statement did you hear? Is it the same statement your parents made? Do you think you
misinterpreted their statement?
SOLUTION You are acting under the assumption that their statement was “If
you do eat your peas, you can have dessert.” This is NOT the same statement. It
is the inverse. However, most parents who make a statement such as this really
mean both what they actually say and its inverse.


When we are dealing with our parents or our friends, there is perhaps no harm 
in reading into their statements more than was actually said. However, in a more 
formal setting such as a debate, a mathematical proof, or a false advertising suit, it is 
important to establish the exact meaning of each statement, and to analyze whether 
each successive statement follows logically from the previous statements. 


Example 3 Using p to stand for “You bought a car from us” and q to stand
for “You paid too much,” analyze the inverse of the advertiser's claim: “If you bought
a car from us, you did not pay too much.” Under what circumstances is this inverse
statement false?
SOLUTION The statement is written formally as $p \to \neg q$, and it has the
following truth table:

 p | q | $\neg q$ | $p \to \neg q$
 T | T |    F     |       F
 T | F |    T     |       T
 F | T |    F     |       T
 F | F |    T     |       T

The first row is the only row where the result is false. Hence, if there is a
customer who buys from the dealer and pays too much, then this implicational
statement is false. In every other case, the implicational statement is true.


If the car dealer's ad contained this inverse statement, there would be a good
chance that it would be false, and they could be sued for false advertising. The original
ad is carefully phrased to avoid false advertising, counting on you the consumer to
misinterpret what it says.


Practice Problem 1 Create the truth tables for each of the following, and relate 
them to the truth tables for the original claim and the inverse of the claim: 


(a) The converse of the advertiser's claim: “If you did pay too much, you didn't buy
from us.” 

(b) The contrapositive of the advertiser's claim: “If you didn't pay too much, you 
bought from us.” 


More on the Language of Implication 

One obstacle to analyzing the truth of English statements is that English as a language
is somewhat complicated and often confusing. One could focus a large part of an 
entire course on the pitfalls of ambiguity in spoken language. We will not pursue 
that route since mathematics rarely suffers from this problem. Precision in language 
is one of the hallmarks of mathematics. Even in mathematics, though, there are 
a variety of ways to state the basic logical relationships between properties, so it is 
worth spending some time on the most common ways to express the basic implication 
relationship. 


Proposition 1 Let p and q stand for statements. The implication "if p is true, 
then q is true" can be expressed in each of the following ways: 


1. If p is true, then q is true.
2. p is true only if q is true.
3. For q to be true, it is sufficient that p is true.
4. For p to be true, it is necessary that q is true.


Example 4 Each of the following is an equivalent way to express the basic
implication “If you live in Pittsburgh, then you live in Pennsylvania”:


• You live in Pittsburgh only if you live in Pennsylvania.
• It is necessary to live in Pennsylvania in order to live in Pittsburgh.
• To live in Pennsylvania, it is sufficient to live in Pittsburgh.


At least partly because of the possible rearrangements of the clauses within the 
sentences, implications that are phrased using only if, sufficient, or necessary can be 
more difficult to understand than those that use the word if. Here are some hints that 
might prove helpful: 


• If the order of the clauses is confusing to you, then before doing anything else,
rewrite the statement in the order with which you are more comfortable.

• A statement that uses “only if” is the converse of the same statement with “if” in
its place.

• If a statement uses “sufficient,” the condition that is sufficient is the hypothesis of
the corresponding “if, then” statement.

• If a statement uses “necessary,” the condition that is necessary is the conclusion of
the corresponding “if, then” statement. Hence, a statement that uses “necessary”
is the converse of the same statement with “sufficient” in its place.


Practice Problem 2 Rewrite each of the following sentences in "if, then" form: 


(a) You will pass the test only if you study for at least four hours.
(b) Attending class regularly is a necessary condition for passing the course.

(c) In order to be a square, it is sufficient that the quadrilateral have four equal angles.
(d) In order to be a square, it is necessary that the quadrilateral have four equal angles.
(e) An integer is an odd prime only if it is greater than 2.


Valid and Invalid Forms of Reasoning 


Early in life, children develop reasoning skills. Some of these skills are so fundamental
that they seem almost trivial. For example, suppose you tell a child, “Your mother
has gone to work, and so has your father.” If you later ask, “Where is your mother?”
the child will likely state that she is at work. The child has used a form of reasoning
that we might write as


Statements p and q are both true.
Therefore, I can conclude that p is true,


where p is the proposition “Mother is at work,” and q is the proposition “Father is
at work.” We can write forms of reasoning like this using two different brief
representations:

$$\begin{array}{l} p \wedge q \\ \hline \therefore\ p \end{array} \qquad \text{or} \qquad p \wedge q,\ \therefore\ p.$$

Similarly, if you tell the child, “Either you must eat your peas or you must eat
your carrots,” the child will instinctively eat the carrots and announce, “I'm all done.”
The child has used the form of reasoning represented as follows:

$$\begin{array}{l} q \\ \hline \therefore\ p \vee q \end{array} \qquad \text{or} \qquad q,\ \therefore\ p \vee q$$

where p indicates “Peas have been eaten,” and q indicates “Carrots have been eaten.”
In both vertical and horizontal representation formats, we list one or more statements
that are presumed to be true, and then we list a conclusion that can be inferred from
these premises. Here is another simple example of a valid form of reasoning consisting
of two premises followed by the conclusion:

$$\begin{array}{l} p \\ q \\ \hline \therefore\ p \wedge q \end{array} \qquad \text{or} \qquad p,\ q,\ \therefore\ p \wedge q$$

This example points out the rather obvious fact that if you know p is true, and you
also know q is true, it is valid to conclude that the statement $p \wedge q$ is also true.




In future examples and exercises, we will explore a variety of other valid forms
of reasoning, some trivial and some not so trivial. For now, we will concentrate our 
attention on some important forms of reasoning related to implicational statements. 


Example 5 A parent says, “If you clean your room, we will go play miniature
golf.” The child cleans her room, and announces, “Let's go golfing.” Give a formal
description of the reasoning process used by the child.
SOLUTION We use p for “You clean your room,” and q for “We play miniature
golf.” We can write the given argument structure in vertical or horizontal
format as follows:

$$\begin{array}{l} p \to q \\ p \\ \hline \therefore\ q \end{array} \qquad \text{or} \qquad p \to q,\ p,\ \therefore\ q$$


In this example, the child has applied the form of reasoning known by the Latin
name modus ponens, which means “in a manner that asserts.” In words, it simply
states that if an implication is true, and its hypothesis is also true, then its conclusion
must be true.

This direct method of reasoning is related to another valid form of reasoning
known as modus tollens, which means “in a manner that negates.” Simply stated, if
an implication statement is true, and its conclusion is false, then its hypothesis must
be false. This is represented symbolically as follows:

$$\begin{array}{l} p \to q \\ \neg q \\ \hline \therefore\ \neg p \end{array}$$

Example 6 Here is an example of a valid argument using modus tollens:
If this quadrilateral is a square, then it has four equal sides. 


This quadrilateral does not have four equal sides. 
Therefore, this quadrilateral is not a square. 





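Notice that with modus tollens, one is simply applying the reasoning process
of modus ponens to the contrapositive statement $\neg q \to \neg p$. That is, the following
methods of reasoning are the same because they only differ in the first line, and those
lines represent equivalent implicational statements:

$$\begin{array}{l} p \to q \\ \neg q \\ \hline \therefore\ \neg p \quad \text{by modus tollens} \end{array} \qquad\qquad \begin{array}{l} \neg q \to \neg p \\ \neg q \\ \hline \therefore\ \neg p \quad \text{by modus ponens} \end{array}$$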


Of course, not every form of reasoning applied by a person in an argument is a 
valid form of reasoning. Before we discuss how we tell a valid form from an invalid 
form of reasoning, we will look at two common erroneous methods of argument. 
These and other invalid reasoning forms are referred to as fallacies. The particular
names of the two fallacies below stem from the tendency to confuse an implication 
with its converse or inverse, as we discussed earlier in this section. 







Definition 


1. The converse fallacy is the invalid reasoning form given by $p \to q,\ q,\ \therefore\ p$.
2. The inverse fallacy is the invalid reasoning form given by $p \to q,\ \neg p,\ \therefore\ \neg q$.














Example 7 For each of the following, identify the reasoning as either valid
(modus ponens), valid (modus tollens), invalid (converse fallacy), or invalid (inverse
fallacy):


1. If n is even, then $n^2$ is even. I know $n^2$ is even. Therefore, n is even.

2. If this triangle has three equal sides, then it has three equal angles. This triangle has 
three equal sides. Therefore, it has three equal angles. 

3. If you eat your supper, we will play miniature golf. You didn't eat your supper. 
Therefore, we will not play miniature golf. 

4. If I get an A on the final exam, I get an A for the course. I didn't get an A for the 
course. Therefore, I didn't get an A on the final. 

5. If I don't get an A on the final exam, I won't get an A for the course. I got an A for
the course. Therefore, I got an A on the final. 


SOLUTION 


1. Let p stand for “n is even” and q for “$n^2$ is even.” The reasoning follows the
form $p \to q,\ q,\ \therefore\ p$. This is the converse fallacy.

2. Let p be “this triangle has three equal sides” and q “this triangle has three
equal angles.” We write the reasoning as $p \to q,\ p,\ \therefore\ q$, which is valid by
modus ponens.

3. With p as “eat supper” and q as “miniature golf,” this is $p \to q,\ \neg p,\ \therefore\ \neg q$,
the inverse fallacy.

4. If p indicates “I get an A on the final exam” and q “I get an A for the course,”
we have $p \to q,\ \neg q,\ \therefore\ \neg p$, which is valid by modus tollens.

5. We can choose our symbols to make the implication read as $p \to q$, so p
would be “I don't get an A on the final exam” and q would be “I don't get
an A for the course.” The second sentence, “I got an A for the course,” is
the negation of q, that is, $\neg q$. Likewise, the third sentence, “I got an A on
the final,” is $\neg p$. The reasoning is $p \to q,\ \neg q,\ \therefore\ \neg p$, valid by modus tollens.
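One way to see why two of these forms are valid and two are fallacies is to search for a truth assignment that makes every premise true while the conclusion is false. The Python sketch below is an illustration only and is not part of the original text; the function name `counterexamples` and the dictionary `forms` are our own.

```python
from itertools import product

def implies(a: bool, b: bool) -> bool:
    """Truth value of a -> b."""
    return (not a) or b

def counterexamples(premises, conclusion):
    """Return every (p, q) making all premises true and the conclusion false."""
    return [
        (p, q)
        for p, q in product([True, False], repeat=2)
        if all(prem(p, q) for prem in premises) and not conclusion(p, q)
    ]

def if_p_then_q(p, q):
    return implies(p, q)

# Each form: (list of premises, conclusion), written as functions of (p, q).
forms = {
    "modus ponens": ([if_p_then_q, lambda p, q: p], lambda p, q: q),
    "modus tollens": ([if_p_then_q, lambda p, q: not q], lambda p, q: not p),
    "converse fallacy": ([if_p_then_q, lambda p, q: q], lambda p, q: p),
    "inverse fallacy": ([if_p_then_q, lambda p, q: not p], lambda p, q: not q),
}

results = {name: counterexamples(prems, concl)
           for name, (prems, concl) in forms.items()}

for name, found in results.items():
    print(name, "->", found if found else "no counterexample")
```

In this sketch the two valid forms have no such assignment, while both fallacies fail on the same assignment: p false and q true.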





Practice Problem 3 For each of the following, identify the reasoning as either
valid (modus ponens), valid (modus tollens), invalid (converse fallacy), or invalid
(inverse fallacy):


(a) If you don't eat your supper, we won't play miniature golf. You ate your supper. 
Therefore, we will play miniature golf. 

(b) If you fail the final, you will fail the course. You passed the final. Therefore, you 
will pass the course. 




(c) If you fail the final, you will fail the course. You failed the final. Therefore, you 
will fail the course. 


(d) If the sky is red at morning, it will not rain. It rained. Therefore, the sky was not 
red at morning. 


Analysis of Arguments 


In order to use logic effectively in discourse, we need to see how to analyze the
validity of persuasive arguments. We have already looked at some specific types of
arguments that use modus ponens and modus tollens, and we have also learned to
detect invalid arguments caused by the converse and inverse fallacies. At this point
we have all the tools to analyze more complex arguments.

Recall from Section 1.3 that a propositional logic expression is a tautology if its
truth value is true for all possible combinations of the truth values of the individual
components. With this piece of terminology, we can give a formal definition to a valid
argument structure.











Definition An argument structure is a list of propositions

$$p_1,\ p_2,\ \ldots,\ p_n,\ \therefore\ q$$

where statements $p_1, p_2, \ldots, p_n$ are called the premises of the argument and
the statement q is called the conclusion of the argument. Such an argument
structure is said to be valid if

$$(p_1 \wedge p_2 \wedge \cdots \wedge p_n) \to q$$

is a tautology, and it is said to be invalid otherwise.











To be picky, we should always use the phrase “valid argument structure” rather 
than “valid argument.” Technically, a valid argument is an argument (with actual 
content) that follows a valid argument structure. For example, $p \to q,\ p,\ \therefore\ q$ is an
argument structure. An argument that follows this structure is “If it is raining, the 
streets are wet. It is raining. Therefore, the streets are wet.” However, many people 
use the two terms interchangeably. 
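The definition of validity translates directly into a brute-force check: build the implication from the conjunction of the premises to the conclusion and confirm that it is true for every truth assignment. The following Python sketch is an illustration, not part of the original text; `is_valid` and its `num_vars` parameter are names we introduce here.

```python
from itertools import product

def implies(a: bool, b: bool) -> bool:
    """Truth value of a -> b."""
    return (not a) or b

def is_valid(premises, conclusion, num_vars: int) -> bool:
    """True when (p1 and ... and pn) -> q holds for every truth assignment.

    Each premise and the conclusion is a function of `num_vars` booleans.
    """
    return all(
        implies(all(prem(*values) for prem in premises), conclusion(*values))
        for values in product([True, False], repeat=num_vars)
    )

# Modus ponens: p -> q, p, therefore q.
modus_ponens_valid = is_valid(
    [lambda p, q: implies(p, q), lambda p, q: p],
    lambda p, q: q,
    num_vars=2,
)

# Converse fallacy: p -> q, q, therefore p.
converse_fallacy_valid = is_valid(
    [lambda p, q: implies(p, q), lambda p, q: q],
    lambda p, q: p,
    num_vars=2,
)

# A three-variable form: p -> q, q -> r, therefore p -> r.
chain_valid = is_valid(
    [lambda p, q, r: implies(p, q), lambda p, q, r: implies(q, r)],
    lambda p, q, r: implies(p, r),
    num_vars=3,
)

print(modus_ponens_valid, converse_fallacy_valid, chain_valid)
```

Because a structure with n variables has 2^n assignments, this check is practical only for the small structures considered in this section.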


Example 8


1. Show that the modus ponens argument structure is valid. 


2. Decide whether the following argument is valid. If not, give an assignment of truth 
values that makes all premises true and the conclusion false. 


You must pass the final exam or pass all the tests if you pass the course. You failed 
the final exam and the course. Therefore, you must not have passed all the tests. 
SOLUTION 


1. Recall that modus ponens is formalized as $p \to q,\ p,\ \therefore\ q$. We build a truth
table for the compound expression $((p \to q) \wedge p) \to q$:

 p | q | $p \to q$ | $(p \to q) \wedge p$ | $((p \to q) \wedge p) \to q$
 T | T |     T     |          T           |              T
 T | F |     F     |          F           |              T
 F | T |     T     |          F           |              T
 F | F |     T     |          F           |              T

Since the final result has all T truth values, the expression $((p \to q) \wedge p) \to q$
is a tautology, so the argument structure is valid.

2. Let e stand for passing the final exam, a for passing all the tests, and p for
passing the course. Do not be fooled by the placement of the word “if”
in the middle of the statement. This means that p is the hypothesis of this
statement, so we can write this statement as $p \to (e \vee a)$.

So we can write the argument as $p \to (e \vee a),\ \neg e \wedge \neg p,\ \therefore\ \neg a$. The truth
table for this statement is shown below. The “Final Result” column is for the
expression $((p \to (e \vee a)) \wedge (\neg e \wedge \neg p)) \to \neg a$, and the “Premises” column
is for the conjunction $(p \to (e \vee a)) \wedge (\neg e \wedge \neg p)$.

 e | a | p | $e \vee a$ | $p \to (e \vee a)$ | $\neg e$ | $\neg p$ | $\neg e \wedge \neg p$ | Premises | $\neg a$ | Final Result
 T | T | T |     T      |         T          |    F     |    F     |           F            |    F     |    F     |      T
 T | T | F |     T      |         T          |    F     |    T     |           F            |    F     |    F     |      T
 T | F | T |     T      |         T          |    F     |    F     |           F            |    F     |    T     |      T
 T | F | F |     T      |         T          |    F     |    T     |           F            |    F     |    T     |      T
 F | T | T |     T      |         T          |    T     |    F     |           F            |    F     |    F     |      T
 F | T | F |     T      |         T          |    T     |    T     |           T            |    T     |    F     |      F
 F | F | T |     F      |         F          |    T     |    F     |           F            |    F     |    T     |      T
 F | F | F |     F      |         T          |    T     |    T     |           T            |    T     |    T     |      T

Since there is an F in the final column, the reasoning is faulty. A person
who fails the final, passes all the tests, but fails the course demonstrates the
fallacy.
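The search for the row with an F in the final column can be automated. The sketch below is an illustration, not part of the original text; the function names are our own, and the variables e, a, and p carry the meanings assigned in the solution (pass the final exam, pass all the tests, pass the course).

```python
from itertools import product

def implies(x: bool, y: bool) -> bool:
    """Truth value of x -> y."""
    return (not x) or y

def premises_hold(e: bool, a: bool, p: bool) -> bool:
    """Premise 1: p -> (e or a). Premise 2: (not e) and (not p)."""
    return implies(p, e or a) and ((not e) and (not p))

def conclusion(e: bool, a: bool, p: bool) -> bool:
    """Conclusion of the argument: not a."""
    return not a

# Assignments where both premises are true but the conclusion is false.
counterexamples = [
    (e, a, p)
    for e, a, p in product([True, False], repeat=3)
    if premises_hold(e, a, p) and not conclusion(e, a, p)
]

for e, a, p in counterexamples:
    print(f"final exam passed: {e}, all tests passed: {a}, course passed: {p}")
```

The single assignment found corresponds to the student described above: the final is failed, all the tests are passed, and the course is failed.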


Practice Problem 4 


(a) Show that modus tollens is a valid argument structure. 

(b) The argument structure $p \vee q,\ p \to r,\ q \to r,\ \therefore\ r$ is called “division into cases.”
We know one of two things is true, and we prove that no matter which is true, the
conclusion r follows. Show that this argument structure is valid.


Solutions to Practice Problems 


1. We use p to stand for “You bought a car from us” and
q to stand for “You paid too much.”
(a) The converse statement is written formally as $q \to \neg p$.
Here is the truth table. The final result is the
same as for the inverse.

 p | q | $\neg p$ | $q \to \neg p$
 T | T |    F     |       F
 T | F |    F     |       T
 F | T |    T     |       T
 F | F |    T     |       T

(b) The contrapositive statement is written formally as
$\neg q \to p$. Here is the truth table. The final result is
the same as for the original statement.

 p | q | $\neg q$ | $\neg q \to p$
 T | T |    F     |       T
 T | F |    T     |       T
 F | T |    F     |       T
 F | F |    T     |       F

2. (a) If you pass the test, then you studied for at least four
hours.

(c) If a quadrilateral has four equal angles, then it is a
square.

(d) If a quadrilateral is a square, then it has four equal
angles.

(e) If an integer is an odd prime, then it is greater than
2.

3. (a) Inverse fallacy

(b) Inverse fallacy

(c) Valid by modus ponens

(d) Valid by modus tollens

4. (a) Modus tollens is formalized as $p \to q,\ \neg q,\ \therefore\ \neg p$.
See the first truth table below, for the compound
expression $((p \to q) \wedge \neg q) \to \neg p$. The final column [...]

(b) [...] $((p \vee q) \wedge (p \to r) \wedge (q \to r)) \to r$:

 p | q | $p \to q$ | $\neg q$ | $(p \to q) \wedge \neg q$ | $\neg p$ | $((p \to q) \wedge \neg q) \to \neg p$
 T | T |     T     |    F     |            F             |    F     |                  T
 T | F |     F     |    T     |            F             |    F     |                  T
 F | T |     T     |    F     |            F             |    T     |                  T
 F | F |     T     |    T     |            T             |    T     |                  T

[Second truth table, for part (b): entries are illegible in the source.]



Exercises for Section 1.6 


1. A parent says, “If you don't eat your supper, you can't
have dessert.” The child eats his supper and the parent
says, “No dessert for you.” If the child takes his parent
to court, does he have a valid case?

2. Each argument is either correct or it has a fallacy. Write
the argument in symbols, then determine whether the
argument is valid. If it is valid, determine whether it uses
modus ponens or modus tollens. If it isn't valid, identify
the fallacy as either the converse or the inverse fallacy.


(a) If both numbers are even, then the sum is even.
They are not both even. Therefore, the sum is not.
(b) If this number is a perfect square, then the equation
has a rational solution. The equation has a rational
solution. Therefore, this number is a perfect square.
(c) If you didn't buy from us, you paid too much. You
did buy from us. Therefore, you didn't pay too much.



(d) If this prime number is even, then it is less than 5.
This prime number is even. Therefore, it is less than
5.

(e) If this city is large, then it has large buildings. This
city has large buildings. Therefore, it is large.


3. For each of the following, identify the reasoning as either
valid (modus ponens), valid (modus tollens), invalid
(converse fallacy), or invalid (inverse fallacy).

(a) If you pass the test, you studied for at least four
hours. You studied for at least four hours. Therefore,
you will pass the test.

(b) If you pass the course, you attend class regularly.
You did not attend class regularly. Therefore, you
will not pass the course.

(c) The quadrilateral is a square if it has four equal angles.
The quadrilateral has four equal angles. Therefore,
it is a square.

4. Recall that you can phrase “if, then” statements with
words like “necessary,” “sufficient,” and “only if.” Following
the model from the text, convert the following
statements to “if, then” statements.

(a) In order to pass this course, it is sufficient to read
the book.

(b) In order to pass this course, it is necessary that you
read the book.

(c) You will pass this course only if you read the book.


5. Write each of the following as an implication in “if,
then” form:

(a) In order for you to get a refund from the store, it is
necessary that you have your sales receipt.

(b) To grasp someone's mathematics background, it
is sufficient to have them pronounce the name
“Euler.”

(c) You are legally driving in Pennsylvania only if you
are 16 years old.

(d) To understand the history of calculus it is necessary
to study Descartes.

(e) It is sufficient to carry an umbrella to stay dry outside.

(f) Stephen King is fun to read if you like horror stories.

6. Use truth tables to characterize each of the following
propositions as a tautology, a contradiction, or
neither:

@ p p 

W (p a) v(a p) 

© (pag) v (a > >p) 

® (pv 9) > (a ^p) 


7. Use truth tables to characterize each of the following
propositions as a tautology, a contradiction, or
neither:



(9) (p> (QD) v (pA) D 

© (p> daa > ^P) > -P 

© (p — d) ^ Cp» ) > avr) 

8. Some of the valid reasoning forms are so obvious that
they come automatically to us. For example, if we know
two things are true, then we certainly know that the first
is true. We can formalize this as $p \wedge q,\ \therefore\ p$. Use a truth
table to formally demonstrate the validity of this and
the other “obvious” forms of reasoning. For each, give
a concrete example of this form of reasoning in daily
life.

(a) $p \wedge q,\ \therefore\ p$

(b) $p,\ \therefore\ p \vee q$

(c) $p,\ q,\ \therefore\ p \wedge q$

(d) $p \vee q,\ \neg p,\ \therefore\ q$

(e) $p \to q,\ q \to r,\ \therefore\ p \to r$


9. Use a truth table to decide whether each of the following
argument structures is valid. If it is not, give an assignment
of truth values to the propositional variables that
makes each premise true and the conclusion false.

(a) p-q4q- pi pag 

(b) p> (^q ^r).q.-. 7p. 

O pogr porc 

@ pva por 

(e) palgyr)r > Prag 

10. Use truth tables to show that each of the following arguments
is invalid. Write one sentence explaining a situation
(corresponding to an assignment of truth values)
that illustrates this.

(a) If Newton is not considered a great mathematician
and Leibniz's work is not ignored, then calculus
would not be the centerpiece of the modern
math curriculum. Newton is considered the greatest
mathematician only if Leibniz's work is ignored.
Therefore, calculus is the centerpiece of the modern
math curriculum and Leibniz's work is not ignored.

(b) If the weather forecast is good, then we have a picnic
if and only if we have bread for sandwiches. If
we have a picnic, then the weather forecast is good.
Therefore, we have bread for sandwiches. (Note:
The statement “p if and only if q” can be expressed
symbolically as $(p \to q) \wedge (q \to p)$.)

(c) If I have a good round of golf, then the wind is
calm or the weather is dry. The wind is calm and the
weather is dry. Therefore, I have a good round of
golf.

(d) Either I read a novel or I both lie on the couch and
watch a baseball game on TV. I read a novel only
if I lie on the couch. Therefore, if I do not watch a
baseball game, then I do not lie on the couch.







11. Use a truth table to show that each of the following arguments is valid:

(a) For our camping trip, we take extra blankets if we take our gas heater, and if we do not take extra blankets, we do not take our air mattress. Therefore, we take our air mattress or gas heater only if we take extra blankets.

(b) Steve votes for a Libertarian candidate if and only if both his wife (Stella) votes for a Democrat and his father (Stan) votes for a Republican. For Stella to vote for a Democrat, it is necessary that Stan not vote for a Republican. Therefore, Steve does not vote Libertarian.

(c) If the prime interest rate goes up, then it is sufficient that unemployment goes down for prices to rise. However, unemployment goes down only if the prime interest rate goes up. Therefore, if prices do not go up, then unemployment does not go down.

(d) Each summer either I visit my family or I take a car trip and take off some time from work. I visit my family only if I do not take off time from work. Therefore, if I take off time from work, then I take a car trip.


Chapter 1 Summary


1.1 First Examples


You should have tried one or more of the games and puzzles in this section:

• Perhaps you have tried the card trick, using different sets of actions by the person following the instructions, to see if the club is always the card facing the opposite way from the others.

• Perhaps you have played the Josephus game with different numbers of people, and different rules for how many people are “skipped” at each step, looking for patterns in the results.

• Perhaps you have looked for organized ways to represent the outcomes of tennis matches, individual tennis games, and other similar situations.

• Perhaps you have tried to find a pattern for what kind of figures can be traced without lifting the pencil from the paper and without retracing any lines.

• Perhaps you have looked for winning strategies in the game played on the 4 x 4 grid.


1.2 Number Puzzles and Sequences

Terms and concepts

• You should recognize the use of the term sequence to describe a list of numbers, and feel comfortable with the notation $a_n$ to indicate the $n$th term of the sequence.

• You should be able to distinguish between a recursive formula and a closed formula for a sequence.

• You should recognize the Fibonacci numbers and their recursive formula.

• You should feel comfortable with sigma notation. In particular, you should be able to interpret $\sum_{k=1}^{n} a_k$ as the sum of the first $n$ terms of a sequence.


Working with sequences

• Given a recursive formula for a sequence, you should be able to:

  - Calculate the first several terms of the sequence.

  - Give recursive formulas for specific terms in the sequence (for example, the 20th term, or the $(3k - 1)$th term).

  - Calculate particular terms of the sequence, given information about earlier terms in the sequence.

• Given a closed formula for a sequence, you should be able to:

  - Calculate the first several terms of the sequence.

  - Give values or formulas for specific terms in the sequence (for example, the 20th term, or the $(3k - 1)$th term).

• Given the first few terms of a sequence, you should be able to:

  - Discover a pattern in those terms.

  - Apply that pattern to calculate additional terms.

  - Apply that pattern to discover recursive and/or closed formulas for the sequence.

• You should know how to verify that a sequence satisfies a given recursive formula.

• You should be able to convert from sigma notation to the usual summation notation, and back.
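
As a small illustration of the difference between a recursive formula and a closed formula, the following sketch uses a made-up sequence that is not taken from the text: $a_1 = 1$ and $a_n = a_{n-1} + 2$, with the conjectured closed formula $a_n = 2n - 1$. All function names are hypothetical, and the numerical check is only evidence that the closed formula satisfies the recursion, not a proof.

```python
def a_recursive(n):
    """n-th term from the recursive formula a_1 = 1, a_n = a_(n-1) + 2."""
    if n == 1:
        return 1
    return a_recursive(n - 1) + 2


def a_closed(n):
    """n-th term from the conjectured closed formula a_n = 2n - 1."""
    return 2 * n - 1


def satisfies_recursion(closed, upto):
    """Check that a closed formula obeys the recursion for n = 2..upto."""
    starts_right = closed(1) == 1
    steps_right = all(closed(n) == closed(n - 1) + 2 for n in range(2, upto + 1))
    return starts_right and steps_right


def sigma(term, n):
    """Sigma notation: term(1) + term(2) + ... + term(n)."""
    return sum(term(k) for k in range(1, n + 1))


# First five terms, computed from the recursive formula.
first_terms = [a_recursive(n) for n in range(1, 6)]
```

Here `a_closed(20)` gives a specific term directly, while `a_recursive(20)` has to work its way down to the first term; `sigma(a_closed, 5)` adds up the first five terms.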


1.3 Truth-Tellers, Liars, and Propositional Logic

Terms and concepts

• You should be familiar with propositions and the related terms propositional variable and formal proposition.

• You should understand the concept of the negation of a proposition.

• You should understand the meaning of the operations $\land$ (and), $\lor$ (or), and $\neg$ (not).

• You should know the terminology “exclusive or” and how it differs from the $\lor$ (or) operation.

• You should know what it means to say that two statements are logically equivalent.

• You should understand the terms tautology and contradiction.


Operations with propositions

• You should be able to create new propositions from existing ones using the operations $\land$ (and), $\lor$ (or), and $\neg$ (not).

• You should be able to interpret and formalize English-language constructions such as “neither-nor” and “exactly one.”

• You should be able to form the negation of a proposition using the double negative property and DeMorgan's laws.

• You should be able to apply truth tables to:

  - Analyze the statements made by Smullyan's Truth-tellers and Liars.

  - Analyze the truth value of compound statements based on the truth value of their components.

  - Determine whether two statements are logically equivalent.

• You should be able to use Theorem 2 to:

  - Simplify compound statements.

  - Verify that two statements are equivalent.


1.4 Predicates

Terms and concepts

• You should be familiar with predicates and the notation $P(x)$.

• You should understand the concept of domain of a predicate, and the related terms set, element, and member, as well as the notation $\in$.

• You should know the purpose of quantifiers and understand the notation $\forall$ (for all) and $\exists$ (there exists).

• You should know the circumstances in which a quantified statement is true, and in particular the meaning of the word counterexample.


Operations with predicates

• You should be able to create new predicates from existing ones using the operations $\land$ (and), $\lor$ (or), and $\neg$ (not).

• You should be able to form the negation of a predicate using the double negative property and DeMorgan's laws.

• Given a predicate over a specific domain, you should be able to determine the truth value of $P(x)$ for particular elements of the domain.

• Given a quantified statement over a specific domain, you should be able to determine the truth value of the statement.

• You should be able to negate a quantified statement both symbolically and in natural language:

  - The negation of $\forall x \in D, P(x)$ is $\exists x \in D, \neg P(x)$.

  - The negation of $\exists x \in D, Q(x)$ is $\forall x \in D, \neg Q(x)$.

• With multiple quantifiers, you should know the difference between $\forall x, \exists y, P(x, y)$ and $\exists y, \forall x, P(x, y)$.


1.5 Implications

Terms and concepts

• You should recognize implications, whether written as “if p is true, then q is true,” as “p implies q,” or using the notation $p \to q$.

• You should be able to identify the hypothesis and the conclusion of an implication.

• You should understand the meaning of the word counterexample as applied to implications.

• You should know the meaning of the terms converse, inverse, and contrapositive.

• You should know the term biconditional and its notation $p \leftrightarrow q$.


Interpreting implications

• Given an implication involving predicates, you should be able to identify the domain, the hypothesis, and the conclusion, writing the implication in the form “For all $x \in D$, if $P(x)$, then $Q(x)$.”

• You should understand the logic of the implication $p \to q$:

  - You should be able to give a truth table for $p \to q$.

  - You should know that the only time $p \to q$ is false is when the hypothesis (statement p) is true but the conclusion (statement q) is false.

• For an implication in the form, “For all $x \in D$, if $P(x)$, then $Q(x)$,” you should be able to determine if you think the implication is true, and be able to give a counterexample if it is not true.

• You should be able to interpret a variety of English-language constructions that explicitly or implicitly involve quantified implications:

  - Convert such sentences to formal notation.

  - Recognize which sentences have the same meaning as each other, and which do not.


Operations with implications

• You should be able to negate implications, including quantified implications.

• You should be able to form the converse, inverse, and contrapositive of an implication.

• You should know that the original implication and its contrapositive are logically equivalent, and that the converse and inverse are logically equivalent.

• You should be able to give examples illustrating that a statement and its converse may not have the same truth value.

• You should be able to apply the logic of implications to logic puzzles (Smullyan's truth-tellers and liars).


1.6 Excursion: Validity of Arguments

Terms and concepts

• You should understand the use of the phrases “if,” “only if,” “necessary,” and “sufficient,” in English-language implications.

• You should know that an argument structure has the general form

  $p_1, p_2, \ldots, p_n, \therefore q$

  and recognize the premises of the argument ($p_1, p_2, \ldots, p_n$) and the conclusion of the argument ($q$).

• You should recognize modus ponens and modus tollens as two valid argument structures.

• You should recognize the converse fallacy and the inverse fallacy as two invalid argument structures.


Interpreting arguments

• You should be able to distinguish among a statement, its converse, and its inverse, and apply this to discourse of various types.

• You should know how to analyze what a person has actually said, avoiding the inappropriate substitution of a converse or inverse for the person's statement.

• You should be able to interpret English-language implications stated in a variety of ways (if, only if, necessary, sufficient), converting each to standard “if p, then q” form.

• You should be able to use a truth table to determine whether or not a given argument structure is valid.

• You should be able to analyze English-language arguments using a two-step process:

  - Convert the argument to a formal argument structure.

  - Analyze the validity of that structure. In many situations, you may recognize the structure as:

    * One of the valid argument forms modus ponens, modus tollens;

    * One of the invalid argument forms converse fallacy, inverse fallacy.
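
The truth-table test for validity described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper names `implies` and `find_counterexample` are hypothetical, and each premise and conclusion is represented as a function of the propositional variables. An argument structure is valid exactly when no row of the truth table makes every premise true and the conclusion false.

```python
from itertools import product


def implies(p, q):
    """Truth value of 'if p, then q': false only when p is true and q is false."""
    return (not p) or q


def find_counterexample(premises, conclusion, num_vars):
    """Search every row of the truth table.

    Returns an assignment that makes all premises true and the conclusion
    false, or None when no such row exists (the structure is valid).
    """
    for values in product([True, False], repeat=num_vars):
        if all(prem(*values) for prem in premises) and not conclusion(*values):
            return values
    return None


# Modus ponens: p -> q, p, therefore q.
modus_ponens = find_counterexample(
    [lambda p, q: implies(p, q), lambda p, q: p],
    lambda p, q: q,
    2,
)

# Converse fallacy: p -> q, q, therefore p.
converse_fallacy = find_counterexample(
    [lambda p, q: implies(p, q), lambda p, q: q],
    lambda p, q: p,
    2,
)
```

For modus ponens the search finds nothing, so the form is valid. For the converse fallacy it returns the row with p false and q true, which is the assignment of truth values that an exercise on invalid arguments asks for.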








A Primer of 
Mathematical Writing 


In the previous chapter, we looked at the nature of deductive reasoning as it relates to everyday experience, and we studied the logical structure of mathematical statements, discussing at length what it means for a statement to be true or false. In this chapter, we will discuss the properties of some common mathematical objects and how to present a mathematical proof about these properties to others.

The subject of our first efforts will be properties of integers, simply because integers are the mathematical objects most familiar to us. We will use these familiar objects to build our skills in writing proofs of mathematical statements. In the third and fourth sections, we will consider the proof technique known as mathematical induction, arguably the most important proof structure in discrete mathematics. In the fifth section, we will study a technique called “proof by contradiction,” which applies indirect reasoning in an argument. Finally, in the last two sections we will apply the skills we have developed to explore some problems of interest to the typical student of mathematics, computer science, or education.

It is often the case that the abstract nature of mathematics makes mathematical proofs seem more mysterious than other forms of argument. For this reason, we will make every attempt to separate the logic of our arguments from the abstract nature of the subject matter. We will take time to build an understanding of the mathematical objects and properties before attempting proofs. If we are patient enough, the logical structure of your mathematical writing will seem as natural as an argument with your sister.




2.1 Mathematical Writing 





Mathematicians today rely on the notion of symbolic logic to serve as the foundation of the system of reasoning that they bring to bear on the solution of problems. Of course, mathematics was quite successful for many centuries before symbolic logic was introduced, so it is not necessary to study symbolic logic in order to understand formal mathematics. A defining characteristic of mathematics is the thorough understanding of the cause-and-effect relationship between properties of formally defined objects. The demonstration of this understanding is the notion of proof, around which the mathematics literature is based. In this section we will try to understand how to recognize a proof when we see one, and we will start to develop the skill of writing them ourselves.

Unlike conversational English, most mathematical statements are in the form of implications. That is, since mathematics studies relationships between formal objects, statements that effectively state, “Whenever an object has property P, then it must also have property Q” are commonplace. These are often more succinctly written “if p, then q,” but the nature of the English language allows a wide variety of equivalent forms.














Example 1 Write each of the following statements in “if, then” form.

1. Whenever n is an even integer, $2n^3 + n$ is divisible by 3.
2. For every prime n, $n^2 - n + 41$ is also prime.
3. The sum of the interior angles of any triangle is 180°.

SOLUTION

1. If an integer n is even, then $2n^3 + n$ is divisible by 3.
2. If a positive integer n is prime, then the number $n^2 - n + 41$ is prime.
3. If P is a triangle, then the sum of the interior angles of P is 180°.


Aside from the variety of different ways to make a mathematical statement, you should glean two important points from the examples above. Not every mathematical statement is necessarily true, and not every mathematical statement is necessarily about numbers.





Practice Problem 1 Determine which one of the three statements above is false. 


Implications and Their Contrapositives 


We begin by highlighting a few of the ideas covered in some detail in Chapter 1, 
starting with this example first discussed in Section 1.5. 


Example 2 Trooper Jones walks into the Goldilocks Pub and sees four Boatsvile College students enjoying various beverages. In front of each person, there is a card which has the person's age on one side and what he or she is drinking on the other side. She sees that the face-up sides of the cards look like Figure 2-1. The drinking age law states in effect:

If you are drinking beer, then you are at least 21 years of age. (2.1)

Which cards does Trooper Jones need to see turned over to check that everyone is obeying the law?

[Figure 2-1: four cards whose face-up sides read 19, Coke, Beer, and 25.]

Figure 2-1 Age-drink cards.

SOLUTION If the law is being broken, it must be because someone is drinking beer and is under 21 years of age. That is, the only time that a statement of the form “if p, then q” is false is when the hypothesis p is true but the conclusion q is false. At all other times, we would have to say that the whole statement is true (i.e., the law is not being broken). Trooper Jones is looking for counterexamples to the “if, then” statement given in (2.1). She must turn over the cards that could give counterexamples: the “19” and the “Beer.”
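
The reasoning in this solution can be written as a short Python sketch. The function name `must_turn_over` and the way the cards are encoded (integers for ages, strings for drinks) are assumptions made for this illustration only. A card needs to be turned over exactly when its visible side leaves open the possibility of a counterexample to (2.1).

```python
def must_turn_over(face_up):
    """Could this card hide a counterexample to
    'if you are drinking beer, then you are at least 21'?"""
    if isinstance(face_up, int):
        # An age is showing. The conclusion is already true for anyone
        # 21 or older, so only an under-21 card could hide a violation.
        return face_up < 21
    # A drink is showing. The hypothesis is false for anything but beer,
    # so only the beer card could hide a violation.
    return face_up == "Beer"


cards = [19, "Coke", "Beer", 25]
to_check = [card for card in cards if must_turn_over(card)]
```

The cards left out are the ones for which the implication is true no matter what is on the hidden side: the hypothesis is false for “Coke,” and the conclusion is true for “25.”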











This example illustrates the basic logic behind an implication or “if, then” statement, which is summarized in Table 2-1. The last two rows of the table are the hardest for most people to swallow. How can a statement of the form “if p, then q” be true when the statement p is false, and especially when both statements p and q are false? A visual aid might be useful here.








Example 3 The diagram in Figure 2-2 shows a way that the Goldilocks Pub can be sure to uphold the Pennsylvania law. It can simply have a separate room for its patrons who are at least 21 years of age, and within that room reserve a table for their patrons who are drinking beer.

As long as the pub is arranged in this way, the people in the Goldilocks Pub are sure to be obeying the law. The patrons for whom the hypothesis is false (i.e., they are not drinking beer) can be in either room (i.e., the conclusion can be true or false). Notice also that the hypothesis could be false for all the patrons (they could all be enjoying nonalcoholic beverages) and the law would still be obeyed.











Hypothesis (p)          | Conclusion (q)       | Implication (If p, then q)
You are Drinking Beer   | You are at Least 21  | You are Obeying the Law
True                    | True                 | True
True                    | False                | False
False                   | True                 | True
False                   | False                | True

Table 2-1 Summary Analysis of an Implication


[Figure 2-2: a diagram with regions labeled “People drinking beer,” “People at least age 21,” and “People under age 21.”]

Figure 2-2 A lawful situation.





Most people have no difficulty with this concrete example; certainly Trooper Jones would immediately realize whose cards need to be checked. The same reasoning applies to more abstract settings. It is our contention that it is usually the abstraction in mathematics that causes the confusion, not the deductive logic used. To see this point illustrated, consider the following variation.*


Practice Problem 2 Suppose someone makes four cards with letters on one side and numbers on the other, and that person then states, “If one side has a vowel, then the other side has an odd number.” Which of the cards in Figure 2-3 must be turned over to tell whether the person is being truthful?





In Section 1.5 we also introduced the notion of the contrapositive of an implication. Let us review the idea by thinking of equivalent ways of expressing the drinking age law:

If you are under 21 years of age, then you are not drinking beer. (2.2)

To see this is equivalent to our original statement of the law, think of the only situation in which the law is broken: specifically, when someone under the age of 21 drinks beer. This situation makes the hypothesis of (2.2) true and the conclusion false, exactly the situation that makes the implication false as discussed before. Note, however, that this same situation makes the hypothesis of (2.1) true while making its conclusion false. In other words, the statements (2.2) and (2.1) are false in exactly the same situation and hence true in the same situations; this is what is meant by being “logically equivalent.”

In general, the statement “If p is true, then q is true” can always be equivalently expressed by its contrapositive statement “If q is not true, then p is not true.”
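
A brief Python sketch can confirm this equivalence by comparing truth values row by row. The helper name `implies` and the variable names are chosen for this illustration only. The converse is included for contrast, since it does not agree with the original implication in every row.

```python
from itertools import product


def implies(p, q):
    """Truth value of 'if p, then q': false only when p is true and q is false."""
    return (not p) or q


rows = []
for p, q in product([True, False], repeat=2):
    original = implies(p, q)                # if p, then q
    contrapositive = implies(not q, not p)  # if not q, then not p
    converse = implies(q, p)                # if q, then p
    rows.append((p, q, original, contrapositive, converse))

# Logically equivalent means: the same truth value in every row.
same_as_contrapositive = all(row[2] == row[3] for row in rows)
same_as_converse = all(row[2] == row[4] for row in rows)
```

With only two propositional variables there are just four rows to check, so this exhaustive comparison settles the question completely.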














[Figure 2-3: four cards, each showing a letter or a number.]

Figure 2-3 Letter-number cards.

* This experiment, due to P. C. Wason [48], actually precedes the “drinking age” version by almost 20 years. It is now commonly called Wason's selection task in the cognitive science literature.


Example 4 For each implication below, explain the traits a counterexample to the statement must have. Which of the pairs are contrapositives of one another?

1. (a) If n is even, then $n! + 1$ is prime.
   (b) If n is odd, then $n! + 1$ is not prime.

2. (a) If $n^2$ is even, then n is even.
   (b) If n is not even, then $n^2$ is not even.

3. (a) If n is prime, then the number following n is not a perfect square.
   (b) If m is a perfect square, then the number preceding m is not prime.


SOLUTION

1. These two statements are not contrapositives of each other. We can see this by thinking of the traits a counterexample to each statement must have.

   (a) A counterexample would have to be an even number n for which $n! + 1$ is not prime.

   (b) A counterexample would have to be an odd number n for which $n! + 1$ is prime.

2. These two statements are contrapositives of one another. We can also see this by thinking of the traits a counterexample to each statement must have.

   (a) A counterexample would be an integer n where $n^2$ is even and n is not even.

   (b) A counterexample would be an integer n where n is not even and $n^2$ is even.

3. These two statements are also contrapositives of one another, but this is clear only after fiddling with the language a bit. We can see this more clearly by thinking of the traits a counterexample to each statement must have.

   (a) A counterexample would have to come from a prime number followed by a perfect square.

   (b) A counterexample would be a perfect square preceded by a prime number.


Practice Problem 3 Write the contrapositive of each of the following statements. Describe the traits of a counterexample for both the original statement and its contrapositive.

(a) If n is divisible by 3, then $n^2$ is divisible by 9.

(b) If n is prime, then 2n is not divisible by 3.


Mathematical Proofs 


Now that we understand the logic of implication statements, we turn to the question “How do we prove that an implication is true?” In the small setting encountered by Trooper Jones, the answer is simple. Once Trooper Jones is sure there are no counterexamples, she will be satisfied that the implication is true for all the patrons of the Goldilocks Pub. The same is true in more abstract settings: if there cannot possibly be a counterexample, then we will know the implication is true for all elements of the domain.














Practice Problem 4 Play the role of Trooper Jones for these mathematical implications. That is, try to find a counterexample for each implication.

(a) For every integer $n \geq 1$, if n is odd, then $n^2 + 4$ is a prime number.

(b) For every positive integer n, if n is odd, then $n^3 - n$ is divisible by 4.


Perhaps, for the first implication above, “For every integer $n \geq 1$, if n is odd, then $n^2 + 4$ is a prime number,” you went through the following thought processes, playing alternately the role of Trooper Jones looking for counterexamples, and the pub's owner trying to show that there are none.

Trooper Jones: Maybe 1 is a counterexample.
Owner: No, because $1^2 + 4 = 5$, and 5 is prime.
Trooper Jones: Let's try 3.
Owner: Since $3^2 + 4$ is the prime number 13, that doesn't work.
Trooper Jones: I think 5.
Owner: Nope, $5^2 + 4$ is the prime number 29.
Trooper Jones: How about 7?
Owner: $7^2 + 4 = 53$, and 53 is prime.
Trooper Jones: I give up, maybe the implication is true.
Owner: Thank goodness!
Trooper Jones: No, wait, let's try 9.
Owner: Well, $9^2 + 4 = 85$, and 85 can be factored as $5 \cdot 17$.
Trooper Jones: So 9 is a counterexample! I knew the statement was false!

There are two lessons to be learned from this dialogue:

• Trooper Jones doesn't bother checking even numbers. Even numbers cannot be counterexamples since they don't satisfy the hypothesis of the implication.

• Just because you have not yet found a counterexample doesn't mean there isn't one, unless the domain is finite and you have checked the entire domain.
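
Trooper Jones's search can be mimicked with a short Python sketch. The function names and the search limit are hypothetical choices for this illustration. The loop steps through odd values only, since even numbers do not satisfy the hypothesis, and it stops at the first value for which the conclusion fails.

```python
def is_prime(k):
    """Trial division: True exactly when k is a prime number."""
    if k < 2:
        return False
    d = 2
    while d * d <= k:
        if k % d == 0:
            return False
        d += 1
    return True


def first_counterexample(limit):
    """Smallest odd n with 1 <= n <= limit for which n*n + 4 is not prime.

    Returns None if no counterexample exists up to the limit, which says
    nothing about values beyond the limit.
    """
    for n in range(1, limit + 1, 2):  # odd values only
        if not is_prime(n * n + 4):
            return n
    return None
```

Calling `first_counterexample(7)` finds nothing, which matches the point in the dialogue where Trooper Jones almost gives up; raising the limit to 9 or beyond turns up the counterexample.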








Table 2-2 shows an abbreviated description of the thought process involved in the second implication of Practice Problem 4, “For every positive integer n, if n is odd, then $n^3 - n$ is divisible by 4.” The first column represents the choice of n made by Trooper Jones, and the third column shows the owner checking the truth of the conclusion for this value of n.

n | $n^3 - n$ | Divisible by 4?
1 | 0         | Yes since $0 = 4 \cdot 0$
3 | 24        | Yes since $24 = 4 \cdot 6$
5 | 120       | Yes since $120 = 4 \cdot 30$
7 | 336       | Yes since $336 = 4 \cdot 84$
9 | 720       | Yes since $720 = 4 \cdot 180$

Table 2-2 Summary Analysis of Practice Problem 4

Once again, the first few chosen numbers satisfying the hypothesis are not counterexamples, but how do we know that in this case, unlike the first practice problem statement, Trooper Jones will never find a counterexample? To form a proof of the implication, the owner would somehow have to convince Trooper Jones that, no matter what number she chooses, it will not be a counterexample. Since she only chooses numbers that satisfy the hypothesis, this amounts to demonstrating that

No matter what number is chosen satisfying the hypothesis, it is guaranteed to also satisfy the conclusion.

In the remainder of this section, you will develop your ability to write arguments that will convince even the most skeptical readers that they are wasting their time looking for a counterexample. In Exercise 4e, you will be asked to furnish the proof for the proposition “For every positive integer n, if n is odd, then $n^3 - n$ is divisible by 4.”
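
The rows of Table 2-2 can be regenerated, and the search extended much further, with a few lines of Python. The function name `failures_up_to` and the limit are hypothetical choices for this sketch. An empty list of failures is evidence, not a proof: the domain is infinite, so no finite search can rule out a counterexample.

```python
def failures_up_to(limit):
    """Odd n with 1 <= n <= limit for which n**3 - n is NOT divisible by 4."""
    return [n for n in range(1, limit + 1, 2) if (n**3 - n) % 4 != 0]


# Rebuild the rows of the table as (n, n^3 - n, quotient after dividing by 4).
table = [(n, n**3 - n, (n**3 - n) // 4) for n in (1, 3, 5, 7, 9)]

# Look for counterexamples among the odd numbers below 10,000.
failures = failures_up_to(9999)
```

The search comes back empty, which is consistent with the proposition being true but leaves the actual proof still to be written.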








Proofs as Games 


We can think of a mathematical proof as a game between the AUTHOR of the proof and the READER of the proof. Like Trooper Jones, the READER is skeptically trying to show that the statement under consideration is not true, and the AUTHOR's job is to convince the READER that he should quit trying to do this. By being convinced that he is never going to find a counterexample, the READER accepts the AUTHOR's proof.

For any “if, then” statement about an integer n, we can think of the interaction between AUTHOR and READER in terms of the following:

• READER chooses a value of n that satisfies the hypothesis.

• AUTHOR tries to demonstrate that the conclusion is true for this choice of n.

• If the conclusion is true for this choice of n, then AUTHOR is successful, and it is READER's turn again; the game continues.

• If the conclusion is false for this choice of n, then READER wins.

If the statement under consideration is true, this game will never end. But no one has that kind of time! AUTHOR will write an argument that convinces even the most stubborn READER that the game will never end. This argument is a mathematical proof.

Let's try the game with a simple proposition that comes from the sort of exploration that mathematicians are always doing for fun.

Sammy has been playing around with numbers, and has observed that the prime number 3 is followed immediately by the perfect square 4. Intrigued by this, he has tried to find another pair of consecutive integers consisting of a prime number followed by a perfect square. He has come to the conclusion that there is no other such pair, and wants to prove that he is correct.

Proposition 1 Other than 3, 4, there is no pair of consecutive integers where the first is a prime number and the second is a perfect square.




The first important step is to realize that the game cannot be played at all if AUTHOR and READER do not agree on the meaning of the terms in the statement. For this reason, mathematics books make a big deal of giving formal definitions of terms at their outset. There are many different ways to define any given term, so agreeing on a single definition is an important part of the proof process. For the given statement, there are two definitions that need to be given by the textbook so that AUTHOR and READER are using the terms in the same way.














Definition A positive integer n greater than 1 is said to be prime if it cannot be factored as $n = a \cdot b$, where both a and b are greater than 1.

Definition A perfect square is a positive integer that is equal to $z^2$ for some positive integer z.








The second observation is that our proposed game requires an “if, then” statement (implication), and Sammy's proposition is not in that form. However, after some thought Sammy is able to come up with a suitable way to rewrite his proposition.

Theorem 2 For all integers n > 4, if n is a perfect square, then n - 1 is not a prime number.

Here are some sample “plays” of the game for this statement.

R. chooses $n = 4^2 = 16$, then A. factors $15 = 3 \times 5$, so 15 is not prime
R. chooses $n = 6^2 = 36$, then A. factors $35 = 5 \times 7$, so 35 is not prime
R. chooses $n = 7^2 = 49$, then A. factors $48 = 16 \times 3$, so 48 is not prime
R. chooses $n = 10^2 = 100$, then A. factors $99 = 9 \times 11$, so 99 is not prime
R. chooses $n = 12^2 = 144$, then A. factors $143 = 11 \times 13$, so 143 is not prime
R. chooses $n = 25^2 = 625$, then A. factors $624 = 13 \times 48$, so 624 is not prime





It appears that READER cannot win this game, but he will not give up unless we (as AUTHOR) can convince him of the futility of his efforts. We need to look for reasons he should not pick a particular value of n before he even bothers to select it. If we can convince READER in advance that no matter what he picks, we will be able to respond, then we will have proved Theorem 2.

In this game, AUTHOR's response consists of showing that n - 1 is not prime, by factoring it. We should see if we can predict in advance what the factors will be. The READER is going to choose a number n = (some positive number)$^2$. Let's give a variable name to the “some positive number,” to make it easier to refer to. If we agree to call it m, then each line in the game follows the following pattern:

READER chooses $n = m^2$, then AUTHOR tries to factor $n - 1$.

Since n is the same as $m^2$, each line in the game can be seen to follow this slightly revised pattern:

READER chooses $n = m^2$, then AUTHOR tries to factor $m^2 - 1$.

Will AUTHOR always be able to do this? Perhaps you remember the rule

$m^2 - 1 = (m - 1)(m + 1)$

for factoring the difference of two squares from an algebra course.

With this realization, it appears that AUTHOR will always succeed! Now we just need to write down these observations in a note to READER so that he will stop trying to win the game.





Dear READER,

Every time you choose a perfect square (greater than 4) for n, say, $n = m^2$ (m a positive integer), I can factor $n - 1$. I know this because $n - 1$ is the same as $m^2 - 1$, which factors as $(m - 1)(m + 1)$. So as long as these two factors, $m - 1$ and $m + 1$, are both at least 2 (which they are since n > 4), this will demonstrate that $n - 1$ is not prime. So give it up and get on with your life.

Your pal,
AUTHOR

Mathematicians do not write proofs of theorems in the friendly style of a letter to READER. For example, here is a formal proof of the theorem as it might appear in a math book:




Theorem 2 (Revisited) For all integers n > 4, if n is a perfect square, then $n - 1$ is not a prime number.

PROOF Let a perfect square n > 4 be given. By the definition of a perfect square, $n = m^2$ for some positive integer m. Since n > 4, it follows that m > 2. Now the number $n - 1 = m^2 - 1$ can be factored as $(m - 1)(m + 1)$. Since m > 2, then both $m - 1$ and $m + 1$ are greater than 1, so $n - 1 = (m - 1)(m + 1)$ is a factorization of the number $n - 1$ into the product of two positive numbers, each of which is greater than 1. By the definition of prime number, it follows that the number $n - 1$ is not prime.

These two arguments, the informal and the formal, are really saying the same thing with the same logical structure. The informal proof is certainly less terse, but it includes no fewer details and no less rigorous logic.





Tracing Proofs

You might imagine READER receiving the letter from AUTHOR. It's a little abstract: it has all these variable names. So READER might attempt to understand the letter by trying it out with some particular numbers. Professional mathematicians frequently do this same thing to help them understand proofs they are encountering for the first time. The process is sometimes called tracing the proof.

Perhaps the first number READER tries is 121, which is $11^2$. READER realizes the $n$ in the letter is the 121 and the $m$ is the 11. So $n - 1$ is 120, and $(m - 1)(m + 1)$ is $(11 - 1)(11 + 1) = (10)(12)$, and sure enough this factors the 120. Table 2-3 summarizes a few more numbers READER tries out to help visualize the letter's content. This is fairly compelling evidence, since not only do we see that the number $n - 1$ will factor, we actually see how it will factor.

n   | m  | n - 1 | (m - 1)(m + 1)
9   | 3  | 8     | (3 - 1)(3 + 1) = (2)(4)
16  | 4  | 15    | (4 - 1)(4 + 1) = (3)(5)
49  | 7  | 48    | (7 - 1)(7 + 1) = (6)(8)
100 | 10 | 99    | (10 - 1)(10 + 1) = (9)(11)
144 | 12 | 143   | (12 - 1)(12 + 1) = (11)(13)
625 | 25 | 624   | (25 - 1)(25 + 1) = (24)(26)

Table 2-3 Tracing a Proof

It is important to realize that tracing a proof with particular numbers can help us understand the proof, but it does not really verify that the proof is correct. We must use our reasoning skills to think about the steps of the proof as we read. However, as we will see later in this section, the tracing process can sometimes be used to find flaws in faulty proofs.
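The tracing process described above can be sketched in a few lines of Python. This is only an illustration: the function name `trace_theorem_2` and the list of sample values are assumptions made here, not part of the text. For each perfect square it recovers $m$, forms the two factors the proof names, and checks the two claims the proof makes about them.

```python
import math

def trace_theorem_2(n):
    """Trace the proof of Theorem 2 for one perfect square n > 4.

    Returns (m, m - 1, m + 1): the square root and the two factors
    the proof produces for n - 1.
    """
    m = math.isqrt(n)
    if n <= 4 or m * m != n:
        raise ValueError("n must be a perfect square greater than 4")
    left, right = m - 1, m + 1
    # The two claims made in the proof:
    assert left * right == n - 1      # the factors multiply to n - 1
    assert left > 1 and right > 1     # both factors are greater than 1
    return m, left, right

# Sample values chosen for illustration only.
for n in (9, 16, 49, 100, 121, 144, 625):
    m, left, right = trace_theorem_2(n)
    print(f"n = {n}, m = {m}: n - 1 = {n - 1} = ({left})({right})")
```

As the surrounding text warns, passing these checks for a handful of values helps us read the proof but does not replace the general argument.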








Simple Proofs About Numbers 


Before moving on to other issues, it might help to see more examples of simple proofs. We did not look at these as our first example, because they are too easy, in the sense that what they are proving does not really seem to need proof. It is appropriate to look at them now for further practice in reading and writing formal proofs. These proofs will use familiar terminology like “even,” “odd,” and “divisible by 4,” so we can focus on the actual proof writing. However, we must still agree on the precise formal definitions, even for these easy terms we already know.














Definition

1. An integer $n$ is even if it can be written in the form $n = 2 \cdot K$ for some integer $K$, and an integer $m$ is odd if it can be written in the form $m = 2 \cdot L + 1$ for some integer $L$.

2. An integer $n$ is divisible by 4 if it can be written in the form $n = 4 \cdot M$ for some integer $M$.





These definitions are all “existence” statements. The phrase “it can be written in the form $n = 2 \cdot K$ for some integer $K$” is equivalent to the more formal phrase “there exists an integer $K$ such that $n = 2 \cdot K$.” As we learned in Section 1.4, to establish the truth of an existence statement, one simply produces a particular value in the domain that makes the predicate true.
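Because each definition is an existence statement, showing that a number is even, odd, or divisible by 4 amounts to producing a witness ($K$, $L$, or $M$). The sketch below makes that concrete. The function names are hypothetical helpers introduced for illustration, and each one returns `None` when no witness exists.

```python
def even_witness(n):
    """Return K with n == 2*K, or None if no such integer exists."""
    K = n // 2
    return K if 2 * K == n else None

def odd_witness(n):
    """Return L with n == 2*L + 1, or None if no such integer exists."""
    L = (n - 1) // 2
    return L if 2 * L + 1 == n else None

def div4_witness(n):
    """Return M with n == 4*M, or None if no such integer exists."""
    M = n // 4
    return M if 4 * M == n else None

# A few integers chosen for illustration.
for n in (72, 0, -18, 81, -15):
    print(n, even_witness(n), odd_witness(n), div4_witness(n))
```

Python's `//` rounds toward negative infinity, which is why the same formulas also work for negative integers such as $-18$ and $-15$.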

These definitions might not be the ones you would have written down if asked to describe the concepts of even and odd integers or of divisibility. For example, you might think of even as meaning that when written in our standard notation, the ones' digit is 0, 2, 4, 6, or 8. This is another perfectly valid definition of the concept of an even integer. As long as the AUTHOR and READER agree that the definition being used is valid, we may use either definition. The definition we have chosen simply makes it easier to write proofs, as we will see shortly.





Example 5 Use the definitions to show that the following statements are true.

1. 72, 0, and $-18$ are even.
2. 81 and $-15$ are odd.
3. 72 is divisible by 4.
4. For any choice of integer $n$, $4n^2 - 2n$ is even.











SOLUTION

1. 72 is even since $72 = 2 \cdot 36$, 0 is even since $0 = 2 \cdot 0$, and $-18$ is even since $-18 = 2 \cdot (-9)$.
2. 81 is odd because $81 = 2 \cdot 40 + 1$, and $-15$ is odd because $-15 = 2 \cdot (-8) + 1$.
3. Since we can write $72 = 4 \cdot 18$, this shows that 72 is divisible by 4.
4. We must find an integer $K$ such that $4n^2 - 2n$ can be written as $2 \cdot K$. Put another way, we must write $4n^2 - 2n = 2 \cdot (\text{some integer})$. Since $4n^2 - 2n = 2 \cdot (2n^2 - n)$, we need only explain how we know that $2n^2 - n$ is an integer. The answer is that $n$ and 2 are integers, and any combination of integers using addition, subtraction, and multiplication always produces another integer.


The truly skeptical reader might be upset by the last line in the solution above, since it assumes we agree on a property of numbers without proving it. This property of integer operations (called closure) can be established from the formal definition of the integers, but since we are using an informal understanding of the integers, we really cannot prove it here. Instead, we will assume that AUTHOR and READER have agreed that this property of the integers is true.








Closure Property of the Integers: Whenever the operations of addition, subtraction, or multiplication are applied to integers, the result will be an integer.


Practice Problem 5 Fill in the blanks to make each of the following statements true.

(a) 240 is even because $240 = 2 \cdot$ ____.
(b) $-79$ is odd because $-79 = 2 \cdot$ ____ $+ 1$.
(c) For any choice of integer $n$, $16n^2 - 44n + 17$ is odd because $16n^2 - 44n + 17 = 2 \cdot$ ____ $+ 1$.
(d) For any choice of integer $n$, $-8n^2 + 12n - 20$ is divisible by 4 because $-8n^2 + 12n - 20 = 4 \cdot$ ____.


Now here is a proof that uses some of these definitions.

Proposition 3 The result of summing any odd integer with any even integer is an odd integer.

PROOF Let odd integer $x$ and even integer $y$ be given. By the definition of “odd,” there is an integer $A$ such that $x = 2 \cdot A + 1$. By the definition of “even,” there is an integer $B$ such that $y = 2 \cdot B$. This means that

$$\begin{aligned} x + y &= (2 \cdot A + 1) + 2 \cdot B \\ &= 2 \cdot A + 2 \cdot B + 1 \\ &= 2 \cdot (A + B) + 1 \end{aligned}$$

Since $A + B$ is an integer, the algebraic equations above demonstrate that $x + y$ can be written as 2 times an integer plus 1, so we can conclude from the definition of “odd” that $x + y$ is odd.
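The proof of Proposition 3 is constructive: given the witnesses $A$ and $B$, it names $A + B$ as the witness for $x + y$. A minimal Python sketch of that trace follows. The function name `trace_proposition_3` is an assumption made for illustration, not something defined in the text.

```python
def trace_proposition_3(x, y):
    """Trace the proof for an odd x and an even y.

    Returns (A, B, A + B), where x = 2*A + 1, y = 2*B,
    and x + y = 2*(A + B) + 1.
    """
    if x % 2 != 1 or y % 2 != 0:
        raise ValueError("x must be odd and y must be even")
    A = (x - 1) // 2   # witness from the definition of "odd"
    B = y // 2         # witness from the definition of "even"
    # The final line of the proof, checked for these particular numbers:
    assert x + y == 2 * (A + B) + 1
    return A, B, A + B

# Sample pairs chosen for illustration.
for x, y in [(17, 12), (37, 8), (101, 14), (-17, 4), (51, 50)]:
    A, B, s = trace_proposition_3(x, y)
    print(f"x = {x}, y = {y}: A = {A}, B = {B}, x + y = 2*({s}) + 1")
```

Note that the code keeps `A` and `B` as separate names even in the case where they happen to hold the same value.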


Although we did not write this proof as a letter to READER, we can still imagine READER following along to try to understand the proof. READER's thoughts might go something like the following example.


Example 6 Trace the proof for the particular numbers $x = 17$ and $y = 12$. That is, suppose that READER chooses 17 for $x$ and 12 for $y$, and show how the proof would read for those numbers.

SOLUTION By the definition of “odd,” there is an integer $A$ (namely $A = 8$) such that $x = 2 \cdot A + 1$ (i.e., $17 = 2 \cdot 8 + 1$). By the definition of “even,” there is an integer $B$ (namely $B = 6$) such that $y = 2 \cdot B$ (i.e., $12 = 2 \cdot 6$). This means that

$$\begin{aligned} 17 + 12 &= (2 \cdot 8 + 1) + 2 \cdot 6 \\ &= 2 \cdot 8 + 2 \cdot 6 + 1 \\ &= 2 \cdot (8 + 6) + 1 \end{aligned}$$

This demonstrates that $17 + 12$ can be written as 2 times an integer plus 1, so we can conclude from the definition of “odd” that $17 + 12$ is odd.

Continuing with more examples, READER might build Table 2-4. So the proof shows exactly how $x + y$ will be written in the form of an odd number given by our formal definition of “odd.”

x   | y  | A  | B  | x + y | 2(A + B) + 1
17  | 12 | 8  | 6  | 29    | 2(14) + 1
37  | 8  | 18 | 4  | 45    | 2(22) + 1
101 | 14 | 50 | 7  | 115   | 2(57) + 1
-17 | 4  | -9 | 2  | -13   | 2(-7) + 1
51  | 50 | 25 | 25 | 101   | 2(50) + 1

Table 2-4 Tracing the Proof of Proposition 3

By the way, did you notice that we used two different variable names ($A$ and $B$)? A common mistake made by students just learning these ideas is to use the same variable name for two different things. For example, here is an incorrect version of this same proof.


Proposition 4 The result of summing any odd integer with any even integer is 
an odd integer. 





FLAWED ARGUMENT Let odd integer $x$ and even integer $y$ be given. By the definition of “odd,” there is an integer $A$ such that $x = 2 \cdot A + 1$. By the definition of “even,” there is an integer $A$ such that $y = 2 \cdot A$. This means that

$$\begin{aligned} x + y &= (2 \cdot A + 1) + 2 \cdot A \\ &= 2 \cdot A + 2 \cdot A + 1 \\ &= 2 \cdot (A + A) + 1 \end{aligned}$$

This demonstrates that $x + y$ can be written as 2 times an integer plus 1, so we can conclude from the definition of “odd” that $x + y$ is odd.





To see the problem with this proof, imagine again that READER is following along to try to understand the argument. READER's thoughts might go something like this:

I'll choose 17 for $x$ and 12 for $y$. Then the $A$ would be... well, I can't tell what it would be. It seems like it would have to be 8, because $17 = 2 \cdot 8 + 1$. But if so, the proof would say that $17 + 12 = 2 \cdot 16 + 1$, which is false. So maybe it needs to be 6 because $12 = 2 \cdot 6$. But then the proof would say that $17 + 12 = 2 \cdot 12 + 1$, which is also false. Hmmm.





Clearly, READER should be unconvinced by the argument. We summarize this important principle as follows: In any proof, do not use the same variable to represent two different things. To be more precise, we use two variables $A$ and $B$ because the two numbers might be different. Using two variables does not say that they cannot be the same. For example, when READER chose 51 and 50, both $A$ and $B$ were 25. Using two variables simply says that they are not known to be the same: they might be the same, or they might be different.
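READER's failed trace of the flawed argument can be reproduced mechanically. In the sketch below, `flawed_claim` is a hypothetical helper that tests the flawed argument's last line, $x + y = 2 \cdot (A + A) + 1$, for whichever value of $A$ we try.

```python
def flawed_claim(x, y, A):
    """Check the flawed argument's final line for a single shared A."""
    return x + y == 2 * (A + A) + 1

x, y = 17, 12
A_from_x = (x - 1) // 2   # 8, the value forced by x = 2*A + 1
A_from_y = y // 2         # 6, the value forced by y = 2*A

# Neither choice of A makes the final line true for 17 and 12.
print(flawed_claim(x, y, A_from_x))
print(flawed_claim(x, y, A_from_y))

# With 51 and 50 the two witnesses happen to coincide (both are 25),
# so the flawed line holds by coincidence.
print(flawed_claim(51, 50, 25))
```

The last call shows why a single successful trace proves nothing: the flawed line happens to hold only when the two witnesses coincide.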





Practice Problem 6 Write a proof that the sum of two even integers is even, in the form of a letter from the AUTHOR to the READER.


Here is an example that combines the even/odd concept with the concept of 
divisibility 


Proposition 5 If $n$ is even, then $n^2$ is divisible by 4.

PROOF We write this argument as an informal letter from AUTHOR to READER.

Dear READER,

You will never find a counterexample to this one. Choose any even integer and call it $n$. All I have to do is write your $n^2$ in the form $4 \cdot (\text{some integer})$. Here's how I know I'll always be able to do that. I know your $n$ can be written as $2 \cdot k$ with $k$ also an integer, by our agreement on what “even” means. When you square your $n$, you will get

$$\begin{aligned} n^2 &= (2k)^2 \\ &= 4k^2 \\ &= 4 \cdot (k^2) \end{aligned}$$

Since $k$ is an integer, so is $k^2$, and by the definition of “divisible by 4” this shows that your $n^2$ will always be divisible by 4.

Better luck next time,
AUTHOR


Finally, here is an example that illustrates a pitfall and suggests a possible remedy if you get stuck.

Proposition 6 For all integers $n$, if $n^2$ is even, then $n$ is even.

FLAWED ARGUMENT Let $n$ be any integer with $n^2$ even. We can write $n^2 = 2k$ for some integer $k$. Our goal is to write $n = 2 \cdot (\text{some integer})$, so we divide both sides by $n$, getting $n = \frac{2k}{n} = 2 \cdot \left(\frac{k}{n}\right)$. Since $\frac{k}{n}$ is an integer, this proves the result.


There is a major problem with this attempt at a proof. Although addition, subtraction, and multiplication of integers always produce integers, division does not have that closure property. There is no reason to believe the statement “$\frac{k}{n}$ is an integer.” For some integers it might be true, but for others it will not be. When we write a proof, we must be sure that each step is justified.

So once we reach “$n^2 = 2k$ for some integer $k$,” we are stuck. We need to come up with a formula for $n$, and division does not have the closure property we need. We might try taking a square root, but that is even worse. When you take the square root of an integer, the result is only rarely another integer. What can we do? One possibility is to take an alternative approach based on our earlier discussion.


Proof by Contrapositive 





The fact that an implicational statement and its contrapositive are equivalent means that we always have two possible paths to follow when attempting a proof. We can try to prove the theorem as it was originally stated, or we can rewrite the theorem as its contrapositive and prove that instead.

Since sometimes the two statements (the original and the contrapositive) seem different, writing the contrapositive often allows us to gain fresh perspective when trying to solve a problem. In some situations, a proof of the contrapositive may be easier than a proof of the original statement. Proposition 6 is a good example of this situation.





Example 7 Informally prove the statement in Proposition 6, “For all integers $n$, if $n^2$ is even, then $n$ is even,” to a stubborn READER.

SOLUTION According to the discussion above, this statement is equivalent to its contrapositive “If $n$ is not even, then $n^2$ is not even,” or “If $n$ is odd, then $n^2$ is odd.” To prove this statement, we imagine a stubborn READER repeatedly choosing odd integers and testing each to see if its square is odd. We might interrupt our READER to offer some advice:

Dear READER,

Any time you choose an odd integer $n$, we agree that $n$ can be written in the form $2k + 1$ for some integer $k$. By algebra we can see that

$$\begin{aligned} n^2 &= (2k + 1)^2 \\ &= 4k^2 + 4k + 1 \\ &= 2 \cdot (2k^2 + 2k) + 1 \end{aligned}$$

This means that $n^2$ will have the form $2 \cdot (\text{an integer}) + 1$, so $n^2$ will be odd. So there is no point searching for an odd integer whose square is even.

Hugs and kisses,
AUTHOR


As before, we can turn this into a more formal mathematical proof.

Theorem 7 For all integers $n$, if $n^2$ is even, then $n$ is even.

PROOF We prove the contrapositive statement “If $n$ is odd, then $n^2$ is odd.” Let an odd integer $n$ be given. Since $n$ is odd, there is an integer $k$ such that $n = 2k + 1$. It follows that $n^2 = 4k^2 + 4k + 1 = 2 \cdot (2k^2 + 2k) + 1$. Because $2k^2 + 2k$ is an integer, this shows that $n^2$ is odd.
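The contrapositive proof names an explicit witness, $2k^2 + 2k$, for the oddness of $n^2$. The sketch below traces it for a range of odd integers and also spot-checks the original form of the statement. The function name `odd_square_witness` and the ranges are illustrative assumptions, and the finite check is evidence only, not a proof.

```python
def odd_square_witness(n):
    """For odd n = 2k + 1, return w = 2k^2 + 2k, so that n^2 = 2*w + 1."""
    if n % 2 != 1:
        raise ValueError("n must be odd")
    k = (n - 1) // 2
    w = 2 * k * k + 2 * k
    assert n * n == 2 * w + 1   # the key line of the proof
    return w

# Trace the contrapositive for every odd integer in a small range.
for n in range(-49, 50, 2):
    odd_square_witness(n)

# Spot-check the original statement: n^2 even implies n even.
original_holds = all(n % 2 == 0 for n in range(-50, 51) if (n * n) % 2 == 0)
print(original_holds)
```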


Solutions to Practice Problems 


P —nA S AE 4p ed edt 


which is not prime 

A counterexample to the statement “If one side has a 
vowel, then the other side has an odd integer" must have 
a vowel on one side and an even integer on the other 
side. Hence, only the first and third cards (each of which 
meets one of these two conditions) need to be turned. 








3. (a) If $n^2$ is not divisible by 9, then $n$ is not divisible by 3. A counterexample for the original statement is a value of $n$ where $n$ is divisible by 3 but $n^2$ is not divisible by 9. A counterexample for the contrapositive is a value of $n$ where $n^2$ is not divisible by 9 but $n$ is divisible by 3.
(b) If $2n$ is divisible by 3, then $n$ is not prime. A counterexample for the original statement is a value of $n$ where $n$ is prime and $2n$ is divisible by 3. A counterexample for the contrapositive is a value of $n$ where $2n$ is divisible by 3 and $n$ is prime.

4. (a) 9 is a counterexample.
(b) There are no counterexamples.

5. (a) 120
(b) $-40$
(c) $8n^2 - 22n + 8$
(d) $-2n^2 + 3n - 5$

6. Dear READER:

Suppose you have two even integers that we agree to call $x$ and $y$. The definition of “even” means that you have agreed there is an integer $K$ such that $x = 2 \cdot K$ and there is an integer $L$ such that $y = 2 \cdot L$. In this case we can use algebra to see that $x + y = 2 \cdot K + 2 \cdot L = 2 \cdot (K + L)$. So the sum of $x$ and $y$ is twice an integer (namely, $K + L$), which means that $x + y$ is even, like I told you!

Love always,
AUTHOR

† (Footnote to Example 7.) If at this point you ask, “How do I know that ‘not even’ is the same as ‘odd’ for integers?” then you have the pedantic inclination to be a first-rate mathematician. It is true that our definitions of even and odd do not guarantee that they are mutually exclusive without some further explanation. That explanation will come in the form of the division theorem in Section 2.2.


Exercises for Section 2.1* 


1. Fill in the blanks in each statement to complete the true statement using the definitions from this section.
(a) 24 is even because $24 =$ ____.
(b) 123 is odd because $123 =$ ____.
(c) $-16$ is even because $-16 =$ ____.
(d) $-7$ is odd because $-7 =$ ____.
(e) $2n^2 + 4n - 6$ is even because $2n^2 + 4n - 6 =$ ____.
(f) $6n + 3$ is odd because $6n + 3 = 2 \cdot$ ____ $+ 1$.
(g) $-72$ is divisible by 6 because $-72 =$ ____.
(h) 24 is not prime because $24 =$ ____.
(i) 144 is a perfect square because $144 =$ ____.

2. Decide which of the following you believe to be true and which you believe to be false. If the statement is false, give a specific counterexample. If you believe the statement is true, provide the examples you tried as “evidence.”
(a) For each $n > 1$ and $m > 1$, if $m$ is odd and $n$ is even, then $m + n$ is divisible by 3.
(b) For each $n > 1$, if $n$ is odd and divisible by 3, then $n^2 - 1$ is divisible by 8.
(c) For every $n \ge 2$, if $n$ is divisible by 3, then $n(n + 1)(n + 2)$ is divisible by 4.
(d) For each $n > 1$, if $n$ is odd, then $4^n - 1$ is prime.
(e) For each $n > 1$, if $n$ is odd, then $2^n - 1$ is not divisible by 3.
(f) For each $n > 1$ and $m > 1$, if $m$ and $n$ are even, then $mn - 1$ is not a perfect square.
(g) For each $n \ge 1$, if $n$ is not prime, then neither is $2^n - 1$.
(h) For each $n > 1$, if $n$ is divisible by 4, then $3^n + 1$ is divisible by 4.

3. Decide which of the following you believe to be true and which you believe to be false. If the statement is false, give a specific counterexample. If you believe the statement is true, provide the examples you tried as “evidence.”
(a) For each $n \ge 1$ and $m > 1$, if $mn$ is a multiple of 4, then either $m$ or $n$ is a multiple of 4.
(b) For each $n > 1$ and $m > 1$, if $mn$ is a multiple of 3, then either $m$ or $n$ is a multiple of 3.
(c) For each $n \ge 1$ and $m > 1$, if $mn$ is a perfect square, then either $m$ or $n$ is a perfect square.
(d) For each $n \ge 1$ and $m > 1$, if $m$ is even and $n$ is odd, then either $m + n^2$ or $m^2 + n$ is prime.
(e) For each $n > 1$, if $n$ is even or divisible by 3, then $n^2 + n$ is divisible by 6.
(f) For … if $n$ is neither even nor divisible by … 1 is divisible by 6.
(g) For each $n > 1$, if $n$ is divisible by 3, then $n^2 - n + 41$ is prime.
(h) Given a positive integer $N > 10$, form the number $N'$ by removing the ones' digit from $N$ and subtracting this digit from the remaining truncated integer. (For example, if $N = 1{,}309$, then $N' = 130 - 9 = 121$.) The statement to check is: If $N'$ is divisible by 11, then $N$ is divisible by 11.
(i) Given a positive integer $N > 10$, form the number $N'$ by removing the ones' digit from $N$ and adding this digit to the remaining truncated integer. (For example, if $N = 1239$, then $N' = 123 + 9 = 132$.) The statement to check is: If $N'$ is divisible by 7, then $N$ is divisible by 7.

4. Assume that you, the AUTHOR, and the READER can only agree on the definitions given in this section and your knowledge of basic algebra. Write a letter to the READER to convince him of the truth of each of the following statements:
(a) If $n$ is even, then $3n$ is even.
(b) If $n$ is even, then $n + 8$ is even.
(c) If $n$ is even, then $n + 1$ is odd.
(d) If $n$ is odd, then $n + 1$ is even.
(e) If $n$ is odd, then $n^2 - n$ is divisible by 4.
(f) If $n$ is divisible by 4, then $n^2 - n$ is divisible by 4.
(g) If $n$ is divisible by 3, then $n^2$ is divisible by 9.

5. Fill in the missing steps in each of the following formal proofs:
(a) Proposition The sum of two odd integers is even.
Proof Let $m$ and $n$ be odd integers. This means there is an integer $K$ such that $m = 2K + 1$ and there is an integer $L$ such that $n = 2L + 1$, and so ____. Since ____ is an integer, this means that $m + n$ is even.
(b) Proposition If $n$ is even, then $n^2$ is even.
Proof Let $n$ be an even integer. This means that we can write $n =$ ____ for some integer $m$. This, in turn, means that $n^2 = 2 \cdot ($____$)$, so $n^2$ is even, because ____ is an integer.
(c) Proposition Every odd perfect square can be written in the form $4k + 1$, where $k$ is an integer.
Proof Let $s$ be an odd perfect square. So $s = n^2$ for some integer $n$, and $n^2$ is odd. By the contrapositive of the previous exercise, $n$ is odd. Since $n$ is odd, there is an integer ____ such that ____. This means $s =$ ____ $=$ ____ $= 4($____$) + 1$, where ____ is an integer, as desired.

6. Here is a proof that the sum of two even integers is even:
Proof Let even integers $x$ and $y$ be given. Then there is an integer $K$ such that $x = 2 \cdot K$, and there is an integer $L$ such that $y = 2 \cdot L$. So $x + y = 2 \cdot K + 2 \cdot L = 2 \cdot (K + L)$. Since $K + L$ is an integer, this shows that $x + y$ is even.
As we did in Example 6, trace this proof for each of the following choices of $x$ and $y$:
(a) $x = 18$, $y = 20$
(b) $x = 34$, $y = 8$
(c) $x = 18$, $y = 18$
(d) $x = 24$, $y = 40$

7. Refer to Exercise 6 above. Explain why the proof must use two variables $K$ and $L$. Does using two different letters mean that the two integers must be different, or just that they might be different?

8. Write a proof in the form of a letter from the AUTHOR to the READER of the statement “If $n$ is odd, then $3n^2 + 1$ is divisible by 4.”

9. Write a proof for each of the following statements in the form of a letter from the AUTHOR to the READER:
(a) The product of two odd integers is always odd.
(b) The product of an odd integer and an even integer is always even.
(c) If one integer is even and another is divisible by 3, then their product is divisible by 6.

10. Write a proof for each of the following statements:
(a) If $n$ is even, then the product of $n$ and its successor is even. (NOTE: The successor of an integer $n$ is the integer $n + 1$.)
(b) If $n$ is odd, then the product of $n$ and its successor is even.
(c) The product of any two consecutive integers is even. (HINT: Refer the READER to the first two parts of this problem, and explain why this proves the proposition.)

11. Change your informal proof to a formal proof for each part of Exercise 9.

12. Which of the following pairs of statements are contrapositives of one another? In each case describe the traits that a counterexample to each statement must have.
(a) i. If I root for the Braves, my brother roots for the other team.
ii. If I do not root for the Braves, my brother roots for the other team.
(b) i. If $n$ is a positive even integer, then $3^n + 1$ is divisible by 5.
ii. If $3^n$ … is an odd integer, then $n$ is not divisible by 5.
(c) i. If you do not do some math problems every night, you will not be good at mathematics.
ii. If you are good at mathematics, you do math problems every night.
(d) i. If you like computers, you would love computer science.
ii. If you do not like computers, you would not like computer science.
(e) i. If $n$ is a prime greater than 3, then $n + 1$ is not a perfect square.
ii. If $m$ is a perfect square greater than 4, then $m - 1$ is not prime.

13. For each of the following, write the contrapositive statement, and then prove the original statement by proving its contrapositive.
(a) If $m^2 + n^2 \ne 0$, then $m \ne 0$ or $n \ne 0$.
(b) If $3n$ is odd, then $n$ is odd.
(c) If $m + n$ is odd, then $m$ or $n$ must be even. (HINT: Have you already proved this?)
(d) If $mn$ is even, then $m$ or $n$ must be even. (HINT: Have you already proved this?)

14. Write a proof for each of the following statements in the form of a letter from the AUTHOR to the READER:
(a) If $n$ is an even integer greater than 2, then $2^n - 1$ is not prime.
(b) Every odd perfect square can be written in the form $8k + 1$. (HINT: Exercise 10c can help.)
(c) 7 is the only prime number that precedes a perfect cube.* (HINT: How does the polynomial $x^3 - 1$ factor?)
(d) Every even perfect square is a multiple of 4. (HINT: Use Theorem 7.)

*Some of the exercises refer to divisibility by 3, by 6, and so on. This is defined analogously to the definition of divisibility by 4.


2.2 Proofs About Numbers 








The primary goal of mathematical writing is clarity. In their profession, mathematicians have given up flowery adjectives or interesting metaphors in order to bring about a clarity in exposition as absolute as the vacuum of space. There are two side effects of this: (1) knowing standard definitions is essential for understanding, and (2) mathematics is often boring to read. No math textbook will compare a function to a summer day or a number to a bird in flight, so to understand what you are reading, you must know what the abstract objects are and be prepared to be told of their interesting properties in perhaps not-so-interesting words.

This section is intended to expose you to some of these standard mathematical definitions still within the familiar environment of integers and fractions. Along the way you will continue practicing your logical analysis of statements and your written explanations of why they are true or false. The trick will be to hold on to the basic processes we have discussed as the subject matter moves deeper into the world of abstract mathematics.














Divisibility

In the previous section, we relied entirely on our familiarity with division of integers to make sense of statements about numbers. We will now develop these ideas with slightly more abstract definitions in order to build our skills at writing coherent mathematical arguments about formally defined concepts. We begin with a general definition of divisibility.





Definition An integer $n$ is divisible by a nonzero integer $k$ if there is an integer $q$ (called the quotient) such that $n = k \cdot q$. Equivalent ways to say this are “$k$ divides $n$,” “$k$ is a factor of $n$,” or even “$n$ is a multiple of $k$.”








Proposition 1 If the integers $m$ and $n$ are both divisible by 3, then the number $m + n$ is also divisible by 3.

PROOF Let $m$ and $n$ be integers, each divisible by 3. This means that there are integers $K$ and $L$ such that $m = 3K$ and $n = 3L$. Hence $m + n = 3K + 3L = 3(K + L)$. Since $K + L$ is an integer, this shows that $m + n$ is divisible by 3.

*Definition: A number $x$ is a perfect cube if $x = y^3$ for some integer $y$.

Proposition 2 If the integers $m$ and $n$ are both divisible by 3, then the number $m \cdot n$ is divisible by 9.

PROOF Let $m$ and $n$ be integers, each divisible by 3. This means that there are integers $K$ and $L$ such that $m = 3K$ and $n = 3L$. Hence, $m \cdot n = 3K \cdot 3L = 9(K \cdot L)$. Since $K \cdot L$ is an integer, this shows that $m \cdot n$ is divisible by 9.





Not every proof has to use the definition of divisibility. It is typical in mathematics to use results that have been previously established when constructing new arguments.

Proposition 3 If the integer $n$ is divisible by 3, then the number $n^2 + 3n$ is divisible by 9.

PROOF Let an integer $n$ that is divisible by 3 be given. By Proposition 1, $n + 3$ is divisible by 3, and so by Proposition 2, $n \cdot (n + 3)$ is divisible by 9. Since $n^2 + 3n = n(n + 3)$, this establishes the fact that $n^2 + 3n$ is divisible by 9.
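The way Proposition 3 reuses Propositions 1 and 2 can be mirrored in code, where each proposition becomes a small function that turns witnesses into a new witness. This is a sketch under assumed names (`quotient`, `sum_witness`, `product_witness`, `prop3_witness`), none of which appear in the text.

```python
def quotient(n, k):
    """Return q with n == k*q, following the definition of divisibility."""
    if k == 0:
        raise ValueError("k must be nonzero")
    q, r = divmod(n, k)
    if r != 0:
        raise ValueError(f"{n} is not divisible by {k}")
    return q

def sum_witness(m, n):
    """Proposition 1: if m = 3K and n = 3L, then m + n = 3(K + L)."""
    return quotient(m, 3) + quotient(n, 3)

def product_witness(m, n):
    """Proposition 2: if m = 3K and n = 3L, then m*n = 9(K*L)."""
    return quotient(m, 3) * quotient(n, 3)

def prop3_witness(n):
    """Proposition 3: n^2 + 3n = n*(n + 3), so reuse Proposition 2.

    n + 3 is divisible by 3 by Proposition 1 (both n and 3 are).
    """
    return product_witness(n, n + 3)

for n in (3, 6, -9, 12):
    w = prop3_witness(n)
    print(f"n = {n}: n^2 + 3n = {n * n + 3 * n} = 9 * {w}")
```

The design mirrors the text: `prop3_witness` never computes a quotient by 9 directly; it relies only on the earlier results.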








Practice Problem 1 Emulate Propositions 1 and 2 to write proofs of the following two statements:
(a) If $m$ and $n$ are divisible by 4, then $m + n$ is divisible by 4.
(b) If $m$ is divisible by 4 and $n$ is any even integer, then $m \cdot n$ is divisible by 8.





Another habit of mathematicians is to generalize results so that they apply to many situations. This practice makes the mathematical results more universally applicable, but at the same time the results become more abstract and hence sometimes harder to understand. For example, Proposition 1 can be generalized as follows.








Proposition 4 For any nonzero integer $d$, if the integers $m$ and $n$ are both divisible by $d$, then the number $m + n$ is also divisible by $d$.

Compare the following proof of this proposition to the proof of Proposition 1. The only difference is that instead of using the particular integer 3, the proof uses the variable $d$ that can stand for any nonzero integer.

PROOF Let the integer $d \ne 0$ be given, and let $m$ and $n$ be integers, each divisible by $d$. This means that there are integers $K$ and $L$ such that $m = d \cdot K$ and $n = d \cdot L$. Hence, $m + n = d \cdot K + d \cdot L = d \cdot (K + L)$. Since $K + L$ is an integer, this shows that $m + n$ is also divisible by $d$.


Rational Numbers 


In the proofs of the previous section, we mentioned the closure property of the set of integers with respect to the operations of addition, subtraction, and multiplication. This means that if we start with integers and use only those operations, we will always get integer results. As soon as we allow division, however, we have to acknowledge that the set is no longer closed. For example, the result of dividing the integer 5 by the integer 8 certainly exists, but it is not an integer. We will refer to such fractional numbers as rational numbers.
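The contrast between the closed operations and division can be observed with Python's standard `fractions.Fraction` type. In this sketch, `is_integer` is a hypothetical helper and the sample range is arbitrary. The checks illustrate the closure property on a finite sample and do not prove it.

```python
from fractions import Fraction
import operator

def is_integer(q):
    """A fraction is an integer exactly when its reduced denominator is 1."""
    return Fraction(q).denominator == 1

# A small, arbitrary sample of integer pairs.
pairs = [(a, b) for a in range(-6, 7) for b in range(-6, 7)]

# Addition, subtraction, and multiplication stay inside the integers here.
for op in (operator.add, operator.sub, operator.mul):
    assert all(is_integer(op(a, b)) for a, b in pairs)

# Division does not: collect the pairs whose quotient is not an integer.
non_integer_quotients = [
    (a, b) for a, b in pairs if b != 0 and not is_integer(Fraction(a, b))
]
print(len(non_integer_quotients) > 0)
print(Fraction(5, 8), is_integer(Fraction(5, 8)))
```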
















Definition

1. A real number $r$ is rational if there exist integers $a$ and $b$ ($b \ne 0$) with $r = \frac{a}{b}$. Rational numbers (also called fractions) can be expressed in many equivalent ways (e.g., $\frac{1}{2} = \frac{2}{4} = \frac{3}{6} = \cdots$). It is always possible to choose the integers $a$ and $b$ with no common divisors* greater than 1.

2. A real number is irrational if it is not rational.














Proofs about rational numbers are often similar to the proofs we have been writing in the sense that the result you are trying to establish can be viewed as an “existence proof.” For example, to show that a particular number is even, you must show that there exists an integer $k$ such that your number is equal to $2 \cdot k$. Put another way, you must show that your number can be written as $2 \cdot (\text{some integer})$. Similarly, to show that a particular number is a rational number, you must show that it can be written as $\frac{\text{some integer}}{\text{some nonzero integer}}$. Let us look at an example of a proof of this type using the formal definition of “rational.”


Proposition 5 For any rational number $r$, the number $r + 1$ is also rational.

PROOF Let $r$ be a rational number. Then we may write $r = \frac{a}{b}$ for some integers $a$ and $b$ with $b \ne 0$. It follows that

$$r + 1 = \frac{a}{b} + 1 = \frac{a}{b} + \frac{b}{b} = \frac{a + b}{b}$$

Since $a + b$ and $b$ are integers and $b \ne 0$, by the definition of rational numbers this shows that $r + 1$ is a rational number.


The style of this proof is similar to those we have seen before. We use the hypothesis and the definition of rational numbers to write $r$ in “fraction form” as $\frac{a}{b}$. Our goal is to use this to show that there exist integers $c$ and $d$ with the property that $r + 1 = \frac{c}{d}$. The proof simply shows that the integers $c = a + b$ and $d = b$ satisfy the conditions.

The set of rational numbers has all the familiar closure properties (for addition, subtraction, and multiplication) enjoyed by the set of integers as well as some additional closure properties. For example, the average of two rational numbers is rational, as is the quotient provided the denominator is not zero. These properties will be developed in the exercises, using proofs that are similar to the one given in Proposition 5.
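The existence proof in Proposition 5 produces the integers $c = a + b$ and $d = b$. A small sketch of that construction follows, with `plus_one_witness` as an assumed name and `fractions.Fraction` used only to confirm the arithmetic.

```python
from fractions import Fraction

def plus_one_witness(a, b):
    """Given r = a/b with b != 0, return integers (c, d) with r + 1 = c/d.

    Follows the proof of Proposition 5: c = a + b and d = b.
    """
    if b == 0:
        raise ValueError("b must be nonzero")
    c, d = a + b, b
    # Confirm that c/d really equals a/b + 1.
    assert Fraction(a, b) + 1 == Fraction(c, d)
    return c, d

for a, b in [(5, 8), (-3, 4), (7, -2), (0, 5)]:
    c, d = plus_one_witness(a, b)
    print(f"{a}/{b} + 1 = {c}/{d}")
```

The witness pair is not required to be in lowest terms; the definition of “rational” only asks that some pair of integers with a nonzero denominator exists.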

As in Section 2.1, when faced with a statement to prove, we are free to prove the contrapositive statement instead. It might be easier to do this if the conclusion of the original implication is a negative condition, like the property that a number is irrational. In general, it is easier to prove a number has a particular form than to prove a number does not have that form. With this in mind, think about the contrapositive statement in completing the following problem.

*Two integers with no common divisor greater than 1 are said to be relatively prime.





Practice Problem 2 Prove that for any real number $x$, if $2x$ is irrational, then $x$ is also irrational.


Proving by Cases 


We continue the discussion of proofs about properties of integers by using an argument structure called “proof by cases.” This technique is logically unremarkable: the READER agrees that one of several cases must hold and then the AUTHOR illustrates that in each one of these cases the same conclusion follows. Here are two examples that use the fact that every integer must be either even or odd.








Proposition 6 — For any integer n, nè + n is even. 


PROOF (Informal version) 


Dear Reaven, 


Any time you choose an integer m, we both know it will either be even or 
‘odd, The proof is slightly different for those two situations, so I'll address them 
one at a time in separate paragraphs. 

If your number is even, then I can write n = 2. L for some 
this, I can substitute and do a little algebra to get 


+2L =2(2L? +L) 





nteger L. From 





m+n = QLY + QL) - 4L 





Since you will agree 2L? -- L is an integer (adding and multiplying integers 
always gives an integer), this shows that n +n is even, but only for this case 
where your choice of n is even. 

Perhaps the number you have chosen is odd. If so, then I can write
n = 2·M + 1 for some integer M. In this case, I can also substitute and do
a little algebra to get

    n^2 + n = (2M + 1)^2 + (2M + 1)
            = 4M^2 + 4M + 1 + 2M + 1
            = 4M^2 + 6M + 2
            = 2(2M^2 + 3M + 1)

Since 2M^2 + 3M + 1 is an integer, this demonstrates that n^2 + n is also even in
the event that your choice of n is odd.

So you can see that no matter how you choose your integer n, whether it
is even or odd, I'll be able to demonstrate that n^2 + n is even.





Your good friend, 
AUTHOR


PROOF (Formal version) Let n be given. Since every integer is either even or
odd, we have two cases to consider for n:

• Case 1: Suppose n is even. Then n = 2·L for some integer L, and

    n^2 + n = (2L)^2 + (2L)
            = 4L^2 + 2L = 2(2L^2 + L)

  Since 2L^2 + L is an integer, we have that n^2 + n is even in this case.
• Case 2: Suppose n is odd. Then n = 2·M + 1 for some integer M, and

    n^2 + n = (2M + 1)^2 + (2M + 1)
            = 4M^2 + 4M + 1 + 2M + 1
            = 4M^2 + 6M + 2 = 2(2M^2 + 3M + 1)

  Since 2M^2 + 3M + 1 is an integer, n^2 + n is even in this case.

Thus, in either case we have shown that n^2 + n is even. ∎
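The two cases of this proof translate directly into a small computation. The following Python sketch is only an illustration, not part of the proof: the function name `even_witness` and the tested range are assumptions made here. For each n it returns the integer that the proof multiplies by 2.

```python
def even_witness(n):
    """Return the integer w with n^2 + n = 2*w, following the two cases of the proof."""
    if n % 2 == 0:
        # Case 1: n = 2L, so n^2 + n = 2(2L^2 + L)
        L = n // 2
        return 2 * L * L + L
    else:
        # Case 2: n = 2M + 1, so n^2 + n = 2(2M^2 + 3M + 1)
        M = (n - 1) // 2
        return 2 * M * M + 3 * M + 1


# Spot-check a range of integers (a finite check, not a proof).
for n in range(-50, 51):
    assert n * n + n == 2 * even_witness(n)
```

Checking finitely many values cannot replace the argument above, but it is a quick way to catch an algebra slip in either case.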


In the preceding proof, we used two different variables L and M, but we could
have used the same variable. Because the two cases are really separate little “proofs
within a proof,” they are completely independent of each other. The next example
illustrates the alternative approach. Notice that for any particular integer n chosen
by the READER, only one of the cases will apply, so as one reads through the proof,
there will only be one value given to k even though the letter appears in two different
parts of the proof. Exercise 5 looks at this issue in more detail.





Proposition 7 Every perfect square is either a multiple of 4 or of the form 4q + 1
for some integer q.


PROOF Let a perfect square n be given. We will give the name m to the integer
that has m^2 = n. We know that every integer is either even or odd, so we have two
cases to consider:

• Case 1: Suppose m is even. Then m = 2k for some integer k, and hence

    n = m^2 = (2k)^2 = 4k^2

  Since k^2 is an integer, this establishes that n is a multiple of 4 in this case.
• Case 2: Suppose m is odd. Then m = 2k + 1 for some integer k, and hence

    n = m^2 = (2k + 1)^2 = 4k^2 + 4k + 1 = 4(k^2 + k) + 1

  establishing that n is of the form 4·q + 1, where in this case, q = k^2 + k. ∎





In the above argument we stated matter-of-factly that “we know every integer is
either even or odd.” While this remark most likely did not elicit any gasps of amazement,
it is worth stopping for a moment to think about what that statement means. It
has to do with the way we prefer to define even and odd as well as the way we learn
to do division in elementary school. Specifically, whenever you divide an integer n by
2, you get some quotient q and a remainder r that is either 0 or 1.

We can state this more formally by writing that every integer n can be written
in the form

    n = 2·q + r

for some integer q and where r must be 0 or 1. In other words, either n = 2·q or
n = 2·q + 1 for some integer q. This is precisely what we mean when we say, "We
know that every integer is either even or odd."

This way of thinking about even and odd numbers explains why we have resisted
the temptation to simply define "odd" as meaning "not even," or vice versa. We have
seen the benefits of this point of view in writing proofs: if we know a number is odd,
it is more useful to be able to say something positive about it (e.g., "it can be written as
2k + 1") rather than something negative about it (e.g., "it cannot be written as 2k").
But the most important thing about this point of view about even and odd numbers
is that it can be generalized for any divisor. Specifically,




















Whenever you divide an integer n by a positive integer d, you get a unique
integer quotient q and a unique remainder r from the set {0, 1, ..., d - 1}.





Example 1 Investigate the above statement for various integers n divided by
d = 5.
SOLUTION In Table 2-5, we give the quotient and remainder, as well as
the multiplicative way to write the relationship, for various values of n. These
examples support the claim that when an integer is divided by 5, the only
possible remainders are 0, 1, 2, 3, and 4.














This fact, using the multiplicative style from the example to express it, is called
the division theorem. It is also called the division algorithm in some contexts. You will be
asked to prove this through exercises at appropriate times in this book. The important
thing now is to remember what it means in terms of elementary school division.








Theorem 8 (Division Theorem) For all integers a and b (with b > 0),
there is an integer q (called “the quotient when a is divided by b") and an integer
r (called “the remainder when a is divided by b") such that

    a = b·q + r   and   0 ≤ r < b

Furthermore, q and r are the only two integers satisfying both these conditions.
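To see the two conditions of the theorem at work, here is a minimal Python sketch. The function name `divide` and the sample dividends are chosen here only for illustration. It relies on Python's built-in `divmod`, which for a positive divisor returns exactly the quotient and remainder described in the theorem, and it re-checks both conditions before returning.

```python
def divide(a, b):
    """Return (q, r) with a = b*q + r and 0 <= r < b, for an integer a and b > 0."""
    if b <= 0:
        raise ValueError("the division theorem is stated here for b > 0")
    q, r = divmod(a, b)  # floored division: r is never negative when b > 0
    # Re-check the two conditions of the theorem.
    assert a == b * q + r
    assert 0 <= r < b
    return q, r


for a in (73, 8, 0, -13):
    q, r = divide(a, 5)
    print(f"{a} = 5*({q}) + {r}")
```

Note the negative dividend: -13 = 5·(-3) + 2, so the quotient is -3 and the remainder is 2, not -2 and -3.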












































n            | 1       | 23      | 49      | 0       | -13
Quotient     | 0       | 4       | 9       | 0       | -3
Remainder    | 1       | 3       | 4       | 0       | 2
Equation n = | 0·5 + 1 | 4·5 + 3 | 9·5 + 4 | 0·5 + 0 | -3·5 + 2

Table 2-5 Five Remainders when Dividing by 5




PROOF The existence of values for q and r is the focus of Exercise 14 in Section
2.4, and the uniqueness of these values is addressed in Exercises 24 and 25 at the
end of the current section and in Exercises 7 and 8 in Section 2.5. ∎


Since this is not written the way we are accustomed to seeing division with a 
remainder, it might be helpful to consider one more example before moving on to 
proofs that use this rather abstract-looking result. 





Example 2 Discuss 73 ÷ 5 in terms of quotients and remainders, and connect
the way you understand the answer to the formal statement of the division theorem.


SOLUTION In elementary school, we would have said that 73 ÷ 5 yields a
quotient of 14 with a remainder of 3. Perhaps later in our education we might
have given the answer as 14 3/5, and today we might just say the answer is 14.6.
The division theorem takes us back to our original understanding of division.

More to the point, the formal division theorem is most like the way we
would have checked our elementary school answer. To check that answer, we
would have multiplied the quotient 14 by the divisor 5 and added the remainder
3. If we got the dividend 73, we knew we had the right answer. Thus, our "check"
consists of checking that

    73 = 5·14 + 3

This is exactly the way the division theorem writes the result of dividing 73 by
5.





When we use the division theorem in a proof, we almost always
have a particular divisor in mind. Propositions 6 and 7 are good examples of this. In
each of these proofs, we use cases based on whether a number is even or odd. This is
exactly the same thing as using cases for the remainder of the number when divided
by 2. When we do have a particular divisor in mind, we can think about the division
theorem in a slightly less abstract form.








Example 3 Suppose we have “division by 5” in mind, just as we did in Example
1. Then the division theorem says that for any integer a, we can find a quotient q and
a remainder r, where

    a = 5·q + r   and   r is either 0, 1, 2, 3, or 4

A less formal way to phrase this is to say that, for any integer a, we can find a quotient
q so that a can be written as one of the following:

    a = 5·q,  a = 5·q + 1,  a = 5·q + 2,  a = 5·q + 3,  or  a = 5·q + 4





Here is an example of how one might use the division theorem in a proof in the
same way we used our understanding of evens and odds in the proofs of Propositions
6 and 7.


Proposition 9 If n is any integer not divisible by 5, then n has a square that is of
either the form 5k + 1 or 5k + 4. (For example, 13^2 = 5·33 + 4 and 9^2 = 5·16 + 1.)







PROOF Let an integer n not divisible by 5 be given. By the division theorem,
when n is divided by 5, it leaves a remainder of 0, 1, 2, 3, or 4. But since n is not
divisible by 5, we know the remainder must be 1, 2, 3, or 4. That is, one of the
following cases must be true:

• Case 1: It might be that n = 5q + 1 for some integer q. In this case, n^2 = 25q^2 +
  10q + 1 = 5·(5q^2 + 2q) + 1 = 5·(an integer) + 1.

• Case 2: It might be that n = 5q + 2 for some integer q. In this case, n^2 = 25q^2 +
  20q + 4 = 5·(5q^2 + 4q) + 4 = 5·(an integer) + 4.

• Case 3: It might be that n = 5q + 3 for some integer q. In this case, n^2 = 25q^2 +
  30q + 9 = 5·(5q^2 + 6q + 1) + 4 = 5·(an integer) + 4.

• Case 4: It might be that n = 5q + 4 for some integer q. In this case, n^2 = 25q^2 +
  40q + 16 = 5·(5q^2 + 8q + 3) + 1 = 5·(an integer) + 1.

Therefore, n^2 is of the form 5·k + 1 or 5·k + 4 for some integer k in every
possible case. ∎
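The four cases can be spot-checked numerically. The sketch below is an illustration only; the names `square_remainders` and `residues` and the tested range are assumptions made here. It records what the square of each nonzero remainder leaves when divided by 5, then confirms the claim for a range of integers.

```python
def square_remainders(d):
    """Map each nonzero remainder r (mod d) to the remainder of r^2 (mod d)."""
    return {r: (r * r) % d for r in range(1, d)}


# One entry per case of the proof: n = 5q + r with r = 1, 2, 3, 4.
residues = square_remainders(5)
print(residues)

# Finite spot check of the proposition itself.
for n in range(-100, 101):
    if n % 5 != 0:
        assert (n * n) % 5 in (1, 4)
```

The dictionary has one entry per case, matching the remainders 1, 4, 4, 1 obtained in Cases 1 through 4.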





We conclude this section with a final word on remainders. The operation that
returns the remainder of a division problem is not only useful in mathematics proofs,
but it is also very common in computer programming, especially when a program
needs to "branch" to handle cases separately.








Definition In computer languages as well as in mathematics, the term mod
is used to describe the remainder when one integer is divided by another. Thus,
we write a mod b = r to mean that r is the remainder when a is divided by b
(i.e., a = b·q + r and 0 ≤ r < b). Some computer languages (like C, C++, and
Java, e.g.) write this operation as a%b rather than as a mod b.
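A short Python sketch of this definition follows. The function names `mod` and `truncated_remainder` are chosen here for illustration. One caution that goes beyond the definition above: for a negative dividend, the `%` operator of C (since C99), C++ (since C++11), and Java truncates the quotient toward zero and can therefore return a negative result, whereas Python's `%` agrees with the definition whenever b > 0. The second function imitates the truncating convention so the two can be compared.

```python
def mod(a, b):
    """a mod b as defined above: the r with a = b*q + r and 0 <= r < b (b > 0)."""
    return a % b  # Python floors the quotient, so 0 <= r < b when b > 0


def truncated_remainder(a, b):
    """Remainder when the quotient is truncated toward zero (the C99/Java convention)."""
    q = abs(a) // abs(b)
    if (a < 0) != (b < 0):
        q = -q
    return a - b * q


print(mod(73, 5), truncated_remainder(73, 5))    # same for nonnegative a
print(mod(-22, 6), truncated_remainder(-22, 6))  # differ for negative a
```

When translating a proof by cases into a language with the truncating convention, a common workaround is to compute `((a % b) + b) % b` so the result lands in the range 0 to b - 1.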














Example 4 Compute each of the following:

1. 73 mod 5
2. -22 mod 6
3. 8 mod 11
4. Let n be any integer. What is (4n^2 - 12n + 3) mod 4?
5. Let n be any integer. What is (4n^2 - 12n + 9) mod 4?





SOLUTION

1. 73 mod 5 = 3, since 73 = 5·14 + 3.
2. -22 mod 6 = 2, since -22 = 6·(-4) + 2.
3. 8 mod 11 = 8, since 8 = 11·(0) + 8.
4. Let n be any integer. Then (4n^2 - 12n + 3) mod 4 = 3, since 4n^2 - 12n + 3
   can be written as 4(n^2 - 3n) + 3.
5. Let n be any integer. Then (4n^2 - 12n + 9) mod 4 = 1, since

    4n^2 - 12n + 9 = 4n^2 - 12n + 8 + 1
                   = 4(n^2 - 3n + 2) + 1


Practice Problem 3


(a) For each of the following pairs of integers a and b, write a = b·q + r where
    0 ≤ r < b.
    i. a = 17, b = 3
    ii. a = 100, b = 12
    iii. a = -19, b = 4
    iv. a = 6, b = 13
(b) Calculate the following:
    i. 17 mod 3
    ii. 100 mod 12
    iii. -19 mod 4
    iv. 6 mod 13
    v. (25q^2 + 10q + 1) mod 5
    vi. (25q^2 + 30q + 9) mod 5
(c) Complete the following sentence: “The division theorem with b = 7 tells us that,
    for any integer a, we can find a quotient q so that a can be written as one of the
    following: ________.”


The mod notation is convenient for mathematicians, too. For example, if we use
this terminology, Proposition 9 can be more concisely written as “If n mod 5 ≠ 0,
then n^2 mod 5 is either 1 or 4.”

To test your understanding of the division theorem, proof by cases, and the mod
notation, try this problem before moving on to the section exercises.


Practice Problem 4 Prove that every integer not divisible by 3 has a square that
is of the form 3k + 1. Also write the statement of the theorem using mod notation.


Explore more on
the Web.


Solutions to Practice Problems


1 (a) Proof. Let integers m and n, both divisible by 4, be
      given. This means that there is an integer K such that
      m = 4K, and there is an integer L such that n = 4L.
      It follows that

          m + n = 4K + 4L = 4(K + L)

      Since K + L is an integer, the sum m + n is divisible
      by 4. ∎
  (b) Proof. Let integers m and n, with m divisible by 4
      and n even, be given. This means that there is an
      integer X such that m = 4X, and an integer Y such
      that n = 2Y. It follows that

          m·n = (4X)·(2Y) = 8·(X·Y)

      And so, since X·Y is an integer, the product m·n is
      divisible by 8. ∎
2 Proof. We prove the contrapositive “For any real number
  x, if x is rational, then 2x is also rational.” Let rational
  number x be given and write x = a/b, where a and b are
  integers and b ≠ 0. Then

      2x = 2·(a/b) = (2a)/b

  Since 2a and b are integers and b ≠ 0, this shows that 2x
  is rational. ∎
3 (a) 17 = 3·5 + 2, 100 = 12·8 + 4, -19 = 4·(-5) + 1,
      and 6 = 13·(0) + 6
  (b) 2, 4, 1, 6, 1, and 4
  (c) a = 7·q, a = 7·q + 1, a = 7·q + 2, a = 7·q + 3,
      a = 7·q + 4, a = 7·q + 5, or a = 7·q + 6
4 Proposition If n mod 3 ≠ 0, then n^2 mod 3 = 1.

  Proof. Let n be an integer not divisible by 3. By the division
  theorem, every integer n must be of the form 3q,
  3q + 1, or 3q + 2, but since this n is not divisible by 3, we
  have just two cases to consider:

  • Case 1: If n = 3q + 1, then

        n^2 = (3q + 1)^2 = 9q^2 + 6q + 1
            = 3(3q^2 + 2q) + 1

    so n^2 is of the form 3·(an integer) + 1.
  • Case 2: If n = 3q + 2, then

        n^2 = (3q + 2)^2 = 9q^2 + 12q + 4
            = 3(3q^2 + 4q + 1) + 1

    so n^2 is of the form 3·(an integer) + 1. ∎


Exercises for Section 2.2


1. Fill in the blanks to create multiplicative equations in
   the style of the division theorem. (Note: To be completely
   consistent with the division theorem, there is
   only one way to fill in each blank!)
   (a) 73 = 6·____ + ____
   (b) 187 = 11·____ + ____
   (c) -1,234 = 15·____ + ____
   (d) -24 = 4·____ + ____
   (e) 1,000 = 7·____ + ____
   (f) 9k + 5 = 3·____ + ____
2. Compute each of the following:
   (a) 73 mod 6
   (b) 187 mod 11
   (c) -1,234 mod 15
   (d) -24 mod 4
   (e) 1,000 mod 7
   (f) (9k + 5) mod 3
3. Compute each of the following:
   (a) 55 mod 6
   (b) 55 mod 5
   (c) 6 mod 5
   (d) -55 mod 6
   (e) (60! — 3 -2k +8) +5) mod 6
   (f) (9K — 3€ — 30k 11) mod 3
   (g) 10(9k + 1) mod 9
   (h) (1(5k-+2"-")—25) mod 5. (Hint: Write 2° as
       2.201)
4. Provide a counterexample to each of the following
   statements about integers that is false. You do not need
   to prove the statements you believe to be true.
   (a) If (a%c) = (b%c), then a = b.
   (b) If (a%b) = c, then ((a + 1)%b) = c + 1.
   (c) If a < b, then (a%c) + (b%c) = ((a + b)%c).
   (d) If b < c, then (a%b) + (a%c) = (a%(b + c)).

5. For the values of n given in Table 2-6, find the values of
   m and k described in the proof of Proposition 7.

   n | 4 | 25 | 49 | 144 | 225 | 1024
   m |   |    |    |     |     |
   k |   |    |    |     |     |

   Table 2-6 Table for Exercise 5

6. Fill in the missing details in the following proofs:

   (a) Proposition If 3 divides b and b divides c, then
       3 divides c.
       Proof Let integers b and c be given, and assume
       that 3 divides b and b divides c. This means that
       b = 3·k for some integer k and c = m·b for some
       integer m.

           ________

       Since ________ is an integer, this establishes that
       3 divides c. ∎
   (b) Proposition If n^2 is even, then so is n.
       Proof of the contrapositive. Let an odd integer n be
       given. Then we may choose an integer m so that
       n = 2m + 1.

           ________

       Hence, n^2 is odd, completing the proof. ∎
   (c) Proposition If 3 divides 4^(n-1) - 1, then 3 divides
       4^n - 1.
       Proof Let the integer n be given, and assume that
       3 divides 4^(n-1) - 1. This means that 4^(n-1) - 1 = 3k for
       some integer k. So 4^(n-1) = 3k + 1, and

           4^n = 4·(________)
               = (________)
               = 3·(________) + 1

       Therefore, 4^n - 1 = ________. Since ________
       is an integer, this means that 3 divides
       ________. ∎


7. Prove each of the following propositions:
   (a) If a divides b and a divides c, then a divides b + c.
   (b) If a divides b and a divides c, then a^2 divides b·c.
   (c) If a divides b and c divides d, then ac divides bd.
   (d) If c divides a and x is any nonzero integer, then cx
       divides ax.
   (e) If 9 divides 10^(n-1) - 1, then 9 divides 10^n - 1.
   (f) If 6 divides n^3 - n, then 6 divides (n + 1)^3 - (n + 1).
       (Hint: Multiply out (n + 1)^3 - (n + 1) and simplify.)


8. Fill in the blanks to complete the proof of the following
   statement.

   Proposition The sum of two rational numbers is a rational
   number. (That is, if a and b are rational numbers,
   then a + b is a rational number.)

   Proof Let rational numbers a and b be given. Since
   ________, we know that a = x/y for some integers x
   and y with y ≠ 0. Likewise, since ________, we know
   that b = z/w for some integers z and w with w ≠ 0. From
   the rules for adding fractions, we know that

       a + b = ________ / (yw)

   We know that ________ and yw are both integers,
   and yw ≠ 0 because ________. Hence, we know that
   a + b is rational, by the definition of “rational.” ∎

9. Using Exercise 8 as a guide, prove that the difference
   between any two rational numbers is a rational number.

10. Using Exercise 8 as a guide, prove that the product of
    any two rational numbers is a rational number.

11. The average of two numbers a and b is the number (a + b)/2.
    Prove that the average of two rational numbers is a rational
    number.

12. Prove that every integer is a rational number.

13. The following propositions have to do with number se- 
quences defined recursively as we saw in Section 1.2: 
(a) Suppose the sequence a, is defined by the recur- 

rence relation a, = a, i 2n. 

i. Prove that if a_(n-1) is even, then so is a_n.

ii. Write out the first five terms of this sequence
    with the additional assumption that a_1 = 10. Is
    the statement you proved in (i) true for this
    sequence?
iii. Write out the first five terms of this sequence
    with the additional assumption that a_1 = 7. Is
    the statement you proved in (i) true for this
    sequence?








D 











(b) Suppose the sequence b, is defined by the recur- 

rence relation by = 30,., 2b, s. 

Å Prove that if. is a multiple of 3, then so is Bw- 

i, Write out the first eight terms of this sequence 
with the additional assumption that b = 0 and 
b; = 1. Is the statement you proved in (i) true 
for this sequence? 

lli. Write out the first eight terms of this sequence 
with the additional assumption that 5; = 1 and 
b; = 2. Is the statement you proved in (i) true. 
for this sequence? 

(©) Suppose the sequence c, is defined by the recur- 

rence relation c, = c,., 2771. 

i. Prove that if c, i <2", then c, <2" 

i. Write out the first five terms of this sequence 
with the additional assumption that c, — 0. Is. 
the statement you proved in (i) true for this se- 
quence? 

fii, Write out the first five terms of this sequence 
with the additional assumption that c, =3. Is 
the statement you proved in (i) true for this se- 
quence? 

14. Fill in the missing details in the following proofs:

    (a) Proposition If n^2 is divisible by 3, then so is n.
        (Hint: Check each of the three possible cases for
        remainders when n is divided by 3.)

        Proof Considering the contrapositive statement,
        let an integer n be given that is not divisible by 3. By
        the division theorem, when any integer is divided by
        3, it leaves a remainder of 0, 1, or 2. That is, one of
        the following cases must be true:

        • Case 1: It might be that n = 3q for some integer
          q. However, for this particular integer n,
          we know this case does not happen, because
          ________.

        • Case 2: It might be that n = 3q + 1 for some integer
          q. In this case,

              n^2 = ________

        • Case 3: It might be that n = 3q + 2 for some integer
          q. In this case,

              n^2 = ________

        Thus, in every case that satisfies the hypothesis, we
        see that n^2 is not divisible by 3, completing the
        proof. ∎
    (b) Proposition Every prime number greater than 3
        has a square that is of the form 12k + 1. (For example,
        7^2 = 12·4 + 1 and 19^2 = 12·30 + 1.)

        Proof Let a prime number n greater than 3 be
        given. By the division theorem, when n is divided
        by 6, it leaves a remainder of 0, 1, 2, 3, 4, or 5. That
        is, one of the following cases must be true:

        • Case 1: It might be that n = 6q for some integer
          q. But we know this case does not happen for
          this n, since ________.

        • Case 2: It might be that n = 6q + 1 for some integer
          q. In this case,

              n^2 = ________
                  = 12·(________) + 1

        • Case 3: It might be that n = 6q + 2 for some integer
          q. This case does not happen for this n,
          since ________.

        • Case 4: It might be that n = 6q + 3 for some integer
          q. This case does not happen for this n,
          since ________.

        • Case 5: It might be that n = 6q + 4 for some integer
          q. This case does not happen for this n,
          since ________.

        • Case 6: It might be that n = 6q + 5 for some integer
          q. In this case,

              n^2 = ________

        Therefore, n^2 is of the form 12·(an integer) + 1 in
        the only possible cases. ∎

15. Prove that if n is divisible by 3 and n is divisible by 4,
    then n is divisible by 12. (Hint: Write n = 12q + r and
    consider the cases.)

16. Prove that if n is not divisible by 3, then n^2 + 2 is divisible
    by 3.

17. Prove that if n is even, then n^2 + 2n is divisible by 4.

18. Use the previous three exercises to prove that if n is
    even, then n^3 + 2n is divisible by 12.

19. Prove that if n^2 is divisible by 5, then n is divisible by 5.
    (Hint: Consider the contrapositive.)

20. Prove that the sum of any three consecutive perfect
    cubes is divisible by 9. (Hint: Agree with your READER
    to denote the three perfect cubes (n - 1)^3, n^3, and
    (n + 1)^3, and be prepared to use Exercise 16.)

21. Prove that no perfect square ends with the digit 2.
    (Hint: The ones' digit of a number is mathematically
    the same as the remainder when that number is divided
    by 10. So write n = 10q + r and calculate n^2.)






















22. Prove that for every integer n, n^5 - n is divisible by 5.
    (Hint: Write n = 5q + r.)
23. Prove that if 2
    then 2° mod 7 =
    (Hint: 2" = 28.23
24. Fill in the blanks in the proof below:
    Proposition If one person says they can write a number
    n in the form 5A + B, where A and B are integers
    with 0 ≤ B ≤ 4, and another person says they can write
    the same number in the form 5C + D, where C and
    D are integers with 0 ≤ D ≤ 4, show that B = D and
    A = C.
    Proof. Let A, B, C, and D be given such that
    5A + B = 5C + D with 0 ≤ B ≤ 4 and 0 ≤ D ≤ 4. Since
    ________, D - B is divisible by 5. But

        ________ ≤ D - B ≤ ________

    and ________ is the only number in this interval that
    is divisible by 5, so we can conclude that B = D. From
    this it follows that (A - C) = 0 because ________,
    from which it follows that A = C as well. ∎
25. Emulate Exercise 24 to prove the following proposition:
    Proposition If one person says they can write a number
    n in the form 7A + B, where A and B are integers
    with 0 ≤ B ≤ 6, and another person says they can write
    the same number in the form 7C + D, where C and
    D are integers with 0 ≤ D ≤ 6, show that B = D and
    A = C.
26. We discussed in this section that the rather obvious fact
    that every integer is either even or odd follows from the
    division theorem. Let's see how we can use this starting
    point to prove something less obvious.
    (a) What is the contrapositive of the statement “If n^2 is
        even, then so is n”? Prove it. (You may use the fact
        that if an integer is not even, then it must be odd.)
    (b) Prove that every perfect square is of the form 4k or
        4k + 1 for some integer k.
    (c) Prove that in Pythagorean triples a^2 + b^2 = c^2, if c
        is even, then so are both a and b.
27. If a and b are both odd integers, show that the polynomial
    x^2 + ax + b cannot be factored.
28. Prove that no perfect square is of the form 3k + 2.
29. Prove that if x is an integer greater than 5 such that
    both x - 1 and x + 1 are prime, then x is divisible by 6.
    (Note: When two consecutive odd numbers are both
    prime, they are called twin primes.)








30. Notice that each of

        1·2 + 2 = 4,  1·7 + 2 = 9,  and  2·7 + 2 = 16

    is a perfect square. That is, the product of each pair
    of integers from the set {1, 2, 7} is 2 less than a perfect
    square. Show that the set cannot be extended to
    look like {1, 2, 7, c} and still have this property. Specifically,
    you should explain why not all of 1·c + 2, 2·c + 2,
    and 7·c + 2 can be perfect squares for any value of c.
    (Hint: Consider the four possible forms c = 4·q + r
    from the division theorem, and make use of Proposition 7.)
31. The Fermat numbers are defined by the rule F_n =
    2^(2^n) + 1. Prove that for all n ≥ 0, F_(n+1) = (F_n - 1)^2 + 1.
    Explain why it follows that F_n always divides F_(n+1) - 2.
32. It has been proven (by Lagrange in 1770) that every
    positive integer can be written as a sum of four perfect
    squares. (For example, 23 = 3^2 + 3^2 + 2^2 + 1^2.) Show
    that no integer of the form 8k + 7 can be written as
    the sum of three perfect squares.


2.3 Mathematical Induction 





The word “induction” usually refers to a type of reasoning that is fundamental to 
the experimental or observational sciences. It is the inference of an event from past
events. For example, concluding that the sun will rise tomorrow, based on our ex- 
perience that it has risen every day before, is an instance of inductive reasoning. It 
cannot be proven, but it sits well with our experience. Since mathematics is an ex- 
act science, this type of reasoning is not mathematical reasoning. So what does the 
phrase “mathematical induction” mean? 

Mathematical induction does have something in common with inductive reason- 
ing in the experimental sciences. When using mathematical induction, we will, for 
example, use the fact that we know a property is true of the integers 1 through 999 to 
prove that the same property is true for the integer 1,000. In a way, this is similar to
the inductive reasoning of the experimental scientist. The big difference, however, is 
that we will prove the truth for the next integer, not merely infer it from a pattern. 

Imagine, for example, that one could conclusively prove that “if the sun rises today,
then it will rise tomorrow." Then one would have to accept the inference that the
sun will rise tomorrow without a doubt. In fact, one would then prove that the sun will
rise every day forever. This is analogous to what we would like to do in mathematics.

Inductive reasoning in mathematics is not only valid, it is quite natural. For
example, suppose that a teacher's idea of a good arithmetic assignment is to have
students sum all of the integers from 1 to 100 inclusive. Imagine that you are a student
of this teacher, but you have heard about this assignment from your friends in an
earlier class. Your friends faithfully tell you that the earlier class all arrived at the
answer of 5,050 to the sum 1 + 2 + 3 + ··· + 100. When you arrive in your class, your
stomach sinks when you see a different problem on the board:























    1 + 2 + 3 + ··· + 101 = ?


What can you do? Of course, you could just start adding up the numbers, but hopefully
you will realize that this new problem could have been written as


    1 + 2 + 3 + ··· + 100 + 101 = ?





"This result appeared as the three-page paper “The set of numbers (1, 2,7)" in the Proceedings of the 
Calcutta Mathematics Society (v.72, pp. 195-87, 1980) 




That is, the sum of all the integers from 1 to 101 is the same as the sum from 1 to 100
but with one additional integer added on at the end. Since you are sure your friends'
answer was correct, you figure that

    1 + 2 + 3 + ··· + 100 + 101 = (1 + 2 + 3 + ··· + 100) + 101
                                = (5,050) + 101
                                = 5,151





and get the correct answer, not from a clever method or lightning calculation, 
but rather from simply using relevant information that has already been carefully 
checked. In fact, presented in this way, it would seem silly not to use the previous
information to answer the question at hand. 
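The same shortcut can be written as a two-line computation. In this Python sketch (the variable names are chosen here for illustration), the trusted result for 100 is reused rather than recomputed, and the final line compares the shortcut against adding all 101 numbers directly.

```python
# The value the earlier class already verified for 1 + 2 + ... + 100.
sum_to_100 = 5050

# Reuse it: the sum through 101 is the previous sum plus one more term.
sum_to_101 = sum_to_100 + 101

# Cross-check against the slow way of adding every integer from 1 to 101.
assert sum_to_101 == sum(range(1, 102))
print(sum_to_101)
```

This reuse of an already-checked result is the core move behind mathematical induction.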








Induction as a Game 


In Section 1.2 we played with number sequences in a game of "What's next?” At
the time, we saw that some sequences had descriptions based on both recurrence
relations (i.e., recursive formulas) and closed formulas. For example, consider the
sequence given by the recurrence relation

    a_k = a_(k-1) + (2k - 1), for k > 1, and a_1 = 1

If we calculate the first few terms of this sequence, we notice a pattern:

    a_1 = 1, a_2 = 4, a_3 = 9, a_4 = 16, a_5 = 25, ...

It looks like a_n = n^2 for all n ≥ 1, but how can we be sure that this pattern
continues forever?

As with our proofs in Section 2.1, it is AUTHOR's job to convince a skeptical
READER that the pattern is true for every positive integer n. So AUTHOR builds Table
2-7 and shows it to READER.

Of course, READER is skeptical as usual and wants to play the game from Section
2.1. As before, the game consists of READER choosing an integer and AUTHOR
demonstrating the truth for that number. Now READER is no dummy, and so he does
not choose a number already in the table. Suppose READER says, “I choose n = 50.”
Because the sequence of numbers is only described recursively, AUTHOR has to work
for a while to compute a_50. The benefit is that she has to generate a_1, a_2, ..., a_49 along
the way, and so she can check every value of the sequence up to a_50 as well. In doing
this, her table now looks like Table 2-8.























n | Formula for a_n        | Value Calculated for a_n | Is a_n = n^2?
1 | a_1 = 1                | 1                        | Yes
2 | a_1 + (2·2 - 1)        | 4                        | Yes
3 | a_2 + (2·3 - 1)        | 9                        | Yes
4 | a_3 + (2·4 - 1)        | 16                       | Yes
5 | a_4 + (2·5 - 1)        | 25                       | Yes

Table 2-7 Checking the First Few Values











n  | Formula for a_n    | Value Calculated for a_n | n^2  | Is a_n = n^2?
1  | 1                  | 1                        | 1    | yes
2  | a_1 + (2·2 - 1)    | 1 + 3 = 4                | 4    | yes
49 | a_48 + (2·49 - 1)  | 2304 + 97 = 2401         | 2401 | yes
50 | a_49 + (2·50 - 1)  | 2401 + 99 = 2500         | 2500 | yes

Table 2-8 After the First 50 Values
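AUTHOR's bookkeeping for these tables can be mimicked in a few lines. The Python sketch below is an illustration only; the function name `sequence_terms` is an assumption made here. It builds each term from the previous one, exactly as the recurrence a_k = a_(k-1) + (2k - 1) with a_1 = 1 prescribes, and compares every term with the conjectured closed formula n^2.

```python
def sequence_terms(last):
    """Return {n: a_n} for n = 1..last, using a_1 = 1 and a_k = a_(k-1) + (2k - 1)."""
    a = {1: 1}
    for k in range(2, last + 1):
        a[k] = a[k - 1] + (2 * k - 1)  # each row is built from the row before it
    return a


a = sequence_terms(50)
for n in (1, 2, 49, 50):
    print(n, a[n], n * n, a[n] == n * n)

# Every row up to 50 agrees with the closed formula.
assert all(a[n] == n * n for n in a)
```

Like the table, this only confirms the pattern up to row 50; a skeptical READER can always pick a larger n, which is why the general argument that follows is needed.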


But READER is still skeptical, and he will not give up the game. AUTHOR realizes
she must find a pattern in what she is doing if she hopes to win.

We can see the pattern if we examine the last row of our table for n = 50. It
basically states that

    a_50 = a_49 + (2·50 - 1) = 2,401 + 99 = 2,500

The “a_50 = a_49 + (2·50 - 1)" part comes from the recursive formula for the sequence,
a_k = a_(k-1) + (2·k - 1). Then the “2,401” part is simply the value calculated
for a_49 in the previous row of the table, which has already been shown to be equal to
49^2. So the formula for a_50 can really be written as

    a_50 = a_49 + (2·50 - 1) = 49^2 + (2·50 - 1)

This can be generalized as

    a_m = a_(m-1) + (2·m - 1) = (m - 1)^2 + (2·m - 1)

We can see this statement is valid for the same reason the statement about a_50 was
valid. The first step, a_m = a_(m-1) + (2·m - 1), is simply the recursive definition of the
sequence. For the second step, we simply replace a_(m-1) with (m - 1)^2 because we know
that by the time we get to the step that does the calculation for a_m, we will already
have done the calculation and verified the proposed closed formula for everything
in the sequence from a_1 to a_(m-1). AUTHOR is ready to write her letter.





Dear READER,

You have already seen that my guess for a closed formula, a_n = n^2, is correct
for n = 1, 2, 3, 4, and 5. You should feel free to check it for as many more values
of n as you wish. Let m be the number of the first row that has not yet been
checked. Table 2-9 shows the m-th row and the one just before it. Keep in mind
that the (m - 1)-th row is the last one that you did check. To compute the value
of a_m, we must use the recursive description, a_m = a_(m-1) + (2·m - 1). Since we
are now filling in row m, we already know that a_(m-1) = (m - 1)^2. Substituting
this into the formula for a_m, we get

    a_m = (m - 1)^2 + (2·m - 1)

and a little algebra will show that (m - 1)^2 + (2·m - 1) simplifies to (m^2 -
2m + 1) + (2m - 1) = m^2. This means that a_m = m^2 must be true, so we have
completed row m of our table.






















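n     | Formula for a_n                       | a_n simplified | n^2       | Is a_n = n^2?
1     | a_1 = 1                               | 1              | 1         | Yes
2     | a_2 = a_1 + (2·2 - 1)                 | 1 + 3 = 4      | 4         | Yes
3     | a_3 = a_2 + (2·3 - 1)                 | 4 + 5 = 9      | 9         | Yes
m - 1 | a_(m-1) = a_(m-2) + (2·(m - 1) - 1)   | (m - 1)^2      | (m - 1)^2 | Yes
m     | a_m = a_(m-1) + (2·m - 1)             | ?              | m^2       | ?

Table 2-9 Illustrating the General Case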


I know this is a little more complicated than the letters I've written for
our earlier games, but this argument should convince you that no matter how
many statements you check, the very next statement must also be true. A little
thought will tell you that since the first five statements were explicitly checked,
this means that all the statements must be true.

Your exhausted friend,
AUTHOR
Apro wersion Practice Problem 1 Try out Aurnow’s reasoning in this proof by answering the 
the We following questions 


(a) Here is the row in the table for n = 29: 











n Formula fora, | Value Calculated fora, | m 
2 | ay =ay+(2-29~1) E | a9 


Perform the calculations indicated in Avrwow's letter for m = 30. Does the result 
verify that ayy = 307? 


(b) Here is the row in the table for $n = 73$:

n | Formula for $a_n$ | Value calculated for $a_n$ | $n^2$
73 | $a_{73} = a_{72} + (2 \cdot 73 - 1)$ | $5329$ | $5329$

Perform the calculations indicated in AUTHOR's letter for $m = 74$. Does the result verify that $a_{74} = 74^2$?


(c) Imagine a different sequence with the recursive definition

$$a_n = a_{n-1} + 2 \cdot n, \quad \text{with } a_1 = 2$$

AUTHOR sets up Table 2-10 for this sequence to investigate her suspicion that the sequence really has the closed formula $a_n = n(n+1)$.

Rewrite the paragraph of AUTHOR's letter that begins "To compute the value of $a_m$ ..." so it applies to this different sequence.
















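n | Formula for $a_n$ | Value calculated for $a_n$ | $n(n+1)$ | Is $a_n = n(n+1)$?
1 | $a_1 = 2$ | $2$ | $1 \cdot 2$ | Yes
2 | $a_2 = a_1 + 2 \cdot 2$ | $2 + 4$ | $2 \cdot 3$ | Yes
$m-1$ | $a_{m-1} = a_{m-2} + 2 \cdot (m-1)$ | $(m-1)m$ | $(m-1)m$ | Yes
$m$ | $a_m = a_{m-1} + 2 \cdot m$ | ? | $m(m+1)$ | ?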











Table 2-10 Table for Practice Problem 1 


Closed Formulas for Sums 


Although READER is stubborn, in the face of flawless logic he is a good sport. He admits that there is no counterexample, and he is ready for a new challenge. The first example we gave to illustrate inductive thinking involved the sum of the numbers

$$1 + 2 + 3 + \cdots + n$$

This can be equivalently expressed using the sigma notation introduced in Section 1.2 as

$$\sum_{i=1}^{n} i = 1 + 2 + 3 + \cdots + n$$

Now suppose that AUTHOR has a friend who has taken this class before who tells her a shortcut formula for computing this sum:

$$\frac{n \cdot (n+1)}{2}$$

This seems too good to be true, so fresh from her victory in the previous game, AUTHOR sets up a similar table and begins to explore. She immediately realizes that each new sum can be calculated from the sum in the previous row, and uses parentheses to indicate this reasoning. Here is this table through row 34:






















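n | Sum $(= 1 + 2 + \cdots + n)$ | Simplified sum | $\frac{n(n+1)}{2}$ | Is sum $= \frac{n(n+1)}{2}$?
1 | $1$ | $1$ | $\frac{1 \cdot 2}{2} = 1$ | Yes
2 | $1 + 2$ | $1 + 2 = 3$ | $\frac{2 \cdot 3}{2} = 3$ | Yes
3 | $(1 + 2) + 3$ | $3 + 3 = 6$ | $\frac{3 \cdot 4}{2} = 6$ | Yes
4 | $(1 + 2 + 3) + 4$ | $6 + 4 = 10$ | $\frac{4 \cdot 5}{2} = 10$ | Yes
5 | $(1 + 2 + 3 + 4) + 5$ | $10 + 5 = 15$ | $\frac{5 \cdot 6}{2} = 15$ | Yes
34 | $(1 + 2 + \cdots + 33) + 34$ | $561 + 34 = 595$ | $\frac{34 \cdot 35}{2} = 595$ | Yes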







Practice Problem 2 Answer the following questions about subsequent rows of this table:





(a) What will row 35 of the table above look like? 


(b) Assuming that the pattern continues, row 66 will be as shown below. What will 
row 67 look like? 











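n | Sum $(= 1 + 2 + \cdots + n)$ | Simplified sum | $\frac{n(n+1)}{2}$ | Is sum $= \frac{n(n+1)}{2}$?
66 | $(1 + 2 + \cdots + 65) + 66$ | $2145 + 66 = 2211$ | $\frac{66 \cdot 67}{2} = 2211$ | Yes

As before, AUTHOR looks for a pattern in what she is doing. She visualizes two rows, the last one that she has already checked, and the next one to be checked. Once again, she labels the one she is about to check as row $m$:

n | Sum $(= 1 + 2 + \cdots + n)$ | Simplified sum | $\frac{n(n+1)}{2}$ | Is sum $= \frac{n(n+1)}{2}$?
$m-1$ | $1 + 2 + \cdots + (m-1)$ | $\frac{(m-1)m}{2}$ | $\frac{(m-1)m}{2}$ | Yes
$m$ | $(1 + 2 + \cdots + (m-1)) + m$ | ? | $\frac{m(m+1)}{2}$ | ?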





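Now she uses the fact that the sum up to $m$ is the same as the sum to $m - 1$ plus one more number ($m$), and the fact that row $m - 1$ has already been checked, to write

$$\begin{aligned}
1 + 2 + \cdots + m &= (1 + 2 + \cdots + (m-1)) + m \\
&= \frac{(m-1) \cdot m}{2} + m && \text{(from row } m-1\text{)} \\
&= \frac{(m-1) \cdot m + 2m}{2} && \text{(common denominator)} \\
&= \frac{m \cdot ((m-1) + 2)}{2} && \text{(factor out } m\text{)} \\
&= \frac{m \cdot (m+1)}{2}
\end{aligned}$$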





She is now ready to write her letter, which will have to wait until a bit later in 
the section. However, before we move on, we should check some examples to see 
that her logic is sound for any possible positive integer. 





Example 1 Apply AUTHOR's logic to these specific situations.

1. Having checked rows 1 to 97, and in particular knowing that $1 + 2 + \cdots + 97 = \frac{97 \cdot 98}{2}$, verify that the conclusion is true for $m = 98$. That is, verify that $1 + 2 + \cdots + 98 = \frac{98 \cdot 99}{2}$.










2. Do the same for $m = 146$. That is, use the fact that $1 + 2 + \cdots + 145 = \frac{145 \cdot 146}{2}$ to verify that $1 + 2 + \cdots + 146 = \frac{146 \cdot 147}{2}$.








3. Do the same for m = 1341. 
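SOLUTION

1. $1 + 2 + \cdots + 98 = (1 + 2 + \cdots + 97) + 98 = \frac{97 \cdot 98}{2} + 98 = \frac{97 \cdot 98 + 2 \cdot 98}{2} = \frac{98 \cdot 99}{2}$.

2. $1 + 2 + \cdots + 146 = (1 + 2 + \cdots + 145) + 146 = \frac{145 \cdot 146}{2} + 146 = \frac{145 \cdot 146 + 2 \cdot 146}{2} = \frac{146 \cdot 147}{2}$.

3. $1 + 2 + \cdots + 1{,}341 = (1 + 2 + \cdots + 1{,}340) + 1{,}341 = \frac{1340 \cdot 1341}{2} + 1{,}341 = \frac{1340 \cdot 1341 + 2 \cdot 1341}{2} = \frac{1341 \cdot 1342}{2}$.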
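The three calculations in Example 1 all follow one pattern, which can be sketched in a few lines of Python. This is only an illustration: the function names `triangular` and `step_from_previous_row` are invented here, and checking particular values of m is a sanity check, not a substitute for the general argument.

```python
def triangular(n: int) -> int:
    """Closed formula n(n+1)/2 for the sum 1 + 2 + ... + n."""
    return n * (n + 1) // 2


def step_from_previous_row(m: int) -> int:
    """Value of row m, computed from the already checked row m - 1 by adding m."""
    return triangular(m - 1) + m


# Repeat the three checks from Example 1.
for m in (98, 146, 1341):
    agrees = step_from_previous_row(m) == triangular(m)
    print(m, step_from_previous_row(m), triangular(m), agrees)
```

Each line of output compares the value built from row m - 1 with the closed formula for row m.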
Formal Proofs by Mathematical Induction 


Now that we have informally discussed this form of reasoning, let us move toward 
formalizing the underlying principle as we will use it in our proofs. First of all, there 
are specific types of statements to which this form of reasoning can be applied. 
The underlying issue is that there must be some way to order an infinite family of statements so that there is a first statement, a second statement, and so forth and so that for every statement proved, there is a next statement to consider. We can formally guarantee this by insisting that our statements be given in a way that makes it easy to "index" them with positive integers.

















Definition A statement about the positive integers is a predicate $P(n)$ with the set of positive integers as its domain. That is, when any positive integer is substituted for $n$ in statement $P(n)$, the result is a proposition that is unambiguously either true or false.





For example, "$n$ is even," "$2^n - 1$ is prime," and "every single-elimination tournament with $n$ players must have $n - 1$ games" are all statements about the positive integers. On the other hand, "$n^2 + 1$," for example, is not a statement: when one substitutes 7 for $n$ one gets "50," which is, of course, a number and neither true nor false.


Example 2 Which of the following are statements about the positive integers, according to the definition given above?

1. $n^2 + n$ is even.
2. $100 - n$
3. $100 - n > 83$
4. John has fewer than $n$ apples in his refrigerator.

SOLUTION All but #2 are valid statements. When we substitute 12, for example, into "$100 - n$," we get "$100 - 12$," which is neither true nor false.







Example 3 For each of the predicates below, write the corresponding sentences when $n = 2$ and $n = 30$, and decide which are true propositions.

1. $E(n)$ is the statement "$n^2 + n$ is even."
2. $G(n)$ is the statement "$100 - n > 83$."
3. $S(n)$ is the statement "$1 + 2 + \cdots + n = \frac{n(n+1)}{2}$."

SOLUTION

1. $E(2)$ is the statement "$2^2 + 2$ is even." $E(30)$ is the statement "$30^2 + 30$ is even." Both are true statements.
2. $G(2)$ is the statement "$100 - 2 > 83$." This is true. $G(30)$ is the statement "$100 - 30 > 83$." This is false.
3. $S(2)$ is the statement "$1 + 2 = \frac{2(2+1)}{2}$." $S(30)$ is the statement "$1 + 2 + \cdots + 30 = \frac{30(30+1)}{2}$." Using our $\sum$ notation, we could also write "$\sum_{i=1}^{30} i = \frac{30(30+1)}{2}$." Both $S(2)$ and $S(30)$ are true.





Practice Problem 3 


(a) $P(n)$ is "If there are $n$ students in the class, the room will be too small." What is $P(35)$? What is $P(m-1)$?

(b) $S(n)$ is "$1 + 4 + 9 + \cdots + n^2 = \frac{n(n+1)(2n+1)}{6}$." Rewrite $S(n)$ using $\sum$ notation. Then write $S(1)$, $S(2)$, $S(3)$, and $S(4)$, and decide if they are true. Also write $S(m-1)$ and simplify it.

(c) Consider the sequence defined recursively as $a_1 = 11$, and $a_k = a_{k-1} + 4$, and let $R(n)$ be the statement "$a_n = 4n + 7$." Write $R(1)$, $R(2)$, $R(3)$, and $R(4)$ and decide if they are true. Also write $R(m-1)$ and simplify it.





The simple idea of having the statements we wish to prove given in some order 
allows us to use inductive reasoning. This form of reasoning is central to many con- 
cepts in discrete math, so it is of the utmost importance in this course that everyone 
have the correct mental picture of how inductive reasoning works. Before we give 
some examples, we will state the underlying principle. This principle is stated in the 
terminology of a generic predicate P(n) over the domain of positive integers. 











The Principle of Mathematical Induction Let $P(n)$ be a statement about the positive integers. If one can prove that

(i) $P(1)$ is true, and that

(ii) for every integer $m \ge 2$, whenever $P(1), P(2), \ldots, P(m-1)$ have all been checked to be true, it follows that $P(m)$ is true,

then we can conclude that $P(n)$ is true for every positive integer $n$.
















This precise statement might seem to be a long way from the intuition in the discussion leading up to it, but it is a very natural principle. Imagine a long line of dominoes numbered with the positive integers in sequence, and statement $P(n)$ is "Domino $n$ falls." In this analogy, condition (ii) essentially says that all the dominoes are spaced so that if all previous dominoes fall, they knock down the next one. Clearly, if that condition holds of all the dominoes and someone tips the first domino (which is condition (i)), then we can reasonably conclude that all the dominoes will fall.

Note that technically there is no reason for the first statement to correspond to $n = 1$; all that matters is that there is a first statement that is checked explicitly to be true. In reality that statement might be called $P(0)$, $P(5)$, or something else. The conclusion of mathematical induction is that every statement is true from that point on.

To illustrate the principle stated in this way, we revisit an earlier example and give a formal proof.


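Proposition 1 For every positive integer $n$,

$$\sum_{i=1}^{n} i = \frac{n \cdot (n+1)}{2}$$

PROOF (By induction on n) Let $S(n)$ be the predicate "$\sum_{i=1}^{n} i = \frac{n(n+1)}{2}$." It is easy to check that statement $S(1)$, which states "$1 = \frac{1 \cdot 2}{2}$," is true. Now let the integer $m \ge 2$ be given such that the statements $S(1), \ldots, S(m-1)$ have already been checked to be true. We now consider statement $S(m)$:

$$\begin{aligned}
\sum_{i=1}^{m} i &= 1 + 2 + \cdots + m \\
&= (1 + 2 + \cdots + (m-1)) + m \\
&= \frac{(m-1) \cdot m}{2} + m, && \text{since } S(m-1) \text{ is true} \\
&= \frac{(m-1) \cdot m}{2} + \frac{2m}{2} \\
&= \frac{m \cdot ((m-1) + 2)}{2} \\
&= \frac{m \cdot (m+1)}{2}
\end{aligned}$$

This shows that $S(m)$ is true, completing the induction.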


Practice Problem 4 Let $P(n)$ be the statement "$\sum_{i=1}^{n} 2^{i-1} = 2^n - 1$." By imitating Proposition 1, give a logically convincing reason that $P(n)$ is true for every positive integer $n$.


Our final example of this section revisits the problem on recursively defined sequences in the first letter from AUTHOR to READER in this section. We give a semiformal proof, midway between the "Letter to READER" and the terse proof of the preceding proposition.

Proposition 2 Let the sequence of numbers $a_n$ be given by the recurrence relation $a_k = a_{k-1} + (2k - 1)$ for $k \ge 2$ and $a_1 = 1$. Show that $a_n = n^2$ for all $n \ge 1$.




PROOF (By induction on n) Consider the statement "$a_n = n^2$." We are given that $a_1 = 1$. Clearly, the first statement $a_1 = 1^2$ is true. We check a few of the initial statements, then imagine we are about to check statement $m$. That is, we imagine that everything in Table 2-11 above the line has been checked. Now we simply calculate $a_m$ according to the recursive formula, using the fact that row $m - 1$ has already been checked.

$$\begin{aligned}
a_m &= a_{m-1} + (2 \cdot m - 1) \\
&= (m-1)^2 + (2m - 1), && \text{using ``} a_{m-1} = (m-1)^2 \text{''} \\
&= (m^2 - 2m + 1) + (2m - 1) \\
&= m^2 - 2m + 1 + 2m - 1 \\
&= m^2
\end{aligned}$$

We therefore have shown that $a_m = m^2$, which is just what we need to complete row $m$ of our table. We have thus shown how knowing the truth of statements up to "$a_{m-1} = (m-1)^2$" (and a little algebra) leads to knowing the truth of the next statement, "$a_m = m^2$." This establishes the truth of all statements "$a_n = n^2$" for all $n \ge 1$ by mathematical induction.








Mathematical induction gives us a rigorous way to show that a number sequence, for which we know a recursive description, has a particular closed description as well. See Exercise 3 for some practice with this idea.
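Before attempting an induction proof of this kind, it can help to compare the recursive description with the guessed closed formula on a computer. The sketch below does this for the sequence of Proposition 2; the names `a_recursive` and `first_mismatch` are invented for illustration. Agreement on finitely many values only supports the guess; the induction argument is still what proves it for every n.

```python
def a_recursive(n: int) -> int:
    """Compute a_n from a_1 = 1 and a_k = a_(k-1) + (2k - 1) for k >= 2."""
    a = 1  # a_1
    for k in range(2, n + 1):
        a = a + (2 * k - 1)
    return a


def first_mismatch(limit: int):
    """Return the first n <= limit where a_n differs from n^2, or None if there is none."""
    for n in range(1, limit + 1):
        if a_recursive(n) != n * n:
            return n
    return None


print([a_recursive(n) for n in range(1, 6)])
print(first_mismatch(200))
```

The same two functions can be adapted to any of the sequences in Exercise 3 by changing the starting value, the recurrence, and the guessed formula.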





Practice Problem 5 Let the sequence of numbers $a_n$ be given by the recurrence relation $a_k = a_{k-1} + 2k$ for $k \ge 2$ and $a_1 = 2$. Show that $a_n = n(n+1)$ for all $n \ge 1$.


n | Formula for $a_n$ | "$a_n = n^2$" | True?
1 | $a_1 = 1$ | $1 = 1^2$ | Yes
2 | $a_2 = a_1 + (2 \cdot 2 - 1)$ | $1 + 3 = 4$ | Yes
3 | $a_3 = a_2 + (2 \cdot 3 - 1)$ | $4 + 5 = 9$ | Yes
4 | $a_4 = a_3 + (2 \cdot 4 - 1)$ | $9 + 7 = 16$ | Yes
5 | $a_5 = a_4 + (2 \cdot 5 - 1)$ | $16 + 9 = 25$ | Yes
and so on, up to
$m-1$ | $a_{m-1} = a_{m-2} + (2 \cdot (m-1) - 1)$ | $a_{m-1} = (m-1)^2$ | Yes
(line)
$m$ | $a_m = a_{m-1} + (2 \cdot m - 1)$ | $a_m = m^2$ | ?

Table 2-11 Proof of Proposition 2.


Strong Versus Weak Induction


According to many mathematics textbooks, the form of induction we have presented in this section is called strong induction because we are explicitly imagining having all previous statements $P(1), P(2), \ldots, P(m-1)$ available for use when we are ready to prove statement $P(m)$. In our actual sample proofs, you might have noticed that even though we had all the previous statements available, we only used the immediately prior statement $P(m-1)$. In the proof, if we had assumed only that statement $P(m-1)$ had been checked, then the proof would still be valid, but some math textbooks would consider this a use of weak induction. These names are unfortunate since the two argument structures are equally valid. In this book, we will use only the so-called strong form of induction. This allows us to maintain our mental image of the READER checking all statements starting from the first one. In addition, we will encounter proofs (Exercise 12 in this section, e.g.) that cannot be done with the weak form, and it is simpler to consistently use the form that is more widely applicable.
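To see why some proofs need more than the immediately prior statement, consider the sequence of Exercise 12, where each term is built from the two terms before it. The sketch below computes that sequence and compares it with the closed formula stated in the exercise; the function name `a_two_term` is invented for illustration, and the numerical comparison is a check, not a proof.

```python
def a_two_term(n: int) -> int:
    """Compute a_n from a_1 = 1, a_2 = 2 and a_k = a_(k-1) + 2*a_(k-2) for k >= 3."""
    older, newer = 1, 2  # a_1 and a_2
    if n == 1:
        return older
    for _ in range(3, n + 1):
        # The next term needs BOTH previous terms, not just the latest one.
        older, newer = newer, newer + 2 * older
    return newer


print([a_two_term(n) for n in range(1, 9)])
print(all(a_two_term(n) == 2 ** (n - 1) for n in range(1, 40)))
```

Because the update line reads two earlier values, the induction step for statement m has to appeal to both statement m - 1 and statement m - 2.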


Solutions to Practice Problems 


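1. (a) $a_{30} = a_{29} + (2 \cdot 30 - 1) = 29^2 + (2 \cdot 30 - 1) = 841 + 59 = 900$, and this is the same as $30^2$.

(b) $a_{74} = a_{73} + (2 \cdot 74 - 1) = 73^2 + (2 \cdot 74 - 1) = 5329 + 147 = 5476$, and this is the same as $74^2$.

(c) The formula for $a_m$ is given by $a_m = a_{m-1} + 2 \cdot m$. By the time I'm filling in row $m$, all the rows from 1 to $m - 1$ will already be done, so I'll already know that $a_{m-1}$ is equal to $(m-1)m$. Substituting this for $a_{m-1}$, I'll get $a_m = (m-1)m + 2m$, and a little algebra will show that this simplifies to $(m-1)m + 2m = m^2 - m + 2m = m^2 + m = m(m+1)$.

2. We show both answers in Table 2-12.

n | Sum $(= 1 + 2 + \cdots + n)$ | Simplified sum | $\frac{n(n+1)}{2}$ | Is sum $= \frac{n(n+1)}{2}$?
35 | $(1 + 2 + \cdots + 34) + 35$ | $595 + 35 = 630$ | $\frac{35 \cdot 36}{2} = 630$ | Yes
67 | $(1 + 2 + \cdots + 66) + 67$ | $2211 + 67 = 2278$ | $\frac{67 \cdot 68}{2} = 2{,}278$ | Yes

Table 2-12 Solution to Practice Problem 2

3. (a) $P(35)$ is "If there are 35 students in the class, the room will be too small." $P(m-1)$ is "If there are $m - 1$ students in the class, the room will be too small."

(b) If we use $\sum$ notation, $S(n)$ is "$\sum_{i=1}^{n} i^2 = \frac{n(n+1)(2n+1)}{6}$." $S(1)$ is "$1 = \frac{1 \cdot 2 \cdot 3}{6}$," $S(2)$ is "$1 + 4 = \frac{2 \cdot 3 \cdot 5}{6}$," $S(3)$ is "$1 + 4 + 9 = \frac{3 \cdot 4 \cdot 7}{6}$," and $S(4)$ is "$1 + 4 + 9 + 16 = \frac{4 \cdot 5 \cdot 9}{6}$." All four are true. $S(m-1)$ is "$1 + 4 + 9 + \cdots + (m-1)^2 = \frac{(m-1)(m)(2m-1)}{6}$."

(c) $R(1)$ is "$a_1 = 4 \cdot 1 + 7$," $R(2)$ is "$a_2 = 4 \cdot 2 + 7$," $R(3)$ is "$a_3 = 4 \cdot 3 + 7$," $R(4)$ is "$a_4 = 4 \cdot 4 + 7$." Since (according to the recursive definition) the sequence begins with 11, 15, 19, 23, all four are true. $R(m-1)$ is "$a_{m-1} = 4 \cdot (m-1) + 7$," which simplifies to "$a_{m-1} = 4m + 3$."

4. $P(n)$ is "$1 + 2 + 4 + \cdots + 2^{n-1} = 2^n - 1$." Table 2-13 shows the inductive reasoning for the proof. Here is the calculation that uses $P(m-1)$ to verify $P(m)$:

$$\begin{aligned}
1 + 2 + 2^2 + \cdots + 2^{m-1} &= (1 + 2 + 2^2 + \cdots + 2^{m-2}) + 2^{m-1} \\
&= (2^{m-1} - 1) + 2^{m-1} && \text{(since } P(m-1) \text{ was already checked)} \\
&= 2 \cdot 2^{m-1} - 1 \\
&= 2^m - 1
\end{aligned}$$

n | $P(n)$ | $P(n)$ simplified | True?
1 | "$1 = 2^1 - 1$" | "$1 = 2 - 1$" | Yes
2 | "$1 + 2 = 2^2 - 1$" | "$1 + 2 = 4 - 1$" | Yes
3 | "$1 + 2 + 4 = 2^3 - 1$" | "$1 + 2 + 4 = 8 - 1$" | Yes
4 | "$1 + 2 + 4 + 8 = 2^4 - 1$" | "$1 + 2 + 4 + 8 = 16 - 1$" | Yes
5 | "$1 + 2 + 4 + 8 + 16 = 2^5 - 1$" | "$1 + 2 + 4 + 8 + 16 = 32 - 1$" | Yes
and so on, up to
$m-1$ | "$1 + 2 + \cdots + 2^{m-2} = 2^{m-1} - 1$" | | Yes
$m$ | "$1 + 2 + \cdots + 2^{m-1} = 2^m - 1$" | | ?

Table 2-13 Solution to Practice Problem 4

5. Consider the statement "$a_n = n(n+1)$." Table 2-14 shows the inductive reasoning for the proof. Here is the calculation that verifies the truth of "$a_m = m(m+1)$":

$$\begin{aligned}
a_m &= a_{m-1} + 2m \\
&= (m-1)m + 2m, && \text{using ``} a_{m-1} = (m-1)(m) \text{''} \\
&= m^2 - m + 2m \\
&= m^2 + m \\
&= m(m+1)
\end{aligned}$$

n | $a_n$ | "$a_n = n(n+1)$" | True?
1 | $a_1 = 2$ | $2 = 1 \cdot 2$ | Yes
2 | $a_2 = a_1 + 2 \cdot 2$ | $2 + 4 = 6$ | Yes
3 | $a_3 = a_2 + 2 \cdot 3$ | $6 + 6 = 12$ | Yes
4 | $a_4 = a_3 + 2 \cdot 4$ | $12 + 8 = 20$ | Yes
5 | $a_5 = a_4 + 2 \cdot 5$ | $20 + 10 = 30$ | Yes
and so on, up to
$m-1$ | $a_{m-1} = a_{m-2} + 2 \cdot (m-1)$ | "$a_{m-1} = (m-1)m$" | Yes
$m$ | $a_m = a_{m-1} + 2 \cdot m$ | "$a_m = m(m+1)$" | ?

Table 2-14 Solution to Practice Problem 5
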
Exercises for Section 2.3 
1. (a) If $P(n)$ is "$n^2 + 1$ is prime," write $P(1)$, $P(2)$, and $P(12)$. Which if any are true? Also, write $P(m-1)$.
(b) If $L(n)$ is "$n^2 < 2^n$," write $L(1)$, $L(2)$, $L(3)$, $L(4)$, $L(5)$, and $L(6)$. Which if any are true? Also, write $L(m-1)$.
(c) If $S(n)$ is "$\sum$ […]," write $S(1)$, $S(2)$, $S(3)$, $S(4)$, $S(5)$, and $S(6)$. Which if any are true? Also, write $S(m-1)$ and simplify it.
(d) If $P(n)$ is "$n$ can be written as the product of exactly two different prime numbers," write $P(4)$, $P(5)$, and $P(6)$. Which if any are true? Also, write $P(m-1)$.
(e) If $J(n)$ is "If the Josephus game of Section 1.1 starts with $2^n$ people, and every second person is killed, then the person in position 1 is the last survivor," write $J(1)$, $J(2)$, $J(3)$, and $J(4)$. Which if any are true? Also, write $J(m-1)$.

2. For the sequence given recursively by $a_k = 2a_{k-1}$ […] for $k \ge 2$, where $a_1 = 4$, someone has guessed a closed form and used the predicate $R(n)$ for the statement "$a_n =$ […] $+ 3$."
(a) Are $R(1)$, $R(2)$, $R(3)$, and $R(4)$ true?
(b) Write $R(8)$ and $R(9)$.
(c) Suppose that Amanda has already verified $R(8)$. Show how to verify $R(9)$ without starting all over.

3. In Proposition 2, we showed how to argue that a recursively defined number sequence is equally well described by a given closed formula. For each of the sequences defined recursively below, prove that the given closed formula is correct. As part of each proof, verify the given statement for at least $n = 1$, $n = 2$, $n = 3$, and $n = 4$.
(a) Show that the sequence defined by $a_k = a_{k-1} + 4$ for $k \ge 2$, where $a_1 = 1$, is equivalently described by the closed formula $a_n = 4n - 3$.
(b) Show that the sequence defined by $a_k = a_{k-1} + (k + 3)$ for $k \ge 2$, where $a_1 = 5$, is equivalently described by the closed formula $a_n =$ […].
(c) Show that the sequence defined by $a_k = a_{k-1} +$ […] for $k \ge 2$, where $a_1 = 1$, is equivalently described by the closed formula $a_n =$ […].
(d) Show that the sequence defined by $a_k = 2a_{k-1} + 1$ for $k \ge 2$, where $a_1 = 1$, is equivalently described by the closed formula $a_n = 2^n - 1$.
(e) Show that the sequence defined by $a_k = a_{k-1}$ […] for $k \ge 2$, where $a_1 = 4$, is equivalently described by the closed formula $a_n =$ […].
(f) Show that the sequence defined by $a_k = 2a_{k-1} - 3$ for $k \ge 2$, where $a_1 = 4$, is equivalently described by the closed formula $a_n = 2^{n-1} + 3$.

4. For each of the sequences defined recursively below, use mathematical induction to prove that the given closed formula is correct.
(a) Show that the sequence defined by $b_k = b_{k-1}$ […] for $k \ge 2$, where $b_1 = 4$, is equivalently described by the closed formula $b_n =$ […].
(b) Show that the sequence defined by $b_k = 3b_{k-1}$ for $k \ge 2$, where $b_1 = 6$, is equivalently described by the closed formula $b_n = 2 \cdot 3^n$.
(c) Show that the sequence defined by $b_k = 3b_{k-1} + 2$ for $k \ge 2$, where $b_1 = 2$, is equivalently described by the closed formula $b_n = 3^n - 1$.
(d) Show that the sequence defined by $b_k = 4b_{k-1} + 3$ for $k \ge 2$, where $b_1 = 3$, is equivalently described by the closed formula $b_n = 4^n - 1$.
(e) Show that the sequence defined by $b_k = 2b_{k-1} + k$ for $k \ge 2$, where $b_1 = 3$, is equivalently described by the closed formula $b_n = 3 \cdot 2^n - n - 2$.

5. Guess a closed formula, in terms of $n$, for the sequence given recursively by $a_1 = 1$, and for $k \ge 2$, $a_k = a_{k-1} + (2k - 3)$. Use induction to prove your guess is correct. Hint: Write the first eight terms of the sequence. How do the terms relate to the sequence $1, 4, 9, 16, 25, 36, \ldots$?

6. Suppose $S(n)$ is the statement "$\sum$ […]." Rewrite $S(n)$ without using sigma notation. Write $S(1)$, $S(2)$, $S(3)$, and $S(4)$ and simplify each to see if it is true. Write $S(m-1)$ without using sigma notation.

7. If $G(n)$ is the statement "$\sum$ […]," write $G(n)$ without using sigma notation. Then write $G(1)$, $G(2)$, $G(3)$, and $G(4)$ without using sigma notation.

8. Use induction to prove each of the following. As part of your proof, write and verify each statement for at least $n = 2$, $n = 3$, and $n = 4$.
(a) $\sum_{i=1}^{n} (2i - 1) = n^2$ for each $n \ge 1$.
(b) $\sum_{i=1}^{n}$ […] for each $n \ge 1$.
(c) $\sum_{i=1}^{n} (2^i - 1) = 2^{n+1} - n - 2$ for each $n \ge 1$.
(d) $\sum_{i=1}^{n}$ […] for each $n \ge 1$.
(e) $\sum_{i=1}^{n}$ […] for each $n \ge 1$.
(f) $\sum_{i=1}^{n}$ […] for all $n \ge 1$.
(g) $\sum_{i=1}^{n}$ […] for all $n \ge 1$.

9. Use mathematical induction to prove each of the following.
(a) Prove by induction that for all positive integers $n$,

$$1 + 3 + 6 + 10 + \cdots + \frac{n(n+1)}{2} = \frac{n(n+1)(n+2)}{6}$$

(b) Prove by induction that for all natural numbers $n \ge 1$,

$$1(3) + 2(4) + 3(5) + \cdots + n(n+2) = \frac{n(n+1)(2n+7)}{6}$$

10. Write each of the equations in Exercise 9 using sigma notation.

11. Guess a formula, in terms of $n$, for […] and use induction to prove that your guess is correct.

12. Show that the sequence defined by $a_k = a_{k-1} + 2a_{k-2}$ for $k \ge 3$, where $a_1 = 1$ and $a_2 = 2$, is equivalently described by the closed formula $a_n = 2^{n-1}$. (Notice that because the recursive formula starts at $k = 3$ you will have to at least check the statements for $n = 1$ and $n = 2$ before considering the induction step. Also notice that when you verify $a_m = 2^{m-1}$, you will need to use the closed formulas for $a_{m-1}$ and $a_{m-2}$ that have already been verified.)

13. Let a real number $x \ne 1$ be given. Prove by induction that for all positive integers $n$,

$$1 + x + x^2 + \cdots + x^n = \frac{x^{n+1} - 1}{x - 1}$$

14. Explain how Exercise 13 implies that when $x$ is a number between 0 and 1, the more terms of

$$1 + x + x^2 + \cdots$$

you add, the closer the sums get to $\frac{1}{1-x}$.


2.4 More About Induction


In this section we explore induction in a little more depth, illustrating the wide variety of problems for which induction is an effective tool. Even though our proofs in this section will be written more formally, keep in mind that the formal exposition is simply a compressed version of the reasoning process we outlined in the preceding section.

You may have noticed that both types of problems in the preceding section, sums and recursive sequences, had a certain common "feel." In both situations, the calculations to verify statement $P(m)$ used the result from $P(m-1)$ and some simple algebra. Our first objective in this section is to explore this specific connection.




Sums as Recursive Sequences 


Consider the proof from Proposition 2 of Section 2.3, that the recursive sequence $a_1 = 1$, $a_k = a_{k-1} + (2k - 1)$ has closed formula $a_n = n^2$. In proving "$a_m = m^2$," we wrote

$$\begin{aligned}
a_m &= a_{m-1} + (2m - 1) \\
&= (m-1)^2 + (2m - 1) && \text{using ``} a_{m-1} = (m-1)^2 \text{''} \\
&= m^2 - 2m + 1 + 2m - 1 \\
&= m^2
\end{aligned}$$

The first step used the relationship between $a_m$ and $a_{m-1}$, while the second step used the fact that "$a_{m-1} = (m-1)^2$." Now consider the proof from Proposition 1 of Section 2.3, that the sum $1 + 2 + 3 + \cdots + n$ has the closed formula $\frac{n(n+1)}{2}$. In proving the statement $S(m)$, we wrote

$$\begin{aligned}
1 + 2 + 3 + \cdots + m &= (1 + 2 + 3 + \cdots + (m-1)) + m \\
&= \frac{(m-1)(m)}{2} + m && \text{using } S(m-1) \\
&= \frac{m^2 + m}{2} \\
&= \frac{m(m+1)}{2}
\end{aligned}$$

Once again, the first step established the relationship between statements $S(m-1)$ and $S(m)$, and the second step used the information. It is no coincidence that this is exactly what happened in the other proof. Sums are recursive by nature: the sum of 101 numbers is simply the sum of the first 100 numbers plus the last number. Let's see how to make this connection explicit. In the examples that follow, we use $s_n$ rather than $a_n$ for the sequences to suggest that $s_n$ is defined by a sum.





Example 1 Consider the sum $\sum_{i=1}^{n} (2i - 1)$, which is the same as $1 + 3 + 5 + \cdots + (2n - 1)$. Let us use the notation $s_n$ to denote this sum. For example, $s_5$ means $1 + 3 + 5 + 7 + 9$. Find a recursive description of $s_n$.

SOLUTION The key to the solution is realizing that $s_{n-1}$ means the sum of the first $n - 1$ terms, and that to sum the first $n$ terms, we sum the first $n - 1$ terms and then add the final term (i.e., the term when $i = n$). In symbols,

$$\begin{aligned}
s_n &= \sum_{i=1}^{n} (2i - 1) \\
&= 1 + 3 + 5 + \cdots + (2n - 1) \\
&= (1 + 3 + 5 + \cdots + (2n - 3)) + (2n - 1) \\
&= s_{n-1} + (2n - 1)
\end{aligned}$$

So $s_n = s_{n-1} + (2n - 1)$ is the recurrence relation. However, this by itself does not determine the sequence of numbers; we must also provide a starting value. This comes from the somewhat silly case, $s_1 = 1$, since the sum $\sum_{i=1}^{1} (2i - 1)$ is 1.




Practice Problem 1 Let $s_n = \sum_{i=1}^{n} (4i - 1)$. Find a recursive description of $s_n$.


If we write a sum recursively as in Example 1, it makes it somewhat easier to write an induction proof about the sum. In Example 1, we showed that the sum $s_n = \sum_{i=1}^{n} (2i - 1)$ is described by the recurrence relation $s_n = s_{n-1} + (2n - 1)$ with $s_1 = 1$, but in Proposition 2 of Section 2.3, we proved that the sequence with this recursive description satisfies the closed formula $s_n = n^2$. So in a roundabout way we have proved

$$\sum_{i=1}^{n} (2i - 1) = n^2$$


Perhaps it would be clearer to see another example of this process all in one place. 




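
Example 2 Show that $\sum_{i=1}^{n} \frac{1}{2^i} = 1 - \frac{1}{2^n}$ for every positive integer $n$.

PROOF (By induction on n) Let $s_n$ represent the sum $\sum_{i=1}^{n} \frac{1}{2^i}$. Then $s_n$ has the recursive description $s_n = s_{n-1} + \frac{1}{2^n}$ with $s_1 = \frac{1}{2}$. So instead of the original statement, we will prove that

$$s_n = 1 - \frac{1}{2^n} \quad \text{for every positive integer } n.$$

It is easy to see that the first statement, "$s_1 = 1 - \frac{1}{2}$," is true. Now let a positive integer $m$ be given such that the statements up to "$s_{m-1} = 1 - \frac{1}{2^{m-1}}$" have been checked to be true. Now considering the next term $s_m$, we see that

$$\begin{aligned}
s_m &= s_{m-1} + \frac{1}{2^m} \\
&= \left(1 - \frac{1}{2^{m-1}}\right) + \frac{1}{2^m} \\
&= 1 - \frac{2}{2^m} + \frac{1}{2^m} \\
&= 1 - \frac{1}{2^m}
\end{aligned}$$

So we have verified the correct closed form for the next term.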
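The closed formula in Example 2 involves fractions, so a numerical check is best done with exact arithmetic rather than floating point. The sketch below uses Python's standard `fractions.Fraction` type; the function name `s` mirrors the notation of the example, and the check covers only finitely many n.

```python
from fractions import Fraction


def s(n: int) -> Fraction:
    """Compute s_n exactly from s_1 = 1/2 and s_k = s_(k-1) + 1/2^k."""
    total = Fraction(1, 2)  # s_1
    for k in range(2, n + 1):
        total += Fraction(1, 2 ** k)
    return total


for n in (1, 2, 3, 10):
    print(n, s(n), 1 - Fraction(1, 2 ** n))
```

Using exact fractions avoids rounding, so equality between the two printed values is meaningful.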


Practice Problem 2 Emulate Example 2 to show that $\sum_{i=1}^{n} (4i - 1) = n \cdot (2n + 1)$ for every positive integer $n$.


There is no logical necessity in rewriting a sum as a recurrence relation, but it 
is an intriguing idea. It turns out that finding nice formulas for sums is really just 
a special case of finding closed formulas for recurrence relations. There has been a 
great deal of work done in mathematics to develop tools to do this. You will get an 
introduction to some of these ideas in Section 5.6. For now, understanding how to
prove a closed formula is correct is an essential first step.




Other Uses of Induction 


There are times when it is either very difficult or very messy to find a closed formula for a recursively defined sequence of numbers. In many of these cases, we might not even need a closed formula, but we do need to establish that the sequence has some particular property. Mathematical induction gives us the tool to prove properties of a recursively defined sequence of numbers directly without finding a closed formula first.








Example 3 Let the number sequence $(a_n)$ be defined by the recurrence relation $a_n = 2a_{n-1} + a_{n-2}$ where $a_1 = 5$ and $a_2 = 10$. Use induction to show that, for all $n \ge 3$, $a_n < 3^n$.

PROOF (By induction on n) Consider the statement "$a_n < 3^n$." The first several terms of the sequence described are

$$5, 10, 25, 60, \ldots$$

so it is easy to verify that "$a_3 < 3^3$" and "$a_4 < 3^4$."

Now let $m \ge 5$ be given such that all these inequalities up to "$a_{m-1} < 3^{m-1}$" have already been checked, and now consider the next statement:

$$\begin{aligned}
a_m &= 2a_{m-1} + a_{m-2}, && \text{by definition of the sequence} \\
&< 2 \cdot 3^{m-1} + a_{m-2}, && \text{since ``} a_{m-1} < 3^{m-1} \text{'' is true} \\
&< 2 \cdot 3^{m-1} + 3^{m-2}, && \text{since ``} a_{m-2} < 3^{m-2} \text{'' is true} \\
&< 2 \cdot 3^{m-1} + 3^{m-1}, && \text{since } 3^{m-2} < 3^{m-1} \\
&= 3 \cdot 3^{m-1} \\
&= 3^m
\end{aligned}$$

This shows that the statement "$a_m < 3^m$" is true, completing the induction.


You may have noticed that in this example we first checked the statement for $n = 3$ instead of for $n = 1$. This is simply because the statement being proved, "$a_n < 3^n$," is only asserted true for $n \ge 3$; in other words, "from $n = 3$ on." In terms of our domino analogy, this is like having the "dominoes" properly spaced but then knocking down the third one first. If this happens, then our conclusion is that they all fall down from the third one on. This is still a useful thing to know as the previous example attests.

The other item to notice in the previous example is that we checked the statement for both $n = 3$ and $n = 4$ before taking on the "inductive step." While it is certainly harmless to show more initial steps than necessary in any of our inductive proofs, in this case it is actually necessary in the proof. To see why, notice that the proof requires we know something about both $a_{m-1}$ and $a_{m-2}$. If we tried to start the inductive step at $m = 4$, we would be using the statements "$a_2 < 3^2$" and "$a_3 < 3^3$." However, "$a_2 < 3^2$" was not checked, and, in fact, it is not even true! We need to check "$a_3 < 3^3$" and "$a_4 < 3^4$" by hand, and then prove the induction step for $m \ge 5$. This raises the general issue that it is not always sufficient to show only one initial statement for an inductive proof to be logically sound.
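The point about initial statements in Example 3 is easy to see numerically. The following sketch lists the first terms of the sequence and reports for which n the inequality holds; the helper names `terms` and `holds` are invented for illustration, and the check is limited to the first dozen terms.

```python
def terms(count: int) -> list:
    """First `count` terms of a_1 = 5, a_2 = 10, a_n = 2*a_(n-1) + a_(n-2)."""
    seq = [5, 10]
    while len(seq) < count:
        seq.append(2 * seq[-1] + seq[-2])
    return seq[:count]


def holds(n: int, seq: list) -> bool:
    """Is the statement a_n < 3^n true? (seq is indexed from 0, so a_n is seq[n - 1].)"""
    return seq[n - 1] < 3 ** n


seq = terms(12)
for n in range(1, 13):
    print(n, seq[n - 1], 3 ** n, holds(n, seq))
```

The statements for n = 1 and n = 2 come out false, while those from n = 3 on come out true, which is why the proof checks n = 3 and n = 4 by hand before the induction step.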

One of the number sequences we encountered in Section 1.2 was the sequence of Fibonacci numbers. These numbers are the source of a great many problems in science and recreational mathematics. Most of us know the Fibonacci numbers as the sequence obtained by starting with 1 and 1, and then using the sum of two successive numbers in the sequence to get the next number in the sequence. So the sequence looks like this:

$$1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, \ldots$$

Formally, this means that the Fibonacci numbers are defined with the recurrence relation

$$F_n = F_{n-1} + F_{n-2} \quad \text{for all } n \ge 3 \qquad (2.3)$$

where $F_1 = 1$ and $F_2 = 1$.




Example 4 Show that the Fibonacci number $F_{3n}$ is always even. (This means that $F_k$ is even whenever $k$ is divisible by 3.)

PROOF (By induction on n) Consider the statement "$F_{3n}$ is even." Since we wrote out some of the Fibonacci numbers above, we can quickly see that $F_3 = 2$, $F_6 = 8$, and $F_9 = 34$, so the statements "$F_3$ is even," "$F_6$ is even," and "$F_9$ is even" are each true. Now let $m \ge 2$ be given, and assume that we have already checked that the Fibonacci numbers $F_3, F_6, \ldots, F_{3(m-1)}$ are all even. The next statement under consideration is $P(m)$, which states that "$F_{3m}$ is even." With careful use of the recurrence relation (2.3), we can see that

$$\begin{aligned}
F_{3m} &= F_{3m-1} + F_{3m-2} \\
&= (F_{3m-2} + F_{3m-3}) + F_{3m-2} \\
&= F_{3m-3} + 2F_{3m-2}
\end{aligned}$$

So $F_{3m} = F_{3m-3} + 2F_{3m-2}$. It has already been checked that $F_{3m-3} = F_{3(m-1)}$ is even, and $2F_{3m-2}$ is certainly even since it is twice an integer. Since the sum of two even numbers is even, we can conclude that the next statement "$F_{3m}$ is even" is true. This completes the induction.
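Both the claim of Example 4 and the key identity used in its proof can be tested on a computer for small values. This is a minimal sketch; the function name `fib` is invented here, with the indexing $F_1 = F_2 = 1$ taken from relation (2.3).

```python
def fib(n: int) -> int:
    """Fibonacci number F_n with F_1 = F_2 = 1."""
    a, b = 1, 1  # F_1, F_2
    for _ in range(n - 1):
        a, b = b, a + b
    return a


# Every third Fibonacci number should be even.
print([fib(3 * n) for n in range(1, 7)])

# The identity from the proof: F_(3m) = F_(3m-3) + 2 * F_(3m-2) for m >= 2.
print(all(fib(3 * m) == fib(3 * m - 3) + 2 * fib(3 * m - 2) for m in range(2, 30)))
```

The second check is the algebraic heart of the induction step: it expresses the next statement's number in terms of the previously checked one plus an even number.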




















Example 5 Show that the Fibonacci numbers satisfy the following for all $n \ge 1$:

$$\sum_{i=1}^{n} (F_i)^2 = (F_n)(F_{n+1})$$

PROOF (By induction on n) Let $P(n)$ be the statement "$\sum_{i=1}^{n} (F_i)^2 = (F_n)(F_{n+1})$." Since $P(1)$ states that "$(F_1)^2 = (F_1)(F_2)$," and by definition $F_1 = F_2 = 1$, then $P(1)$ is true.

Now let the positive integer $m$ be given such that statements $P(1), \ldots, P(m-1)$ have already been checked to be true; in particular, we already know for sure that $\sum_{i=1}^{m-1} (F_i)^2 = (F_{m-1})(F_m)$. Now note that

$$\begin{aligned}
\sum_{i=1}^{m} (F_i)^2 &= \left(\sum_{i=1}^{m-1} (F_i)^2\right) + (F_m)^2 \\
&= (F_{m-1})(F_m) + (F_m)^2, && \text{by statement } P(m-1) \\
&= (F_m)(F_{m-1} + F_m) \\
&= (F_m)(F_{m+1}), && \text{by the Fibonacci recurrence relation (2.3)}
\end{aligned}$$

So we have shown that $\sum_{i=1}^{m} (F_i)^2 = (F_m)(F_{m+1})$. That is, $P(m)$ is true, as desired.
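The identity in Example 5 can also be spot-checked numerically. In this sketch the names `fib` and `sum_of_squares` are invented for illustration; as always, agreement for the tested values supports the identity but does not prove it.

```python
def fib(n: int) -> int:
    """Fibonacci number F_n with F_1 = F_2 = 1."""
    a, b = 1, 1
    for _ in range(n - 1):
        a, b = b, a + b
    return a


def sum_of_squares(n: int) -> int:
    """Left-hand side: (F_1)^2 + (F_2)^2 + ... + (F_n)^2."""
    return sum(fib(i) ** 2 for i in range(1, n + 1))


for n in range(1, 8):
    print(n, sum_of_squares(n), fib(n) * fib(n + 1))
```

Each row prints the left-hand and right-hand sides of the identity for one value of n.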





Practice Problem 3 Emulate Example 3 to show that $F_n < 2^n$ for all positive integers $n$.


We have seen how induction can be used to prove that every number in a sequence has a particular property. The common theme has been the use of the recursive description of the sequence to show that the property is preserved from one term to the next. This general idea can be extended to situations where we start with a closed formula for the sequence. In these cases, there are often perfectly good noninduction proofs of the same properties, but an induction proof might be more elegant or offer a new insight into the property at hand.

Example 6 Show that $n^3 + 2n$ is divisible by 3 for all positive integers $n$.

PROOF (By induction on n) Let $D(n)$ be the statement "$n^3 + 2n$ is divisible by 3." Statement $D(1)$ states that "$1^3 + 2(1)$ is divisible by 3," which is true. Now let the positive integer $m$ be given such that statements $D(1), \ldots, D(m-1)$ have already been checked to be true; in particular, we already know that $(m-1)^3 + 2(m-1)$ is divisible by 3. This means that there is an integer $K$ so that $(m-1)^3 + 2(m-1) = 3K$. This can be simplified algebraically to say $m^3 - 3m^2 + 3m - 1 + 2m - 2 = 3K$, or $m^3 - 3m^2 + 5m - 3 = 3K$. Note that

$$\begin{aligned}
m^3 + 2m &= (m^3 - 3m^2 + 5m - 3) + (3m^2 - 3m + 3) \\
&= (3K) + (3m^2 - 3m + 3), && \text{using the simplified statement } D(m-1) \\
&= 3K + 3m^2 - 3m + 3 \\
&= 3(K + m^2 - m + 1)
\end{aligned}$$

That is, $m^3 + 2m = 3(K + m^2 - m + 1)$, so "$m^3 + 2m$ is divisible by 3" is a true statement. This is statement $D(m)$, as desired.
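The algebra in Example 6 can be checked mechanically. In the sketch below, `quotient` plays the role of the integer K from the proof (the name is invented for illustration), and the loop confirms both the divisibility claim and the relation between consecutive quotients for a range of m.

```python
def quotient(m: int) -> int:
    """The integer K with m^3 + 2m = 3K (valid because m^3 + 2m is divisible by 3)."""
    return (m ** 3 + 2 * m) // 3


for m in range(2, 10):
    k_prev = quotient(m - 1)  # the K supplied by statement D(m - 1)
    divisible = (m ** 3 + 2 * m) % 3 == 0
    # The last line of the proof: m^3 + 2m = 3(K + m^2 - m + 1).
    matches = m ** 3 + 2 * m == 3 * (k_prev + m * m - m + 1)
    print(m, m ** 3 + 2 * m, divisible, matches)
```

The `matches` column tests exactly the final equation of the proof, with K taken from the previous row.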





It might not be clear from the examples we have seen up to this point, but mathematical induction can be applied to any situation where there is a sequence of statements about the positive integers, even when there is no "formula" involved.


Example 7 Show that in a single-elimination basketball tournament with n
teams there must be n − 1 games played to determine a champion.

PROOF (By induction on n) Let T(n) be the statement “In a single-elimination
basketball tournament with n teams there must be n − 1 games played to determine
a champion.” Certainly in any tournament with 1 team, no games need to
be played. Hence, the statement T(1) is true. Also in any tournament with two
teams, only one game needs to be played. Hence, the statement T(2) is true. Let
m be given such that statements T(1), ..., T(m − 1) have already been shown to
be true; in particular, we already know that in any tournament with m − 1 teams,
m − 2 games are required. Now let any tournament with m teams be given. After
one game has been played in this tournament, one team is eliminated, leaving
m − 1 teams still “alive.” Previously checked statement T(m − 1) told us that to
determine a champion from among these remaining m − 1 teams, exactly m − 2
games are required. So to determine a champion in the tournament involving m
teams, we need the first game plus the other m − 2 games for a total of m − 1
games. This means that T(m) is true, as desired.
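The argument of Example 7 never uses the shape of the bracket, only the fact that each game eliminates exactly one team. The following Python sketch (an added illustration; the function name `games_played` and the random pairing rule are assumptions made for the demonstration) plays tournaments with arbitrary pairings and counts the games.

```python
import random


def games_played(n_teams: int, rng: random.Random) -> int:
    """Play a single-elimination tournament and return the number of games.

    Each game picks two remaining teams at random and eliminates one of them,
    so the bracket shape is arbitrary.
    """
    alive = list(range(n_teams))
    games = 0
    while len(alive) > 1:
        a, b = rng.sample(alive, 2)   # two distinct teams still alive
        loser = rng.choice([a, b])    # who wins is irrelevant to the count
        alive.remove(loser)
        games += 1
    return games


rng = random.Random(0)  # fixed seed so runs are repeatable
for n in range(1, 50):
    assert games_played(n, rng) == n - 1
```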














Example 8 In the Josephus problem from Section 1.1, we placed a number
of people in a circle and eliminated every other person until only one person was left.
The puzzle is to find the correct place to stand so that you are the last person surviving.
If the game starts with $2^n$ people (numbered 1, ..., $2^n$) and every second person is
eliminated starting with Person 2, where should you stand?

SOLUTION Claim: You should stand in position 1.

Proof by induction on n. Let J(n) be the statement “If this game starts
with $2^n$ people, then the person in position 1 is the last survivor.” If the game is
played out with two people, then since Person 2 is eliminated first, Person 1
is the last survivor. That is, J(1) is true. It is also easy to check J(2): If the game
is played with $2^2 = 4$ people, then the elimination order is 2, 4, 3, which leaves
Person 1 as the last survivor.

Now let m be given such that statements J(1), ..., J(m − 1) have already
been checked to be true; in particular, we know that when the game starts with
$2^{m-1}$ people, then Person 1 is the last survivor. Now consider the game starting
with $2^m$ people. After one time around the circle, players in the even positions
are eliminated, leaving only the people in the odd positions 1, 3, 5, 7, ..., $2^m - 1$
alive. This leaves half of the original $2^m$ people, which amounts to $2^{m-1}$ people,
remaining. We know from statement J(m − 1) that in this game starting with
$2^{m-1}$ people, the person in the first position (which is Person 1) will be the last
survivor. Hence, Person 1 survives the game with $2^m$ players, completing the
induction.
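The claim of Example 8 is easy to test by direct simulation. The sketch below is an added illustration (the function name `last_survivor` is an assumption); it plays the game exactly as described, eliminating every second person starting with Person 2, and confirms that Person 1 survives whenever the starting count is a power of 2.

```python
def last_survivor(num_people: int) -> int:
    """Simulate the game: people 1..num_people stand in a circle and every
    second person is eliminated, starting with Person 2."""
    circle = list(range(1, num_people + 1))
    idx = 0  # index of the person who is skipped next (Person 1 at the start)
    while len(circle) > 1:
        idx = (idx + 1) % len(circle)  # step to the person to eliminate
        circle.pop(idx)                # eliminate them
        idx %= len(circle)             # the next person is skipped; wrap if needed
    return circle[0]


# Statement J(n) for n = 1, ..., 10.
for n in range(1, 11):
    assert last_survivor(2**n) == 1
```

For a starting count that is not a power of 2 the survivor is generally a different person, which is the subject of Exercise 10.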




















Example 9 In the magic trick that opens Section 1.1, there are three ways in
which the packet of four cards can be mixed. The packet can be cut, the top two cards
can be turned over as one, or the entire packet can be turned over together. Recall that
the packet starts off with one card (the spade) facing the “wrong way” from the rest.
Prove that for all n ≥ 0, after n of these shuffles, there will still be one card facing the
wrong way from the rest.

PROOF (By induction on n) Let P(n) be the statement “After n of these shuffles,
there will still be one card facing the wrong way from the rest.” The first statement
is P(0), which refers to the packet before any shuffle has been performed. Certainly
at this point, there is one card (the spade) facing the wrong way, so statement P(0)
is true. Now let m ≥ 1 be given such that statements P(0), P(1), ..., P(m − 1) have
all been checked to be true.

Once the packet has undergone m shuffles, we know that after the first m − 1
of these shuffles, there is still one card turned the wrong way from the others, by
statement P(m − 1). The final ($m^{\text{th}}$) shuffle must be one of the three given types,
so we consider each possibility as a separate case.










Case 1: If the $m^{\text{th}}$ shuffle cuts the packet, then no cards are turned over, so the
one card that was turned the wrong way before this last shuffle will still
be facing the wrong way after the shuffle.

Case 2: If the $m^{\text{th}}$ shuffle reverses the entire packet, then all the cards are turned
over, so the one card that was facing the wrong way before this last shuffle
will still be turned the wrong way after the shuffle.

Case 3: If the $m^{\text{th}}$ shuffle flips the top two cards as one, then things become a bit
more complicated. If the card that is facing the wrong way before this final
shuffle is among the top two in the packet, then the $m^{\text{th}}$ shuffle creates
a new, single wrong-way card in the same position in the packet. On the
other hand, if the card that is facing the wrong way before this final shuffle
is among the bottom two in the packet, then the $m^{\text{th}}$ shuffle creates a new,
single wrong-way card in a different position among the bottom two cards
in the packet.














In each case, after the $m^{\text{th}}$ shuffle, the packet still has a single wrong-way card.
That is, statement P(m) is true, completing the induction.
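Because a packet of four cards has only finitely many arrangements, the invariant of Example 9 can also be confirmed by exhaustive search. The Python sketch below is an added illustration under stated assumptions: a packet is modeled as a tuple of (label, face-up) pairs from top to bottom, only the spade is named in the text so the other three labels are placeholders, and a cut is modeled as moving the top k cards to the bottom.

```python
from collections import deque

# A packet is a tuple of (card, face_up) pairs, listed from top to bottom.
# Only the spade is named in the text; "A", "B", "C" are placeholder labels.
START = (("spade", False), ("A", True), ("B", True), ("C", True))


def cut(packet, k):
    """Move the top k cards to the bottom without turning anything over."""
    return packet[k:] + packet[:k]


def flip_top_two(packet):
    """Turn the top two cards over as one: their order reverses and each flips."""
    (c1, u1), (c2, u2) = packet[0], packet[1]
    return ((c2, not u2), (c1, not u1)) + packet[2:]


def flip_all(packet):
    """Turn the whole packet over: the order reverses and every card flips."""
    return tuple((c, not up) for c, up in reversed(packet))


def one_wrong_way(packet):
    """True when exactly one card faces opposite to the other three."""
    return sum(up for _, up in packet) in (1, 3)


def moves(packet):
    """All packets reachable from `packet` by one shuffle."""
    yield from (cut(packet, k) for k in (1, 2, 3))
    yield flip_top_two(packet)
    yield flip_all(packet)


# Breadth-first search over every packet reachable by any number of shuffles.
seen, queue = {START}, deque([START])
while queue:
    p = queue.popleft()
    assert one_wrong_way(p)
    for q in moves(p):
        if q not in seen:
            seen.add(q)
            queue.append(q)
```

The search visits every reachable packet once, so the assertion inside the loop checks the invariant for all n at the same time.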





We end this section with a result in mathematics that is so important it is called
the fundamental theorem of arithmetic. This important idea was known to the ancient
Greeks, so it is easy for us to take it for granted. Since its proof is a simple application
of mathematical induction, this course is a good place to see why it is true.








Theorem 1 (Fundamental Theorem of Arithmetic) Every integer
greater than 1 can be expressed as the product of a list of prime numbers.


PROOF (By induction on n) Let P(n) be the statement “n can be written as the 
product of a list of prime numbers." It is easy to check that P(2) and P(3) are true, 
since 2 and 3 are prime numbers themselves. It might seem silly to think of 2 as
being the product of the prime numbers in the list "2," but technically it is. Let's 
look at a few more statements before moving on to the induction step: 

















• P(4) is true since 4 is the product of the prime numbers in the list “2, 2.”
• P(5) is true since 5 is the product of the prime numbers in the list “5.”
• P(6) is true since 6 is the product of the prime numbers in the list “2, 3.”





Now let m ≥ 7 be given such that P(2), P(3), ..., P(m − 1) have all been checked
to be true, and we are now considering statement P(m). Either the number m is
prime or it is not.

• Case 1: If m is prime, then technically m is the product of the prime numbers
in the list “m,” so P(m) is true in this case.

• Case 2: If m is not prime, then by the definition of “prime,” we know that
m = a × b for some positive integers a and b, each strictly between 1 and m.





" The usual theorem also states that this product of prime numbers is unique—that is, a number cannot 
be written as the product, say, 5 x 7, and also as some other product using primes other than 5 and 7. 
We will not prove uniqueness here. 










Since we already have checked statement P(a), we know that a can be written
as the product of a list of primes, and since we already have checked statement
P(b), we know that b can be written as the product of a list of primes. If we
take the list of primes whose product is a along with the list of primes whose
product is b, we will have a list of primes whose product is a × b = m. Hence,
P(m) is true in this case.

Since P(m) is true in either case, this completes the induction step.
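The two cases of the proof translate directly into a recursive procedure. The sketch below is an added illustration, not the text's own algorithm; the function name `prime_list` is an assumption. It returns a list of primes whose product is m, combining the lists for a and b exactly as in Case 2.

```python
from math import prod


def prime_list(m: int) -> list[int]:
    """Return a list of primes whose product is m, for any integer m > 1.

    Mirrors the two cases of the proof: if m has no divisor strictly between
    1 and m it is prime (Case 1); otherwise m = a * b and the lists for a and
    b are combined (Case 2).
    """
    if m < 2:
        raise ValueError("m must be greater than 1")
    a = 2
    while a * a <= m:
        if m % a == 0:                 # Case 2: m = a * b with 1 < a, b < m
            b = m // a
            return prime_list(a) + prime_list(b)
        a += 1
    return [m]                         # Case 1: m is prime


# The product of the returned list recovers m.
for m in range(2, 2000):
    assert prod(prime_list(m)) == m
```

For instance, `prime_list(360)` returns `[2, 2, 2, 3, 3, 5]`. The recursive calls are made on numbers smaller than m, which is the same reason the induction hypothesis applies to a and b.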


Solutions to Practice Problems

1. $\sum_{i=1}^{n} (4i - 1)$ is the same as $3 + 7 + 11 + \cdots + (4n - 1)$. Since
$s_{n-1} = 3 + 7 + \cdots + (4(n-1) - 1)$ and $s_n = 3 + 7 + \cdots + (4(n-1) - 1) + (4n - 1)$, we
conclude that $s_n = s_{n-1} + (4n - 1)$. This along with the
fact that $s_1 = 3$ is enough to determine all the $s_n$.

2. Proof by induction on n. Consider the statement “$s_n = n \cdot (2n + 1)$.”
The first statement says, “$s_1 = 1 \cdot 3$,” which
is true. Now suppose that statements “$s_1 = 1 \cdot 3$,” “$s_2 = 2 \cdot 5$,”
..., “$s_{m-1} = (m-1) \cdot (2(m-1) + 1)$” have all
been checked for a given m ≥ 2. Then we know

\[
\begin{aligned}
s_m &= s_{m-1} + (4m - 1), \quad \text{from Practice Problem 1} \\
    &= (m-1) \cdot (2(m-1) + 1) + (4m - 1) \\
    &= (m-1) \cdot (2m - 1) + (4m - 1) \\
    &= 2m^2 + m \\
    &= m(2m + 1).
\end{aligned}
\]

This shows that the next statement “$s_m = m \cdot (2m + 1)$”
is true, completing the induction step.

3. Proof by induction on n. Consider the statement “$F_n < 2^n$.”
We can easily check that $F_1 < 2^1$ and $F_2 < 2^2$, so
the first two statements are true. Now suppose we have
checked all the statements from “$F_1 < 2^1$” up to “$F_{m-1} < 2^{m-1}$,”
for a given m ≥ 3. Then

\[
\begin{aligned}
F_m &= F_{m-1} + F_{m-2} \\
    &< 2^{m-1} + 2^{m-2}, \quad \text{since } F_{m-1} < 2^{m-1} \text{ and } F_{m-2} < 2^{m-2} \\
    &< 2^{m-1} + 2^{m-1}, \quad \text{since } 2^{m-2} < 2^{m-1} \\
    &= 2^m.
\end{aligned}
\]

This establishes that the next statement “$F_m < 2^m$” is
true, completing the induction.


Exercises for Section 2.4

1. Write each of the following sequences as a recurrence
relation (with sufficient initial values specified):
(0 a, — 2G 3 3 e 3") foreach n> 1,
©) b, Ede foreachn > 1.
(ou ant + forall nzi
© d, — xls ply tein tt ring forall

2. The following refer to the sequences in the previous
exercise. Use induction for each proof.
= 1 for all positive integers n.
$ for all positive integers n.
for all positive integers n,
for all positive integers n.

3. Rewrite the proof of Example 6 using the following outline:


(a) Define the number sequence $\{g_n\}$ with the closed
formula $g_n = n^3 + 2n$. Use algebra to show that $g_m$
is the same thing as $g_{m-1} + 3(m^2 - m + 1)$.

(b) Use the relationship from the previous part to explain
that if $g_{m-1}$ is divisible by 3, then $g_m$ is divisible
by 3.

(c) Using this relationship, prove by induction that $g_n$
is divisible by 3 for all integers n ≥ 1.

4. Use mathematical induction to prove each of the following:
(a) Use induction to prove that for each integer n ≥ 1,
$n^2 - n$ is even.

(b) Use induction to prove that for each integer n ≥ 1,
$n^3 - n$ is divisible by 3.

(c) Use induction to prove that for each integer n ≥ 1,
$n^5 - n$ is divisible by 5.

5. Use mathematical induction to prove that for all integers
n ≥ 0, $10^n - 1$ is divisible by 9.














6. Use mathematical induction to prove that for all in 
gers n = 2,2" — 1 is not prime, 

7. We define the Pell sequence by the initial values p, 
and p; = 2 along with the recurrence relation 


1 








Dem pea Ds 
ion on n that 
*CU' = va = py 


(This establishes that the left-hand side of the above 
equation is always a perfect square, thus solving Prob. 
lem 602 from the College Mathematics Journal, May 
1997.) 

8. The following problems refer to the Fibonacci numbers 
defined before Example 4: 

(2) Show that for all n > 2, Fy < 2°. 
(b) Show that for all n > 1, 


Prove by indu 








Fat Retin 





Fasil 


(©) Show that for all n > 1, 


Fit Fitt Fin = Fay 
(d) Show that forall n > 1, 
Fit hate m Farol 


9. The following problems refer to the Fibonacci numbers 

defined before Example 4 

(a) Prove by induction that for all = 1, Fay is divisible 
by3 

(b) Prove by induction that forall 1, 
bys. 

(©) Show that F.2 = ó^ forall n> 1, where à = 
(Hir: First check that ¢ satisfies the equation 
@ =o +1. This is the property of à that is impor- 
tant for this problem.) 

10. This problem refers to the Josephus problem from Section 1.1
and Example 8.

(a) In the Josephus problem, if the game starts with
$2^n$ people and every second person is eliminated,
where should Josephus's friend stand? Prove your
answer by induction on n.





"sa is divisible 














(b) Use the answer in Example 8 to explain where Josephus
should stand in the Josephus game starting
with n people where every second person is eliminated.

(c) Use the answer to part (a) above to explain where
Josephus's friend should stand if the game starts
with n people and every second one is eliminated.

11. For the magic trick that opens Section 1.1 (and revisited
in Example 9), prove that after any n shuffles, there is
one card between the “club” and the wrong-way card.







12. Prove by induction on n ≥ 1 that the product of n odd
integers is an odd integer.

13. (a) Prove by induction on n ≥ 1 that the sum of n rational
numbers is a rational number.

(b) Prove by induction on n ≥ 1 that the average of n
rational numbers is a rational number.

14. The following statements prove the existence of the
quotient and remainder in Theorem 8 of Section 2.2,
the division theorem.

(a) Prove by induction on n ≥ 0 that there exist integers
q and r such that $n = 3 \cdot q + r$ and $0 \le r \le 2$.
(Hint: Use statement P(m − 3) in trying to prove
statement P(m).)

(b) Prove by induction on n ≥ 0 that there exist integers
q and r such that $n = 5 \cdot q + r$ and $0 \le r \le 4$.

(c) Let the positive integer k be given. Prove by induction
on n ≥ 0 that there exist integers q and r such
that $n = k \cdot q + r$ and $0 \le r \le k - 1$.

15. Suppose you have an unlimited supply of 3-cent and
8-cent stamps at your disposal. Show by induction that
you can use just combinations of these stamps to make
n cents in postage for any n ≥ 14.

16. Suppose you have an unlimited supply of 5-cent and
8-cent stamps at your disposal. Find the smallest value
of N so that the statement “You can use just combinations
of these stamps to make n cents in postage for any
n ≥ N” is true, and prove the statement by induction.

17. Prove that for eve 






























18. Nicole Oresme (1323-1382) used the result in the previous
problem to argue that the sum

\[ 1 + \frac{1}{2} + \frac{1}{3} + \frac{1}{4} + \cdots + \frac{1}{k} \]

can be made larger than any fixed number if k is taken
to be large enough. Informally explain Oresme's argument.
In the language of modern calculus, this means
that the infinite sum $\sum_{i=1}^{\infty} \frac{1}{i}$, called the harmonic series,
diverges.
19. Find the error in each of the following “proofs.”
(a) Proposition All sets are infinite.
Flawed Argument. Let P(n) be the statement “Every
set has at least n elements.” We will prove by
induction on n that P(n) is always true, from which
the proposition will follow.
First note that P(0) says, “Every set has at least
0 elements,” which is true. Now let m ≥ 1 be given,
and assume we have verified that P(0), P(1), ...,
P(m − 1) are all true. Let a set S be given, and
choose an element a ∈ S. Applying the induction
hypothesis P(m − 1) to the set S − {a} tells us that
S − {a} has at least m − 1 elements. Since S has one
more element than S − {a}, it follows that S has at
least m elements. Since S is any given set, this shows
then that every set has at least m elements. That is,
P(m) is true.

(b) Proposition Define the number sequence $\{a_n\}$ by
the recurrence relation $a_n = a_{n-2} + 2$, where $a_1 = 3$
and $a_2 = 2$. For all n ≥ 1, $a_n$ is odd.
Flawed Argument. Let P(n) be the statement “$a_n$
is odd.” First note that P(1) says, “$a_1$ is odd,” which
is true since we are given the value $a_1 = 3$. Now
assume that we have verified P(1), ..., P(m − 1)
(where m ≥ 2), and we are now considering P(m).
The recurrence relation tells us that $a_m = a_{m-2} + 2$
and P(m − 2) tells us that $a_{m-2}$ is odd; hence, $a_m$ is
odd. That is, P(m) is true, completing the induction
step.

(c) Proposition All horses are the same color.
Flawed Argument. Let P(n) be the statement “Every
corral containing n horses has all horses the
same color.” We will prove by induction on n that
P(n) is always true, from which the proposition will
follow.

First note that P(1) says, “Every corral containing
one horse has all horses the same color,”
which is true. Now assume that we have verified
P(1), P(2), ..., P(m − 1) are all true (where m ≥ 2),
and we are now considering P(m). To do this, we
must allow the reader to pick any corral containing
m horses she wishes. If we designate one of the
horses “Ed,” we can have Ed temporarily removed,
leaving us with a corral containing m − 1 horses.
Since we have already checked that P(m − 1) is true,
we know that every horse in this corral is the same
color. Now if we return Ed and remove a different
horse, whom we will designate as “Silver,” we
will be led to the same conclusion. Hence, the corral
of horses without Ed has horses all of the same
color, and the corral of horses without Silver has
horses all of the same color. Since these two groups
overlap, we conclude that all horses in the original
corral of m horses are of this same color. Since this
reasoning can be applied to any corral of m horses
supplied by the reader, this means that P(m) is
true.


2.5 Contradiction and the Pigeonhole Principle 





The main emphasis in this chapter has been proving statements of the form “For all 
x in some domain of interest, if this hypothesis is satisfied, then this conclusion is 
true.” An example of this type of statement is “For all integers n, if n is even, then $n^2$
is even.” In the notation of Chapter 1, these statements are quantified predicates of
the form $\forall x \in D,\ P(x) \rightarrow Q(x)$.

At this point, we have three major tools in our arsenal of proof techniques for
proving statements of this type:





1. Direct proof. We imagine that the READER of the proof has selected an element of
the domain for which the hypothesis is true, and the AUTHOR must demonstrate
to the READER that the theorem's conclusion must be true for that element. The
term direct proof comes from the fact that we start by assuming the hypothesis,
and logically move “forward” until we reach the conclusion.

2. Proof by contrapositive. To use this technique, we form the contrapositive of the
theorem. We then do a direct proof of the contrapositive statement. Since a theorem
and its contrapositive are equivalent, this also proves the original theorem.
This is sometimes called an indirect proof because we have not directly proved
the original theorem, but have instead obtained the proof in an indirect manner.

3. Mathematical induction. This technique applies to theorems about the positive
integers, or more generally to theorems about all integers greater than or equal
to some fixed value. The heart of the proof is typically a direct proof that assumes
P(1), P(2), ..., P(m − 1) as the hypothesis, and proves the conclusion P(m).







In this section we examine a truly indirect proof technique known as proof by 
contradiction. We also study an important example of a theorem that is usually proved 
using this technique. 


Proof by Contradiction 


One way to think about direct proof and proof by contrapositive is that they both
demonstrate, in different ways, that there cannot possibly be a counterexample to the
theorem. Thus, we know the theorem must be true for all elements of the domain.

These proof techniques are based on the two properties that a potential
counterexample to a given implication must possess:

(i) It must make the hypothesis of the implication true.
(ii) It must make the conclusion of the implication false.

In a direct proof, we show the READER that if she chooses an element satisfying
property (i), that same element cannot satisfy property (ii). In this way, we show that
she cannot possibly find a counterexample.

When we prove the contrapositive, we show the READER that if she chooses an
element satisfying property (ii), that same element cannot satisfy property (i). Once
again, the READER will be unable to find a counterexample.

In a proof by contradiction, we allow the READER to imagine she has found
a counterexample, and we show that this cannot be so because it would lead to a
“contradiction,” that is, to a statement we know to be false. In so doing, we are
essentially showing that properties (i) and (ii) are logically incompatible.


Example 1 Explain the interaction between AUTHOR and READER when thinking
about the following proposition in terms of “proof by contradiction.”

Proposition If n is an odd integer, then $n^2 + n$ is even.

SOLUTION Remember that the READER sets out looking for counterexamples
to the given statement. The method of “proof by contradiction” asks the
READER what will happen (i.e., follow deductively) if he does happen to find a
counterexample. So the AUTHOR might write the following:

Dear READER,

I appreciate your efforts to find a counterexample to the given statement, but
did you ever stop to think what will happen if you find one? Suppose you do
find an example (let's agree to call it n) that makes the hypothesis true and the
conclusion false. This means that your n is an odd integer that makes $n^2 + n$ an
odd integer. Because of our definition of “odd,” this means that n = 2K + 1 for
some integer K and $n^2 + n = 2L + 1$ for some integer L. Since your example
would create these two equations, then substituting the first equation into the
second

\[
\begin{aligned}
2L + 1 &= n^2 + n \\
       &= (2K + 1)^2 + (2K + 1) \\
       &= 4K^2 + 6K + 2
\end{aligned}
\]

will give us the equation $2L + 1 = 4K^2 + 6K + 2$. This might not seem so bad
as written, but a little algebra will turn this into the equation

\[ L - 2K^2 - 3K = \frac{1}{2}. \]

Since L and K are integers, we know that $L - 2K^2 - 3K$ is an integer, but we
know that $\frac{1}{2}$ is definitely not an integer. This is clearly nonsense! However, it is a
logically valid deduction if the example you are looking for really exists. I think
we will both sleep better at night knowing that there is no counterexample to
the given statement.

Only concerned with your well-being,
AUTHOR
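The READER's search for a counterexample in Example 1 can be acted out mechanically. The Python sketch below is an added illustration (the function name `example1_counterexamples` and the search bound are arbitrary assumptions); it looks for odd integers n that make $n^2 + n$ odd and finds none in the range tried. An empty search result is only evidence; the letter above is what rules out every integer.

```python
def example1_counterexamples(limit: int) -> list[int]:
    """Odd integers n with |n| <= limit for which n^2 + n is odd."""
    return [n for n in range(-limit, limit + 1)
            if n % 2 == 1 and (n * n + n) % 2 == 1]


# No counterexample turns up, in agreement with the proposition.
assert example1_counterexamples(10_000) == []

# The algebra in the letter: (2K + 1)^2 + (2K + 1) = 4K^2 + 6K + 2, an even number.
for K in range(-100, 101):
    assert (2 * K + 1) ** 2 + (2 * K + 1) == 4 * K * K + 6 * K + 2
```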
It is useful to examine how each type of proof would begin, and how it would
finish. Table 2-15 shows the outline of three different forms of proof for the proposition
“If n is an odd integer, then $n^2 + n$ is even.” For a direct proof, we start by assuming
the hypothesis is true, and eventually show that the conclusion must be true. For
a contrapositive proof, we use the same approach for the contrapositive statement
“If $n^2 + n$ is odd, then n is even.” For a proof by contradiction, we assume
that there is a counterexample, and we try to show how a false statement follows
from this assumption. In both the direct and contrapositive proofs, the AUTHOR has
well-defined starting and ending points. With a proof by contradiction, by contrast,
the AUTHOR has more information to start with, but no particular goal toward which
to work. This is a typical trade-off when developing a proof by contradiction. You
may assume two things (the hypothesis is true and the conclusion is false), but in
exchange you lose the advantage of having a well-established goal.




















Practice Problem 1 For each statement, rewrite the statement in “if, then” form,
if necessary. Then create a table similar to that given above, showing how to start and
end a direct proof, a contrapositive proof, and a contradiction proof for the statement.








(a) If 
(b) Even perfect squares are always divisible by 4, 


is odd, then n is odd. 











Direct
    Let n be an odd integer.
    We conclude that $n^2 + n$ must be even.

Contrapositive
    Let n be given such that $n^2 + n$ is odd.
    We conclude that n is even.

Contradiction
    Suppose there is an integer n such that n is odd and $n^2 + n$ is odd.
    We infer a false statement.
    This is a contradiction, so we know there cannot be a counterexample.

Table 2-15 The Structure of Proofs




For the statement proved in Example 1, it turns out that a direct proof is very
short and simple. (Refer to Exercise (10b) in Section 2.1.) You might wonder why
anyone would prefer the proof by contradiction. It is not really a matter of preference,
but more a matter of the thought processes by which one discovers the proof.

It is usually a good idea to try to find a direct proof, or a proof of the contrapositive
statement, before resorting to proof by contradiction. However, there are some
theorems for which proof by contradiction may be considered as the best alternative.
One situation occurs when both the statement and its contrapositive have “negative”
conclusions. In general, it can be difficult to prove that something does not happen.
Here is an example of a proposition that we have not proved before.








Example 2 Why might you try a proof by contradiction for the proposition
“If n has the form 4K + 3 for some integer K, then $n^2 - n$ is not divisible by 4”?

SOLUTION Table 2-16 shows the starting and ending points for a direct proof
and a proof of the contrapositive. In both direct and contrapositive proofs, the
goal contains the word not. That does not mean we cannot do such a proof, but
it does suggest considering a proof by contradiction first.





Now let's see the proof discovery process as well as the formal version of a
“proof by contradiction” for this proposition.


Proposition 1 If n has the form 4K + 3 for some integer K, then $n^2 - n$ is not
divisible by 4.


PROOF (Informal version) 


Dear READER,

In your search for a counterexample to the above statement, suppose you
actually find one. Let's agree to call it n and talk about what properties n must
have. Since n is a counterexample to the given statement, we agree that n
makes the hypothesis true and the conclusion false. In other words, n = 4K + 3
for some integer K, and $n^2 - n = 4L$ for some integer L. Substituting the first
equation into the second, we see that

\[
\begin{aligned}
4L &= n^2 - n \\
   &= (4K + 3)^2 - (4K + 3) \\
   &= 16K^2 + 20K + 6.
\end{aligned}
\]













Direct
    Let n be an integer with the form 4K + 3.
    We conclude that $n^2 - n$ is not divisible by 4.

Contrapositive
    Let n be given such that $n^2 - n$ is divisible by 4.
    We conclude that n does not have the form 4K + 3.

Table 2-16 Solution to Example 2




So your value of n creates the equation $16K^2 + 20K + 6 = 4L$, which can be
rewritten as

\[ L - 4K^2 - 5K = \frac{3}{2}. \]

The trouble with this is that $L - 4K^2 - 5K$ is an integer but $\frac{3}{2}$ is not. So you see
that if you do find a counterexample, it will lead (through simple algebra) to
an equation showing that $\frac{3}{2}$ is an integer. Since this is nonsense, it must be the
case that no counterexample to the original statement exists.

Always failing to never be your friend,
AUTHOR





PROOF (Formal version)

Suppose that a counterexample n to the above statement exists. This means
that n has the form 4K + 3, and $n^2 - n$ is divisible by 4. In other words, n = 4K + 3
and $n^2 - n = 4L$ for some integers K and L. Combining these two facts leads us
to say

\[ 4L = n^2 - n = (4K + 3)^2 - (4K + 3) = 16K^2 + 20K + 6. \]

The equation $4L = 16K^2 + 20K + 6$ can be rearranged to say

\[ L - 4K^2 - 5K = \frac{3}{2}, \]

from which it follows that $\frac{3}{2}$ is an integer, which is a contradiction. Therefore, no
counterexample to the original statement exists.
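The contradiction in Proposition 1 comes from the constant term: $16K^2 + 20K + 6$ always leaves remainder 2 on division by 4. The sketch below is an added illustration (the function name is an assumption); it computes that remainder for a range of K. It agrees with the algebra but does not replace it.

```python
def n_squared_minus_n_mod_4(K: int) -> int:
    """Remainder of n^2 - n on division by 4 when n = 4K + 3."""
    n = 4 * K + 3
    return (n * n - n) % 4


# The remainder is always 2, so n^2 - n is never divisible by 4.
assert all(n_squared_minus_n_mod_4(K) == 2 for K in range(-1000, 1001))
```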


Another situation that suggests trying proof by contradiction occurs when the
original “if, then” statement and its contrapositive seem equally difficult to prove.
For example, the statement “If x > 0, then $\frac{1}{x} > 0$” has contrapositive “If $\frac{1}{x} \le 0$, then
$x \le 0$,” which is essentially the same.





Proposition 2 For every real number x, if x > 0, then $\frac{1}{x} > 0$.


PROOF Suppose there is a counterexample to this statement. That is, a real
number x exists such that x > 0 and $\frac{1}{x} \le 0$. Since x > 0, multiplying by x on both
sides* of the inequality

\[ \frac{1}{x} \le 0 \]

gives us the inequality

\[ \frac{1}{x} \cdot x \le 0 \cdot x \quad \text{or} \quad 1 \le 0. \]

Certainly, “$1 \le 0$” is nonsense, so we have arrived at a contradiction. Therefore,
no counterexample exists for the original statement.


* We are, of course, assuming that the READER acknowledges this is a valid rule for inequalities.




If you ask yourself, “How did the AUTHOR of the proof know what to do to get
a contradiction?” you have uncovered the difficulty with a proof by contradiction:
one never knows in advance what the contradiction will look like. This is one reason
we suggest first trying to find a direct proof or a contrapositive proof, since at least then
you have a definite goal to work toward.











Existence and Nonexistence Proofs 


Most of our theorems to this point have been implications, characterized as “For all
x in some domain D, if P(x), then Q(x).” We have emphasized this type of theorem
because it is such a frequently encountered form. However, some mathematical
theorems assert the existence or nonexistence of some mathematical object with
a particular property. An existence proof is a proof of a theorem characterized as
“There exists x in some domain D such that P(x).” Likewise, the proof of a theorem
characterized as “There does not exist an x in some domain D such that P(x)” is
called a nonexistence proof.











Example 3 Determine which of the following statements are true. Give a
convincing argument for your answer:

1. There exists a positive real number satisfying the equation r* — r* P —r +r — 
120 

2. There exists a positive integer s satisfying the equation $6s^2 - 5s + 1 = 0$.








3. There exists a positive rational number t satisfying the equation $t^2 - 2 = 0$.


4. There exists a smallest positive integer. 


SOLUTION Only statements #1 and #4 are true. 


1, Thisisa true statement. If we test the value = 1 in the equation, we find that 
18-14 +181? +1 — 1 =0, so this shows that the positive real number 
r = L satisfies the equation. 





2. This is false because the quadratic formula tells us that the only real solutions
to the equation $6s^2 - 5s + 1 = 0$ are $\frac{1}{2}$ and $\frac{1}{3}$, neither of which is an integer
value. Hence, there are no integers with the desired property.

3. This is a false statement. We will address the formal justification for this in
Theorem 4.





4. This is a true statement since we know that the number 1 is the smallest
positive integer.
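The claim about statement 2 can be checked with exact rational arithmetic. The sketch below is an added illustration (the function name `p` and the search range are assumptions); it verifies that $\frac{1}{2}$ and $\frac{1}{3}$ are roots of $6s^2 - 5s + 1$ and, as a sanity check only, that no small positive integer is.

```python
from fractions import Fraction


def p(s):
    """Left-hand side of the equation in statement 2: 6s^2 - 5s + 1."""
    return 6 * s * s - 5 * s + 1


# Exact arithmetic confirms the two roots named in the solution.
assert p(Fraction(1, 2)) == 0
assert p(Fraction(1, 3)) == 0

# A quadratic has at most two roots, so no integer can be a root; the loop
# below is only a sanity check over a small, arbitrarily chosen range.
assert all(p(s) != 0 for s in range(1, 1001))
```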







To prove the true statements above, in each case, we simply supplied an example
of a number with the desired property. This type of proof is called a constructive proof
of an “existence statement,” and it is very hard to take issue with, even for the most
critical READER.

Existence proofs are actually not new to you. We have seen one major theorem
whose proof is an existence proof, the division theorem (Theorem 8 of Section 2.2).
The proof of the existence of a quotient and remainder with the desired properties was
addressed in Exercise 14 of Section 2.4. In addition, you have done small existence
proofs within almost every proof in the first two sections of the chapter. For example,
consider this proof from Section 2.2.


Proposition 3 If the integers m and n are both divisible by 3, then the number
m + n is also divisible by 3.

PROOF Let m and n be integers, each divisible by 3. This means that there are
integers K and L such that m = 3K and n = 3L. Hence m + n = 3K + 3L = 3(K + L).
Since K + L is an integer, this shows that m + n is divisible by 3.


To show that m + n is divisible by 3, we had to produce an integer Q such that
m + n = 3Q. The last step of the proof shows that the integer Q = K + L works. The
definitions of even, odd, divisibility, and rational all involve the existence of integers
with certain properties. As a result, proofs where the conclusion involves any of these
concepts usually contain a constructive existence proof.
Nonexistence proofs, on the other hand, can be more challenging, and may best
be handled using proof by contradiction. Consider the following classic result:








Theorem 4 The real number $\sqrt{2}$ is irrational. That is, there does not exist a
rational number r such that $r^2 = 2$.


The proof, given below, is a quintessential example of a nonexistence proof. This
category of proof is another that is frequently attacked using proof by contradiction.
In a direct proof, the “goal” would be to show that something does not exist, and
it is not clear how one could reach such a goal. We would need to show that “for
every rational number r, $r^2 \ne 2$.” We can find rational numbers whose square is very
Close to 2—for example, (355539 = 1,999990233, How. can we show directly that r? 
Will never be exactly 2? Rather than try, we will show that a counterexample to the 
theorem would lead to contradiction, The contradiction will center ne the following 
idea that we first saw in the definition of a rational number. 











Definition Two integers are said to be relatively prime if they have no
common divisor greater than 1.














PROOF (Theorem 4) Suppose to the contrary that this proposition is not true.
That is, there does exist a rational number r such that $r^2 = 2$. Since r is a rational
number, we know that there exist integers a and b such that $r = \frac{a}{b}$. Moreover, we
know that these two integers can be chosen to be relatively prime.

Since $r^2 = 2$, we have $(\frac{a}{b})^2 = 2$. Using algebra, we rewrite this as $a^2 = 2b^2$.
From this ensues the following chain of reasoning:

• Since $a^2 = 2b^2$, we know that $a^2$ is even, which means that a is even.[7]

• Since a is even, we know that $a = 2K$ for some integer K. This means
  $a^2 = (2K)^2 = 4K^2$.

• Since $a^2 = 2b^2$ and $a^2 = 4K^2$, it follows that $b^2 = 2K^2$. From this, we know that
  $b^2$, and hence b, is even.

• Since a and b are both even, they have 2 as a common divisor.

• Remember that a and b are relatively prime.

The final two statements are contradictory, so the assumption that the proposition
is false must be wrong. We conclude that the proposition is true. ∎

[7] In Theorem 7 of Section 2.1, we proved the proposition “If $n^2$ is even, then n is even.”
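A finite search cannot replace the proof, but it can make the claim concrete. The
following Python sketch looks for integers a and b with $a^2 = 2b^2$ among all
denominators up to a bound, and also reports the fraction whose square comes closest
to 2. The function names and the bounds are illustrative choices, not part of the text.

```python
from fractions import Fraction
from math import isqrt


def exact_solutions(max_b):
    """Return every pair (a, b) with 1 <= b <= max_b and a*a == 2*b*b."""
    found = []
    for b in range(1, max_b + 1):
        target = 2 * b * b
        a = isqrt(target)  # largest integer whose square is at most target
        if a * a == target:
            found.append((a, b))
    return found


def closest_fraction(max_b):
    """Return (a, b) minimizing |(a/b)^2 - 2| over 1 <= b <= max_b."""
    best = None
    best_error = None
    for b in range(1, max_b + 1):
        low = isqrt(2 * b * b)
        for a in (low, low + 1):  # the two integers bracketing b*sqrt(2)
            error = abs(Fraction(a * a, b * b) - 2)  # exact rational arithmetic
            if best_error is None or error < best_error:
                best, best_error = (a, b), error
    return best


print(exact_solutions(10_000))  # the theorem predicts that no pair is found
print(closest_fraction(12))     # a near miss, never an exact hit
```

However large the bound, the first search should come back empty, while the second
shows how close a rational square can get to 2 without reaching it.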








The same argument structure can be used to prove that other important numbers
are not rational. For example, in art and architecture, the golden ratio is considered to
represent the ideal proportion of height to width for a rectangle. Formally, a rectangle
is in this proportion if the removal of an appropriate square leaves a smaller rectangle
that is similar to (i.e., in the same proportions as) the original rectangle.[8] As Figure
2-4 illustrates, the implication is that the ratio of 1 to r is the same as the ratio of $r - 1$
to 1. That is, $\frac{1}{r} = \frac{r-1}{1}$. It can be shown that the positive real number satisfying this
equation is approximately 1.62, but we can show that the very relationship defining
this value of r can be used to prove that r is irrational.

Figure 2-4 A golden rectangle














Proposition 5 There does not exist a rational number r satisfying $\frac{r-1}{1} = \frac{1}{r}$.

PROOF Suppose to the contrary that there is such a rational number r. That
means we can write $r = \frac{a}{b}$, where a and b are relatively prime integers. Now $\frac{r-1}{1} = \frac{1}{r}$
can be rewritten as $\frac{a}{b} - 1 = \frac{b}{a}$, which implies that $a^2 - ab = b^2$, or $a^2 - b^2 = ab$.
This implies that $(a - b)(a + b) = ab$.

Since a and b are relatively prime, they cannot both be even. Thus, one of the
following must be true: (1) a is even and b is odd; or (2) b is even and a is odd; or
(3) both are odd. We can proceed by cases.[9]

• Case 1: If a is even and b is odd, then $(a - b)$ and $(a + b)$ are odd, from which it
  follows that $(a - b)(a + b)$ is odd. However, since a is even, we know ab is even.
  We conclude that $(a - b)(a + b) \neq ab$, contradicting our earlier calculation that
  $(a - b)(a + b) = ab$.

• Case 2: A similar contradiction arises if a is odd and b is even.

• Case 3: If a and b are both odd, then ab is odd, but $(a - b)$ and $(a + b)$ are both
  even, which means that $(a - b)(a + b)$ is even. We conclude that $(a - b)(a + b) \neq ab$,
  contradicting our earlier calculation that $(a - b)(a + b) = ab$.

In every case, a contradiction arises. Hence, our original assumption that $\frac{r-1}{1} = \frac{1}{r}$
is satisfied by a rational number r must be incorrect. ∎

[8] Such a rectangle is naturally called a golden rectangle.

[9] In these cases, we have omitted some details such as the fact that the product of two odd numbers is
always odd. These details are of the type you learned about in Section 2.1, so we will not clutter the
proof with them.
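The case analysis in the proof of Proposition 5 is a statement about parities, so it
can be spot-checked mechanically. The Python sketch below tests, for every pair of
positive integers up to a limit that are not both even, that $(a - b)(a + b)$ and $ab$
have different parities. The function names and the limit are assumptions made for
illustration; the check covers only finitely many pairs and does not replace the proof.

```python
def parities_clash(a, b):
    """True when (a - b)(a + b) and a*b have different parities."""
    return ((a - b) * (a + b)) % 2 != (a * b) % 2


def check_cases(limit):
    """Check every pair 1 <= a, b <= limit that is not both even."""
    for a in range(1, limit + 1):
        for b in range(1, limit + 1):
            if a % 2 == 0 and b % 2 == 0:
                continue  # excluded: relatively prime integers are never both even
            if not parities_clash(a, b):
                return False
    return True


print(check_cases(200))
```

Because the two sides never share a parity in the three cases, they can never be
equal, which is exactly the contradiction each case of the proof reaches.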







More Classic Proofs by Contradiction 


In Book IX of Euclid's Elements, a series of proofs about prime numbers is given.[10]
In Greek mathematics all numbers represent geometric measurements, so it is fairly
interesting that the notion of “prime number” was considered important at all. The
following classic example of using “proof by contradiction” is Proposition 20 in Book
IX of the Elements.


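Theorem 6 There are an infinite number of prime numbers.


PROOF Suppose to the contrary that there are only a finite number of prime
numbers. Then we can form the number x by multiplying all the primes together.
Even though x is a large number, it will be evenly divisible by every prime number.
This means that $x + 1$ leaves a remainder of 1 when divided by any prime, so $x + 1$
is not divisible by any of the primes used to form x. However, every integer greater
than 1 is divisible by some prime number, so $x + 1$ has a prime divisor that is not
among the primes used to form x. This contradicts
the assumption that there are only finitely many prime numbers. Hence, that
assumption must be false, which means that there are, in fact, infinitely many prime
numbers. ∎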
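The classical construction behind this theorem multiplies a list of primes together
and adds 1; any prime factor of the result must lie outside the list. The Python sketch
below carries this out for short lists. The helper names are hypothetical, and trial
division is used only because it is simple, not because it is efficient.

```python
def smallest_prime_factor(n):
    """Return the smallest prime dividing n (n >= 2) by trial division."""
    d = 2
    while d * d <= n:
        if n % d == 0:
            return d
        d += 1
    return n  # no divisor up to sqrt(n), so n itself is prime


def prime_outside(primes):
    """Return a prime that is not in the given list of primes."""
    x = 1
    for p in primes:
        x *= p
    # x + 1 leaves remainder 1 on division by each listed prime,
    # so its smallest prime factor cannot be one of them.
    return smallest_prime_factor(x + 1)


print(prime_outside([2, 3, 5]))
print(prime_outside([2, 3, 5, 7, 11, 13]))
```

Note that the product plus 1 need not itself be prime: for the second list it is
30031, which factors as 59 times 509. The argument only needs some prime factor that
was missing from the list.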


There is a specific kind of proof by contradiction that is related to mathematical
induction. One way to rephrase
the principle of mathematical induction is to state that any nonempty set of positive
integers will always have a smallest number in the set. This fact is called the
well-ordering principle of the set of positive integers, and it is used in the following proof
of a property that is fundamental to the study of number theory.








Theorem 7 For integers a and b, define the set $S_{a,b}$ to be the set of all integers
of the form $au + bv$, where $u, v \in \mathbb{Z}$. (For example, $S_{6,8}$ includes numbers like 20 and
4 since $20 = 2 \cdot 6 + 1 \cdot 8$ and $4 = -2 \cdot 6 + 2 \cdot 8$.) If c is the smallest positive integer in
$S_{a,b}$, then every number in $S_{a,b}$ is a multiple of c.





PROOF (By contradiction) Let integers a and b be given, and let $c = au + bv$
(for $u, v \in \mathbb{Z}$) be the smallest positive integer in the set $S_{a,b}$. Assume that the
theorem is not true, and let d be the smallest positive counterexample to the
theorem. That is, $d = ax + by$ for some $x, y \in \mathbb{Z}$, but d is not divisible by c. Since
d is not divisible by c, we know from the division theorem that $d = q \cdot c + r$ for
some integer q and $1 \le r < c$. But in this case,

$r = d - q \cdot c$
$  = (ax + by) - q(au + bv)$
$  = a(x - qu) + b(y - qv)$

This means that $r \in S_{a,b}$, but r is a positive number less than c, so this is a
contradiction to the fact that c is the smallest positive number in $S_{a,b}$. ∎

[10] Ronald Calinger's Classics of Mathematics is a very worthwhile book full of original sources from
the history of math, including translations of these particular propositions from Book IX of Euclid's
Elements.
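Theorem 7 can be explored numerically by listing a finite window of the set $S_{a,b}$.
The Python sketch below builds all values $au + bv$ with u and v restricted to a small
range, finds the smallest positive member, and checks that every listed value is a
multiple of it. The function names and the window size are assumptions; because only
a finite window is examined, the result is an illustration rather than a verification.

```python
from math import gcd


def window_of_S(a, b, bound):
    """Return {a*u + b*v : -bound <= u, v <= bound}, a finite piece of S_{a,b}."""
    coefficients = range(-bound, bound + 1)
    return {a * u + b * v for u in coefficients for v in coefficients}


def smallest_positive(values):
    """Return the smallest positive member of a set of integers."""
    return min(x for x in values if x > 0)


S = window_of_S(6, 8, 10)
c = smallest_positive(S)
print(c)                          # smallest positive value in the window
print(all(x % c == 0 for x in S)) # every listed value is a multiple of c
print(c == gcd(6, 8))             # compare with the greatest common divisor
```

The last line anticipates Exercise 21 of this section, which asks for a proof that c
is the greatest common divisor of a and b.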





How Not to Use Proof by Contradiction 


Some instructors complain, “Once I show my students proof by contradiction, they 
want to use it for every proof they do!” Some of these instructors have even taken the
extreme point of view that one should never use contradiction, since there is surely 
a direct or contrapositive proof if you only look hard enough. 

Why is this a complaint? Isn't proof by contradiction a valid form of proof? The 
answer is “Yes, certainly!” However, there are some situations where a relatively 
small rewrite yields a more elegant and straightforward proof. This process sometimes 
disguises the hard work that was done in discovering a proof, but it is still worthwhile 
if brevity and logical clarity are among your proof-writing goals. 

How can a student just learning about proof-writing possibly avoid this com- 
plaint? First, you can always try first to find a direct or contrapositive proof. Second, 
when you do use contradiction, you can recognize two dead giveaways that a rewrite 
is possible and not difficult. Both situations apply to proving an “if, then” propo- 
sition. To help you understand the discussion, Table 2-17 summarizes how each of 
the three types of proof begins and ends. Now suppose you write a perfectly correct 
proof by contradiction, but in your proof, the contradiction you reach is “The con- 
clusion is true, and that contradicts our assumption that the conclusion is false.” In 
this case, 























• The first step of your proof includes the assumption that the “hypothesis is true”;
  and

• The last step of your proof is the inference that the “conclusion is true.”

If you look closely, buried in your proof you will find a direct proof of the proposition,
since this is exactly how a direct proof begins and ends.

On the other hand, suppose you write a perfectly correct proof by contradiction, 
but in your proof, the contradiction you reach is "The hypothesis is false, and that 
contradicts our assumption that the hypothesis is true." In this case, 





• The first step of your proof includes the assumption that the “conclusion is false”;
  and

• The last step of your proof is the inference that the “hypothesis is false.”

If you look closely, buried in your proof you will find a contrapositive proof of the
proposition, since this is how a contrapositive proof begins and ends.

      | Direct              | Contrapositive       | Contradiction
Start | Hypothesis is true. | Conclusion is false. | Hypothesis is true and conclusion is false.
Goal  | Conclusion is true. | Hypothesis is false. | Some contradiction.

Table 2-17 Summary of Proof Techniques

It will help to see examples of each of these types of overuse of the contradiction
technique. Notice how much shorter and more direct the rewrite is in each case.


Example 4 The following is written as “proof by contradiction.” Rewrite it as
a direct proof or contrapositive proof instead.

Proposition If n is an odd integer, then 3n is odd.

PROOF Suppose you have found a counterexample to this statement. Let's agree
to call it m. Since m is a counterexample, we must have m odd while 3m is even.
That is, $m = 2K + 1$ for some integer K, which implies that $3m = 3(2K + 1) =
2(3K + 1) + 1$. Since $3K + 1$ is an integer, this means that 3m is odd. Hence, a
counterexample m must make 3m even and 3m odd at the same time, which clearly
cannot happen. Therefore, no counterexample to the original statement exists. ∎

SOLUTION Since the contradiction is that 3m is odd, which is the same as
the conclusion of the original proposition, we suspect that we can write a direct
proof.

Proof. Let an odd integer n be given. This means $n = 2K + 1$ for some
integer K, which implies that $3n = 3(2K + 1) = 2(3K + 1) + 1$. Since $3K + 1$ is
an integer, this means that 3n is odd. ∎


Practice Problem 2 The following is written as “proof by contradiction.” Rewrite
it as a direct or contrapositive proof instead.

Proposition If $n^2$ is not divisible by 4, then n is odd.

PROOF Suppose you have found a counterexample to this statement. Let's agree
to call it m. Since it is a counterexample, it must make the hypothesis true and the
conclusion false. That is, $m^2$ is not divisible by 4 and $m = 2K$ for some integer K.
The second equation implies

$m^2 = (2K)^2 = 4K^2$

Since $K^2$ is an integer, this implies that $m^2$ is divisible by 4. But m was originally
chosen so that $m^2$ is not divisible by 4, so we have a contradiction. From this, we
conclude that the original assumption that there is a counterexample must be wrong.
Hence, the original statement is true. ∎


The Pigeonhole Principle 


So far we have seen some classic results traditionally proved using the “proof by
contradiction” technique. The next example is an instance of a more general result
that can be proved and used in many applications. It is called the pigeonhole principle,[11]
and it will be the focus of the remainder of this section.

[11] The original version involves placing letters into the pigeonholes of a desk. Although the analogy is out
of date, the name persists.


Example 5 Prove that if 29 tennis balls are given out to four players, then (at
least) one player gets eight or more of the balls.

SOLUTION (Proof by contradiction) Suppose to the contrary that this statement
is false. This means that we can give 29 balls to four players, and each of
the four players gets seven or fewer balls. Use the variables a, b, c, and d to
represent the number of balls given to the four players. Since each player gets
seven or fewer balls, we have $a \le 7$, $b \le 7$, $c \le 7$, $d \le 7$. Since there are 29 balls,
we also have

$a + b + c + d = 29$

However, $a \le 7$, $b \le 7$, $c \le 7$, and $d \le 7$ together imply that

$a + b + c + d \le 7 + 7 + 7 + 7 = 28$

Since it is impossible that $a + b + c + d = 29$ and $a + b + c + d \le 28$, we have
reached a contradiction. This establishes the claim. ∎


The generalization of this simple fact is one version of the pigeonhole principle.


Theorem 8 (Pigeonhole Principle, General Version) If $m \cdot n + 1$ objects are
distributed among n different boxes, then there must be some box containing at least
$m + 1$ objects.


PROOF (By contradiction) Suppose it is possible to distribute $m \cdot n + 1$ objects
into n different boxes, such that each box contains m or fewer objects. Label the
n boxes $1, 2, 3, \ldots, n$. After the objects have been distributed among the boxes,
define $x_1, x_2, \ldots, x_n$ by the following rule:

$x_i$ = the number of objects in box i

Since each object can go into only one box, we know that

$x_1 + x_2 + x_3 + \cdots + x_n = m \cdot n + 1$

On the other hand, we have $x_1 \le m$, $x_2 \le m$, $x_3 \le m$, ..., $x_n \le m$. However, this
implies that

$x_1 + x_2 + x_3 + \cdots + x_n \le m + m + m + \cdots + m$

and thus

$x_1 + x_2 + x_3 + \cdots + x_n \le m \cdot n$

The statements $x_1 + x_2 + x_3 + \cdots + x_n = m \cdot n + 1$ and $x_1 + x_2 + x_3 + \cdots + x_n \le m \cdot n$
form a contradiction, so we conclude that no counterexample can exist,
completing the proof. ∎


In light of our recent discussion, you might wonder whether “proof by
contradiction” is necessary for this statement. Exercise 38 asks you to show that it is
not.
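For small values of m and n, the pigeonhole principle can be confirmed by exhaustive
search. The Python sketch below asks whether a given number of objects can be placed
into boxes with no box exceeding a cap; the function name `fits` and its parameters
are hypothetical, introduced only for this illustration.

```python
from itertools import product


def fits(total, boxes, cap):
    """True if `total` objects can be split among `boxes` boxes
    with at most `cap` objects in each box."""
    # Enumerate every assignment of counts 0..cap to the boxes and
    # look for one whose counts add up to the required total.
    return any(sum(counts) == total
               for counts in product(range(cap + 1), repeat=boxes))


print(fits(29, 4, 7))  # Example 5: 29 balls, four players, at most seven each
print(fits(28, 4, 7))  # one ball fewer
```

The first call searches for the counterexample that the proof rules out, and the
second shows that the count $m \cdot n + 1$ in the theorem cannot be lowered to $m \cdot n$.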





Box number  | 0 | 1      | 2      | 3      | 4      | 5
Ones' digit | 0 | 1 or 9 | 2 or 8 | 3 or 7 | 4 or 6 | 5

Table 2-18 Boxes for Example 6


Practice Problem 3 (Basic pigeonhole principle) Use the previous proof as a
guide to prove the following statement: “If $n + 1$ objects are distributed among n boxes,
then some box must contain more than one object.”


This “basic” version of the pigeonhole principle is the form most commonly
encountered in a discrete math course. Applying it is simply a matter of defining a
rule for placing objects into boxes so that the conclusion “some box contains more
than one object” matches the desired conclusion. Here is a typical example.


Example 6 Show that among any seven positive integers, there are two whose
sum or difference is divisible by 10.

SOLUTION We define six “boxes” so that each integer is placed in one box
based on that number's ones' digit, as shown in Table 2-18. Now given any
seven integers, when placed among these six boxes, some box will contain at
least two integers by the basic pigeonhole principle. If these two integers have
the same last digit, then their difference will be divisible by 10. Otherwise, by
the definition of the boxes, the two numbers will have a sum that is divisible by
10. ∎
Example 7 Here are two different choices for the seven integers referenced
in Example 6. Trace through the proof for the example by showing the placement of
these integers into boxes and discussing the result.

1. The numbers chosen are {13, 15, 28, 30, 1, 46, 58}.
2. The numbers chosen are {27, 86, 50, 35, 11, 44, 108}.

SOLUTION

1. The numbers are placed into boxes as shown in Figure 2-5. In this case, there
   are two numbers in the third box. These two numbers have the same ones'
   digit; hence, their difference, $58 - 28$, is divisible by 10.

2. They are placed into boxes as shown in Figure 2-6. In this case, there are two
   numbers in the fifth box. These two numbers do not have the same ones'
   digit; hence (because of the way the boxes are defined), their sum, $86 + 44$,
   is divisible by 10. ∎

Figure 2-5 Illustration of boxes for Example 6. (Boxes labeled by last digit: 0; 1 or 9; 2 or 8; 3 or 7; 4 or 6; 5.)

Figure 2-6 Illustration of boxes for Example 6.
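The box rule of Example 6 translates directly into a short procedure. The Python
sketch below places each integer into one of six boxes according to its ones' digit
and stops as soon as a box receives a second integer. Indexing the boxes by
`min(d, 10 - d)` is a choice made for this sketch, and the function names are
hypothetical.

```python
def box_of(n):
    """Box index for n: digits d and 10 - d share a box (0 and 5 stand alone)."""
    d = n % 10
    return min(d, 10 - d)


def find_pair(numbers):
    """Return (first, second, kind) for two numbers sharing a box, where kind
    says whether their 'sum' or 'difference' is divisible by 10."""
    seen = {}
    for n in numbers:
        k = box_of(n)
        if k in seen:
            m = seen[k]
            if (n - m) % 10 == 0:
                return (m, n, "difference")  # same ones' digit
            return (m, n, "sum")             # ones' digits add up to 10
        seen[k] = n
    return None  # possible only when fewer than seven numbers are given


print(find_pair([13, 15, 28, 30, 1, 46, 58]))
print(find_pair([27, 86, 50, 35, 11, 44, 108]))
```

Run on the two lists from Example 7, the procedure singles out 28 and 58 (by
difference) and 86 and 44 (by sum), the same pairs identified in the solution.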


The fascination that mathematicians have with the pigeonhole principle can be 
attributed to its relative simplicity and its utility in proving things in a variety of 
mathematical areas. 


Proposition 9 (Number theory) Given any four positive integers, some pair of
them will have a difference divisible by 3.


PROOF Let four positive integers be given. Place these numbers into boxes
labeled 0, 1, 2 according to the rule: A number x goes into the box labeled i if i
is the remainder when x is divided by 3. By the basic pigeonhole principle, some
box (let's call its label d) contains at least two numbers. Let's call these numbers
a and b. Since a and b are in the box labeled d, then (by the division theorem) this
means that $a = 3K + d$ for some integer K and $b = 3L + d$ for some integer L. In
this case,

$a - b = (3K + d) - (3L + d)$
$      = 3 \cdot (K - L)$

Since $K - L$ is an integer, this means that the difference between a and b is divisible
by 3. ∎
Practice Problem 4 Prove that for any eleven positive integers, some pair of them
will have a difference divisible by 10.


Proposition 10 (Geometry) Given any five points placed in a unit square, there
must be two that are within $\frac{\sqrt{2}}{2}$ of each other.


PROOF Divide the square into four equal quadrants as shown by the dashed
lines in Figure 2-7.

By the basic pigeonhole principle (thinking of the small squares as “boxes”),
we know that no matter how the points are distributed, some small square will
contain at least two points. Since the diagonal of each small square has length
$\frac{\sqrt{2}}{2}$, these two points must be within this distance of each other. ∎
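The proof of Proposition 10 is constructive enough to run: assign each point to a
quadrant and report the first quadrant that receives two points. The Python sketch
below does this for points given as coordinate pairs in the unit square. The function
names, and the convention that points on a dividing line go to the upper or right
quadrant, are assumptions made for illustration.

```python
from math import dist, sqrt


def quadrant(point):
    """Index the quadrant of the unit square that contains the point."""
    x, y = point
    # min(..., 1) keeps points with a coordinate equal to 1 in the last quadrant
    return (min(int(2 * x), 1), min(int(2 * y), 1))


def pair_in_same_quadrant(points):
    """Return the first two points found in a common quadrant, or None."""
    seen = {}
    for p in points:
        q = quadrant(p)
        if q in seen:
            return seen[q], p
        seen[q] = p
    return None  # cannot happen with five or more points


points = [(0.1, 0.1), (0.9, 0.1), (0.1, 0.9), (0.9, 0.9), (0.4, 0.4)]
p, q = pair_in_same_quadrant(points)
print(p, q, dist(p, q) <= sqrt(2) / 2)
```

As with the proof itself, the pair returned is guaranteed to be close enough but is
not necessarily the closest pair among the five points.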

















Figure 2-7 Four quadrants of a square.

Example 8 Figure 2-8 illustrates two different choices of points within the unit
square for Proposition 10. Discuss what the proof of the proposition tells you in each
case.

SOLUTION In the first choice, points C and D are those produced by the
proof. In the second, the points are A and E. Note that the proof does not
necessarily produce the two points closest together. ∎











Figure 2-8 Two possible choices for five points.


Solutions to Practice Problems

1. (a) In Table 2-19, we give the starting and ending point for each type of proof
       of the statement “If $n^2$ is odd, then n is odd.”

   (b) We rewrite the statement as “If n is even and n is a perfect square, then n
       is divisible by 4.” Table 2-20 shows the starting and ending points for a
       proof of this statement.

      | Direct         | Contrapositive | Contradiction
Start | $n^2$ is odd.  | n is even.     | $n^2$ is odd, and n is even.
Goal  | n is odd.      | $n^2$ is even. | Some contradiction.

Table 2-19 The Structure of Proof 1 in Practice Problem 1

2. Since the contradiction is that $m^2$ is divisible by 4, the negation of the
   proposition's hypothesis, we try to find a proof by contrapositive. The
   contrapositive of the original statement is “If n is even, then $n^2$ is divisible
   by 4.” To prove this, let an even integer n be given. This means that $n = 2K$ for
   some integer K, which implies that $n^2 = 4K^2$. Since $K^2$ is an integer, we
   conclude that $n^2$ is divisible by 4. ∎

3. Proof by contradiction. Suppose there is a counterexample. Then $n + 1$ objects
   can be distributed into n boxes, with each box getting no more than one object.
   Label the n boxes $1, 2, 3, \ldots, n$. After the objects have been distributed among
   the boxes, define $x_1, x_2, \ldots, x_n$ by the following rule:

   $x_i$ = the number of objects in box i

   Since each object can go into only one box, we know that

   $x_1 + x_2 + x_3 + \cdots + x_n = n + 1$

   but since each $x_i$ is no more than 1, we have

   $x_1 + x_2 + x_3 + \cdots + x_n \le 1 + 1 + 1 + \cdots + 1$

   $x_1 + x_2 + x_3 + \cdots + x_n \le n$

   This is a contradiction, so there is no counterexample and the theorem is true. ∎

4. Proof Let 11 positive integers be given. Place these numbers into boxes labeled
   $0, 1, 2, 3, \ldots, 8, 9$ according to the following rule: A number x goes into the
   box labeled i if i is the remainder when x is divided by 10 (i.e., i is the ones'
   digit of x). By the basic pigeonhole principle, some box (let's call its label d)
   contains at least two numbers. Let's call these numbers a and b. Since a and b
   are in the box labeled d, then (by the division theorem) this means that
   $a = 10K + d$ for some integer K and $b = 10L + d$ for some integer L. In this case,

   $a - b = (10K + d) - (10L + d)$
   $      = 10 \cdot (K - L)$

   Since $K - L$ is an integer, this means that the difference between a and b is
   divisible by 10. ∎


      | Direct                               | Contrapositive                         | Contradiction
Start | n is even and n is a perfect square. | n is not divisible by 4.               | n is even and n is a perfect square, and n is not divisible by 4.
Goal  | n is divisible by 4.                 | n is odd or n is not a perfect square. | Some contradiction.

Table 2-20 The Structure of Proof 2 in Practice Problem 1


Exercises for Section 2.5

1. Fill in the details to complete the following proof:

   Proposition If an integer n has the form $3K + 1$, then it does not have the form
   $9L + 5$.

   Proof Suppose a counterexample to this statement does exist. Let's agree to call
   it m. Since m is a counterexample, it must make the hypothesis of the original
   statement true while making the conclusion false. That is, m = ______ for some
   integer K and m = ______ for some integer L. Combining these equations gives us
   ______, from which it follows that $K - 3L = \frac{4}{3}$. Since ______ is an integer,
   this is nonsense. Therefore, ______. ∎

2. Fill in the details to complete the following proof:

   Proof. Suppose there is a counterexample to this statement. That is, real numbers
   x and y exist such that ______ and ______. It follows from this that

   $x + y <$ ______
   $= x + y$

   Therefore, these x and y would have to satisfy $x + y < x + y$. Clearly, no numbers
   can do this, so we have a contradiction. Hence, there is no counterexample to the
   original statement. ∎

3. Earlier in this chapter we proved the following using the contrapositive
   statements. Prove them now using “proof by contradiction.”

   (a) If $n^2$ is even, then n is even.

   (b) If $n^2$ is odd, then n is odd.

4. Use “proof by contradiction” to prove that an even perfect square cannot have the
   form $4K + 2$. (Hint: Use the previous exercise.)

5. Use the previous two exercises to prove that the sum of two odd perfect squares is
   not a perfect square. (This shows that no right triangle with integer sides can
   have both legs of odd length.)

6. Use “proof by contradiction” to explain why no integer can be both odd and even.

7. Use “proof by contradiction” to explain why it is impossible for a number n to be
   of the form $5K + 3$ and of the form $5L + 1$ for integers K and L.

8. Using the previous two proofs as models, develop an alternative approach to
   proving the uniqueness of the values q and r in the division theorem, Theorem 8 of
   Section 2.2.

9. Using the proof of Proposition 5 as a model, show that there are no relatively
   prime integers a and b satisfying $a^2 = 2b^2$. (This gives an alternate proof that
   $\sqrt{2}$ is irrational.)

10. Suppose we have already established the “sign rules” for multiplication of
    nonzero real numbers: (i) the product of a positive number and a positive number
    is positive; (ii) the product of a negative number and a positive number is
    negative; and (iii) the product of a negative number and a negative number is
    positive. Use the appropriate one(s) and “proof by contradiction” to prove the
    corresponding rules for division. (Interpret $x \div y = z$ to mean the same thing as
    $y \times z = x$.)

    (a) A positive number divided by a positive number is a positive number.

    (b) A negative number divided by a positive number is a negative number.

    (c) A negative number divided by a negative number is a positive number.

11. Use “proof by contradiction” to prove that if a is a rational number, then
    $a + \sqrt{2}$ is irrational. (Hint: You will need the result in Exercise 9 from
    Section 2.2.)

12. Use “proof by contradiction” to prove that if a is a rational number and b is an
    irrational number, then $a + b$ is irrational.

13. Carefully write the contrapositive of the statement in the previous exercise, and
    then prove it without using “proof by contradiction.”

Prove each of the following existence statements 

(2) There exists a positive rational number r such that 
67 lir 35 

(b) There exists a positive integer n such that 
0.001. 

4€) There existsa positive integer such that mod 7 
lands mod 3 

15, Prove that Vis irrational. 
(Hint: Use Exercise 19 of Section 2.2.)










16. Does the fact that $\sqrt{2}$ is irrational have to be proved by
contradiction? Suppose that your READER has agreed
with the truth of Propositions 1 and 2 below. How would
you prove Proposition 3 to her without using “proof by
contradiction”?

• Proposition 1: r is rational if and only if there exist
  relatively prime integers a and b such that $r = a/b$.

• Proposition 2: If $(a/b)^2 = 2$, then a and b have 2 as a
  common divisor.

• Proposition 3: If $r^2 = 2$, then r is not rational.

17. Prove that any positive real number satisfying r — 
5 must be irrational 

18. Not every nonexistence proof is best handled using
proof by contradiction. Consider the proposition
“There does not exist a largest positive integer.”

(a) The proposition is the negation of “There exists an
    integer n such that n is the largest integer.” We know
    that the negation of a “there exists” statement can
    be written as a “for all” statement. Write the
    proposition in this form.

(b) Prove the proposition you wrote in part (a).

19. The following proof is written using “proof by contradiction.”
Rewrite the proof so that “proof by contradiction” is not used.

Proposition For all integers n, if $5n + 4$ is odd, then n is odd.

Proof. Suppose there is a counterexample to this
statement. Let's agree to call it n. This number must
make the hypothesis true (i.e., $5n + 4$ is odd) and the
conclusion false (i.e., n is even). Since n is even, then
$n = 2K$ for some integer K, and since $5n + 4$ is odd,
then $5n + 4 = 2L + 1$ for some integer L. Combining
these equations gives us

$2L + 1 = 5n + 4$
$       = 5(2K) + 4$
$       = 10K + 4$

The final equation $2L + 1 = 10K + 4$ can be rewritten
as $L - 5K = 3/2$. Since $L - 5K$ must be an integer, this
is a contradiction. Therefore, no counterexample can
exist. ∎

20. The following proof is written using “proof by contradiction.”
Rewrite the proof so that “proof by contradiction” is not used.

Proposition For all real numbers x and y, if $x^2 + y^2 \neq 0$,
then $x \neq 0$ or $y \neq 0$.

Proof Suppose there is a counterexample to this
statement. That is, real numbers x and y exist such that
$x^2 + y^2 \neq 0$ and it is not true that “$x \neq 0$ or $y \neq 0$.” This
is the same as saying that $x^2 + y^2 \neq 0$ and $x = 0$ and
$y = 0$. Of course, $x = 0$ and $y = 0$ implies that $x^2 + y^2 =
0^2 + 0^2 = 0$. Therefore, these x and y would have to satisfy
$x^2 + y^2 \neq 0$ on the one hand and $x^2 + y^2 = 0$ on
the other hand. Clearly, no numbers can do both, so we
have a contradiction. Therefore, no counterexample exists
for the original statement. ∎

21. Refer to Theorem 7. Prove that for all nonzero integers
a and b, the smallest positive integer c in the set $S_{a,b}$ is
the greatest common divisor of a and b. (Hint: First
show c divides both a and b, and then show that every
common divisor of a and b must also divide c.)

22. Prove the following statement, using either the contrapositive
statement or a proof by contradiction: “If the
average age of four children is 10 years old, then (at
least) one child is at least 10 years old.”

23. Prove the following statement, using either the contrapositive
statement or a proof by contradiction: “If the
average net weight of seven boxes of cereal is 17 ounces,
then at least one of the boxes has a net weight of at least
17 ounces.”

24. Prove the following statement, using either the contrapositive
statement or a proof by contradiction: “If 10
real numbers have an average of 89.63, and one of the
numbers is less than 89.63, then one of the numbers
must be greater than 89.63.”

25. The previous three exercises are related to a slightly
different form of the pigeonhole principle, often called
the average version of the pigeonhole principle: “If z is
the average of the collection of numbers

$x_1, x_2, \ldots, x_n$

then at least one number in the list is at least z.” Prove
this statement, using either the contrapositive statement
or a proof by contradiction.

26. Use the average version of the pigeonhole principle
from Exercise 25 to directly prove (a) the basic version
of the pigeonhole principle and (b) the general version
of the pigeonhole principle.

27. Prove the following statement, using either the contrapositive
statement or a proof by contradiction: “If integers
x, y, and z satisfy $x + y + z \ge 11$, then either $x \ge 4$,
$y \ge 4$, or $z \ge 5$.”

28. Prove the following, using either the contrapositive
statement or a proof by contradiction: “In a collection of
numbers $x_1, x_2, \ldots, x_n$, if $x_1 + x_2 + \cdots + x_n > \frac{n(n+1)}{2}$,
then there is a value of i for which $x_i > i$.”

(a) By emulating the proof of Theorem 8.
(b) By proving the contrapositive statement.

29. Prove the statement in Example 5 by considering the
contrapositive statement and not using “proof by contradiction.”

30. Prove the basic pigeonhole principle (Practice Problem
3) by considering the contrapositive statement (and not
using “proof by contradiction”).











25 Contradiction and the Pigeonhole Principle 149 


31. Garrison Keillor signs off his news from Lake Wobe- 35. Use the pigeonhole principle to prove each of the fol- 


child is above average.” 
Explain to him 


gon by saying, “..,where ever 

Your little brother says, “I don't 

why this is funny. 

Use the pigeonhole principle to prove each of the fol- 

lowing statements about numbers: 

(a) Given anysevenintegers, there will be two that have 
a difference divisible by 6. 

(b) Given any five integers, there will be two that have 
a sum or difference divisible by 7. 

$. Fill in the details to complete the proof below: 

Proposition Given any five integers, there will be 
three for which the sum of the squares of those inte- 
gers is divisible by 3. 

Proof Let five integers be given. Think of two boxes, 
one labeled “divisible by 3" and one labeled “not di- 
visible by 3." By the distribution version of the pigeon- 
hole principle (with and m = 
‘we conclude that there are at least three of the numbers 
in one box. Let's refer to these three numbers as a, b, 
and c, and consider two cases based on which box they 
are in. 

© Case 1: Ifa, b,c are in the box labeled “divisible by 

37 then aè + P c is divisible by 3 because ... 

Case 2: If a, b, c are in the box labeled "not divisible 

by 3," then by Practice Problem 4 from Section 2. 

aè can be written in the form. BF can be 

written in the form, „and c? can be writ- 
ten in the form z Hence a? +6? c is 
divisible by 3 because - 

In either case, a? + 6° + c is divisible by 3, completing 

the proof. = 

Use the pigeonhole principle to prove each of the fol- 

lowing statements about numbers: 

(a) Given any seven integers, there will be four for 
which the sum of the squares of those integers is 
divisible by 4. 

(b) Show that among any 52 integers, there are two 
whose sum or difference is divisible by 100. 








"B 





























WE 





3 


* 


3». 


lowing geometric statements 

(a) Whenever five points are placed inside an equilat- 
eral triangle with sides of length I, there must be at 
least two points within 2 of each other. 

(b) Whenever nine points are placed inside a cube with 
sides of length 1, there must be at least two points 
within 2 of each other. 

(c) Let O denote the origin of the plane. Whenever five
points (other than O) are placed on the plane, there
will exist two points P and Q such that ∠POQ is an
acute angle.
For every integer n > 1, show that some number in
{π, 2π, 3π, ..., nπ} is within $\frac{1}{n}$ of an integer by completing the
following steps:

• Define n boxes into which we can place real numbers
that are between 0 and 1 according to the rule
shown in Table 2-21.

• Each multiple of π is placed into these boxes according
to its fractional part. (For example, the fractional
part of 2π ≈ 6.283 is 0.283.)

(a) What does it mean if one of these multiples is in
Box 1?

(b) If no number from {π, 2π, 3π, ..., nπ} is in Box 1,
what does the pigeonhole principle say will happen?

(c) If two multiples of π are in the same box, what does
this mean about the difference between these multiples
of π?

(d) How does this prove the original statement?

Generalize the previous exercise to show that for any
real number r and for every integer n > 1, some number
in the set {r, 2r, 3r, ..., nr} is within $\frac{1}{n}$ of an integer.

Write a proof of Theorem 8 that does not use "proof by
contradiction."

Prove that in any gathering of six people, there are either
three people who are mutual friends (i.e., each pair
are friends) or there are three people who are mutual
strangers (i.e., each pair do not know each other). This is
the easiest result in a field of mathematics called Ramsey
theory.











Table 2-21 Definition of Boxes in Exercise 36


Chapter 2 / A Primer of Mathematical Writing


2.6 Excursion: Representations of Numbers





An important mathematical issue that is relevant for both computer scientists and 
mathematics teachers is the nature of numbers and numerals. To see that numbers 
and numerals are not the same thing, we need only consider the abundant examples of 
numeral systems other than our own Hindu-Arabic system. Almost every culture has 
developed or adapted some system of numerals to express a number of objects. The 
earliest systems usually resemble something like the tally marks that children might 
use to keep score in a game. The Roman numeral system is the best known system 
that is clearly related to tally marks. The Roman numeral V and the Hindu-Arabic 
numeral 5 represent the number of fingers on a person's hand. Hence, a numeral is 
a representation system for a number, and a number is actually an abstract notion of 
quantity.

Although we are all familiar with Roman numerals, few of us would ever consider 
trying to do arithmetic with them. If asked how many years there were between Super 
Bowls XIV and XXXI, we would all convert these two Roman numerals into regular 
Hindu-Arabic numerals and subtract with our familiar procedure, complete with
regrouping in this case. It is not strictly necessary to have a numeral system to do
arithmetic, but over the last eight hundred† years, the versatility of the Hindu-Arabic
system in performing arithmetic procedures has made it the most common system
in the world. In this section, we will refer to this common system as the base ten, or
decimal, numeral system.








Decimal and Binary Numerals 


There are many properties of arithmetic that make computations simple. Some of 
them really are properties of the operations, but others are actually properties of the
decimal numeral system itself.


Example 1 Which of the following properties of multiplication are true no
matter what numeral system is used?

1. (The identity property) For all $a \in \mathbb{Z}$, $a \cdot 1 = a$.
2. (The commutative property) For all $a, b \in \mathbb{Z}$, $a \cdot b = b \cdot a$.
3. (The distributive property) For all $a, b, c \in \mathbb{Z}$, $a \cdot (b + c) = a \cdot b + a \cdot c$.
4. (Shifting) For all $a \in \mathbb{Z}$, $10 \cdot a$ adds a "0" on the right end of a.


SOLUTION Only the fourth property depends on the numeral system. Just
try to make sense of that rule with Roman numerals!








So if the shifting rule above is not really a property of multiplication, then how 
can we prove that it is true? To address this, we need to first give a formal definition 
of the decimal representation of a positive integer. 





† The first "Western" book in which the Hindu-Arabic numbers were used was the book on arithmetic
Liber Abaci, published in A.D. 1202 by none other than our old friend Fibonacci. This same book includes
the famous Fibonacci number sequence.







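Definition Given a positive integer X, the decimal representation for X
is a string consisting of digits from {0, 1, 2, 3, 4, 5, 6, 7, 8, 9} that looks like
$d_n d_{n-1} \cdots d_2 d_1 d_0$, where

$$X = \sum_{i=0}^{n} d_i \cdot 10^i = d_n \cdot 10^n + d_{n-1} \cdot 10^{n-1} + \cdots + d_2 \cdot 10^2 + d_1 \cdot 10^1 + d_0 \cdot 10^0$$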








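Example 2 Write the positive integer 24,317 in expanded form as shown in the
definition. Multiply the expanded form by 10 and simplify.
SOLUTION $24{,}317 = 2 \cdot 10^4 + 4 \cdot 10^3 + 3 \cdot 10^2 + 1 \cdot 10^1 + 7 \cdot 10^0$. When we
multiply by 10, we get

$$
\begin{aligned}
10 \cdot (2 \cdot 10^4 + 4 \cdot 10^3 + 3 \cdot 10^2 + 1 \cdot 10^1 + 7 \cdot 10^0)
&= 10 \cdot 2 \cdot 10^4 + 10 \cdot 4 \cdot 10^3 + 10 \cdot 3 \cdot 10^2 + 10 \cdot 1 \cdot 10^1 + 10 \cdot 7 \cdot 10^0 \\
&= 2 \cdot 10^5 + 4 \cdot 10^4 + 3 \cdot 10^3 + 1 \cdot 10^2 + 7 \cdot 10^1 + 0 \cdot 10^0 \\
&= 243{,}170
\end{aligned}
$$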





Proposition 1 Multiplying a decimal numeral by 10 shifts the digits one place 
to the left and places a "0" digit on the end. 
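PROOF The proof simply repeats the steps of Example 2, but in general rather
than for a particular integer. The base ten representation of a number x is given
by $d_n d_{n-1} \cdots d_2 d_1 d_0$, which means

$$x = \sum_{i=0}^{n} d_i \cdot 10^i = d_n \cdot 10^n + d_{n-1} \cdot 10^{n-1} + \cdots + d_2 \cdot 10^2 + d_1 \cdot 10^1 + d_0 \cdot 10^0$$

When this is multiplied by 10, we get

$$
\begin{aligned}
10x &= \sum_{i=0}^{n} d_i \cdot 10^{i+1} \\
&= d_n \cdot 10^{n+1} + d_{n-1} \cdot 10^{n} + \cdots + d_2 \cdot 10^3 + d_1 \cdot 10^2 + d_0 \cdot 10^1 \\
&= d_n \cdot 10^{n+1} + d_{n-1} \cdot 10^{n} + \cdots + d_2 \cdot 10^3 + d_1 \cdot 10^2 + d_0 \cdot 10^1 + 0 \cdot 10^0
\end{aligned}
$$

Hence, $10 \cdot x$ has the base ten representation $d_n d_{n-1} \cdots d_2 d_1 d_0 0$; that is, the base
ten representation of x with the digits shifted left and a 0 placed at the end. ∎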
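The shifting argument can be checked numerically. The following is a minimal sketch, not part of the original text: the helper name `expanded_value` is an assumption introduced for illustration, and it simply evaluates the place-value sum from the definition for a list of digits.

```python
def expanded_value(digits, base=10):
    """Evaluate a digit list (most significant digit first) as a place-value sum."""
    n = len(digits) - 1
    return sum(d * base ** (n - i) for i, d in enumerate(digits))


digits = [2, 4, 3, 1, 7]   # the numeral 24,317 from Example 2
shifted = digits + [0]     # shift the digits left and place a 0 on the end

print(expanded_value(digits))
print(expanded_value(shifted))
print(expanded_value(shifted) == 10 * expanded_value(digits))
```

The same helper accepts other bases through its `base` parameter, which mirrors how the rest of the section replaces powers of 10 with powers of another base.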


Properties of decimal numerals are hard to think about critically since it is 
difficult for us to think about the concept of a number like 112 without picturing the 
decimal numeral 112. To overcome this difficulty, we should work in other numeral 
systems in which we cannot take the processes or properties for granted. We will 
continue using a place value system—that is, a system in which there are digits whose 
positions reflect their values—but we will change the base to numbers other than 10. 
Not only does this help us understand properties of our decimal system better, it is 
also of practical value for computer science applications. 










Definition The base two (a.k.a. binary) representation of a positive integer
X is a string consisting of digits from {0, 1} that looks like $b_n b_{n-1} \cdots b_2 b_1 b_0$,
where

$$X = \sum_{i=0}^{n} b_i \cdot 2^i = b_n \cdot 2^n + b_{n-1} \cdot 2^{n-1} + \cdots + b_2 \cdot 2^2 + b_1 \cdot 2^1 + b_0 \cdot 2^0$$

We will refer to binary digits as bits, a term coined by Princeton University
statistician and computer pioneer John W. Tukey (1915-2000).











When our discussion leaves some doubt as to the base being used, we will simply
write the base as a subscript. For example, $(1001011)_{\text{ten}}$ represents "one million, one
thousand and eleven," while the binary numeral $(1001011)_{\text{two}}$ represents a much
smaller number, as we see in the next example.


Example 3


1. What decimal numeral represents the same number as the binary numeral
$(1001011)_{\text{two}}$?


2. If we label the number in the first part as x, what is the binary representation for
2x? For 2x + 1?





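SOLUTION


1. For decimal numbers, the place values are ones, tens, hundreds, and so on;
that is, $10^0$, $10^1$, $10^2$, and so on. For binary numbers, the place values follow
the same pattern, but using powers of 2 rather than powers of 10. From Table
2-22, we conclude that $(1001011)_{\text{two}}$ represents

$$
\begin{aligned}
&1 \cdot 2^6 + 0 \cdot 2^5 + 0 \cdot 2^4 + 1 \cdot 2^3 + 0 \cdot 2^2 + 1 \cdot 2^1 + 1 \cdot 2^0 \\
&= 1 \cdot 64 + 0 \cdot 32 + 0 \cdot 16 + 1 \cdot 8 + 0 \cdot 4 + 1 \cdot 2 + 1 \cdot 1 \\
&= 75
\end{aligned}
$$


2. As we did for decimal numerals in Example 2, we have

$$
\begin{aligned}
2x &= 2 \cdot (1 \cdot 2^6 + 0 \cdot 2^5 + 0 \cdot 2^4 + 1 \cdot 2^3 + 0 \cdot 2^2 + 1 \cdot 2^1 + 1 \cdot 2^0) \\
&= 1 \cdot 2^7 + 0 \cdot 2^6 + 0 \cdot 2^5 + 1 \cdot 2^4 + 0 \cdot 2^3 + 1 \cdot 2^2 + 1 \cdot 2^1 + 0 \cdot 2^0 \\
&= (10010110)_{\text{two}}
\end{aligned}
$$

and 2x + 1 will be

$$
\begin{aligned}
2x + 1 &= 2 \cdot (1 \cdot 2^6 + 0 \cdot 2^5 + 0 \cdot 2^4 + 1 \cdot 2^3 + 0 \cdot 2^2 + 1 \cdot 2^1 + 1 \cdot 2^0) + 1 \\
&= 1 \cdot 2^7 + 0 \cdot 2^6 + 0 \cdot 2^5 + 1 \cdot 2^4 + 0 \cdot 2^3 + 1 \cdot 2^2 + 1 \cdot 2^1 + 1 \cdot 2^0 \\
&= (10010111)_{\text{two}}
\end{aligned}
$$

The binary numeral for 2x is formed by shifting x's bits to the left and placing
a "0" on the right; for 2x + 1 we place a "1" on the right after doing the shift.


Bit         | 1           | 0           | 0        | 1      | 0     | 1     | 1
Place value | $2^6$       | $2^5$       | $2^4$    | $2^3$  | $2^2$ | $2^1$ | $2^0$
In words    | Sixty-fours | Thirty-twos | Sixteens | Eights | Fours | Twos  | Ones

Table 2-22 Place Values for a Binary Number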




Example 4 Suppose we have found the binary representations for all natural
numbers up to and including 18 in Table 2-23. Use this information to find the binary
representation for 19.
SOLUTION By the division theorem (applied to the division problem 19 ÷ 2),
we can write $19 = 2 \cdot 9 + 1$. If we look in the chart, we find that the base two
representation of 9 is 1001. Hence, the representation for 19 can be found by
placing the remainder (1) onto the right side of the representation for 9, which
gives us 10011.








Practice Problem 1 Using the chart in Table 2-23, find the binary representations
for the numbers 25, 39, and 50.


It might not be clear that our definition allows every number to have a binary
representation, so we will take the opportunity to use induction to prove that this
is so. We conclude (since every natural number has a binary representation) that
the binary numeral system is just as good as the decimal system for representing
numbers. Example 4 and Practice Problem 1 illustrate the crucial step of the proof.








Proposition 2 Every natural number has a binary representation.
PROOF (By induction) Let P(n) be the statement "n has a binary representation."
Since $(0)_{\text{two}}$ represents 0, and $(1)_{\text{two}}$ represents 1, we have checked
statements P(0) and P(1) without even trying. Now let m ≥ 2 be given such that
P(0), P(1), ..., P(m − 1) have been checked. That is, every number up to and
including m − 1 has a binary representation. We next must consider the number
m. By the division theorem, there are integers q (the quotient) and r (the remainder)
such that (1) $m = 2 \cdot q + r$ and (2) r is from the set {0, 1}. Since q < m,
statement P(q) has already been checked, so we know that q has a binary
representation; let's refer to it as $b_l b_{l-1} \cdots b_2 b_1 b_0$. By this, we mean that $q = \sum_{i=0}^{l} b_i \cdot 2^i$.
Now

$$
\begin{aligned}
m &= 2 \cdot q + r \\
&= 2 \cdot \left( \sum_{i=0}^{l} b_i \cdot 2^i \right) + r \\
&= \left( \sum_{i=0}^{l} b_i \cdot 2^{i+1} \right) + r \cdot 2^0
\end{aligned}
$$

In the above expressions, we see that multiplying q by 2 shifts the bits for q
to the left and adds a "0" on the right. This is the same effect that multiplying a
decimal number by 10 has in Proposition 1. Hence, when r is added, this essentially
places r in the ones place. This means that the binary representation for m is
$b_l b_{l-1} \cdots b_2 b_1 b_0 r$. ∎


n | Base Two | n | Base Two | n  | Base Two | n  | Base Two | n  | Base Two
1 | 1        | 5 | 101      | 9  | 1001     | 13 | 1101     | 17 | 10001
2 | 10       | 6 | 110      | 10 | 1010     | 14 | 1110     | 18 | 10010
3 | 11       | 7 | 111      | 11 | 1011     | 15 | 1111     |    |
4 | 100      | 8 | 1000     | 12 | 1100     | 16 | 10000    |    |

Table 2-23 Binary Numerals for Numbers 1 through 18


Not only does this proposition prove that a base two representation always
exists, it actually provides instructions for finding it. We can summarize the steps as
follows, given the natural number n:

1. Write n = 2q + r (i.e., divide n by 2, finding the quotient q and the remainder r).
2. Write down the base two representation for q.
3. Append r to the right of the string you wrote in step 2.


We used this algorithm in Example 4 and Practice Problem 1, where we already had a 
table of binary values. The question arises, how do we proceed in general? If we don't 
already know the binary representation for q, what do we do? The answer is that we 
apply this same process to q. To make this easier to do, we can rewrite the process 
as a repeating process as shown below. It is important to notice that the instructions 
have the unusual feature of writing the digits of the base two number from right to 
left. 








Algorithm for Writing a Number in Base Two


• Input a natural number n.
• While n > 0, do the following:
    Divide n by 2 and get a quotient q and remainder r.
    Write r as the next (right-to-left) digit.
    Replace the value of n with q, and repeat.
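The repeating process above translates almost line for line into a short program. This is a minimal sketch under a few assumptions: the function name `to_base_two` is introduced here for illustration, the result is returned as a string of bits, and the input 0 (which never enters the loop) is handled by returning "0".

```python
def to_base_two(n):
    """Return the base two numeral for a natural number n as a string of bits."""
    if n == 0:
        return "0"              # the loop below writes nothing for n = 0
    bits = []                   # bits are produced right to left
    while n > 0:
        q, r = divmod(n, 2)     # quotient and remainder on division by 2
        bits.append(str(r))     # r is the next bit, moving leftward
        n = q                   # replace n with q and repeat
    return "".join(reversed(bits))


print(to_base_two(19))
```

Because the remainders are generated from the ones place upward, the list is reversed at the end so that the numeral reads in the usual left-to-right order.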

















Example 5 Use the algorithm to write the base ten number 77 in base two.
SOLUTION We trace the algorithm as follows:
• We begin the algorithm with n = 77.
• 77 ÷ 2 = 38 with remainder 1, so we write 1, and let n = 38.
• 38 ÷ 2 = 19 with remainder 0, so we write 0, and let n = 19.
• 19 ÷ 2 = 9 with remainder 1, so we write 1, and let n = 9.
• 9 ÷ 2 = 4 with remainder 1, so we write 1, and let n = 4.
• 4 ÷ 2 = 2 with remainder 0, so we write 0, and let n = 2.
• 2 ÷ 2 = 1 with remainder 0, so we write 0, and let n = 1.
• 1 ÷ 2 = 0 with remainder 1, so we write 1, and let n = 0.
• Since n = 0, we quit.

Thus, the numeral formed by the algorithm is 1001101, so this is the base
two numeral. To check, we can compute

$$1 \cdot 2^6 + 0 \cdot 2^5 + 0 \cdot 2^4 + 1 \cdot 2^3 + 1 \cdot 2^2 + 0 \cdot 2^1 + 1 \cdot 2^0 = 64 + 8 + 4 + 1 = 77$$


Practice Problem 2 Write the base ten numbers 37 and 125 in base two.


Why should we care about properties of numbers represented in different numeral
systems? Just as with the shifting property in Proposition 1, there are properties
of arithmetic that can only really be understood in terms of our decimal numeral
system. An interesting one is the well-known test for divisibility by 9:











A number is divisible by 9 if and only if the sum of its digits is divisible by 9. 


As you will see in the exercises, this statement is not true when the number is
written as a numeral using a base other than ten. We will show how a number is
related to the sum of its decimal digits, and we will leave the actual explanation of
the rule above as a practice problem.





Proposition 3 If the digits of the decimal representation of a positive integer x
sum to s, then x − s is divisible by 9. (HINT: Exercise 5 from Section 2.4 proves that
$10^n - 1$ is divisible by 9 for all n ≥ 0.)
PROOF The base ten representation of a number x is given by the string of
digits $d_n d_{n-1} \cdots d_2 d_1 d_0$, which means

$$x = \sum_{i=0}^{n} d_i \cdot 10^i = d_n \cdot 10^n + d_{n-1} \cdot 10^{n-1} + \cdots + d_2 \cdot 10^2 + d_1 \cdot 10^1 + d_0 \cdot 10^0$$

On the other hand, if we use the letter s to stand for the sum of these digits,
then we have

$$s = \sum_{i=0}^{n} d_i = d_n + d_{n-1} + \cdots + d_2 + d_1 + d_0$$

We can put these two facts together to get

$$x - s = \sum_{i=0}^{n} d_i \cdot 10^i - \sum_{i=0}^{n} d_i = \sum_{i=0}^{n} d_i \cdot (10^i - 1)$$

Since $10^i - 1$ is divisible by 9 for each of the values of i in the above sum, it follows
that x − s is divisible by 9. ∎







Example 6 Trace the above proof with the value x = 1,934.


SOLUTION Using the expanded base ten representation, we have
$1{,}934 = 1 \cdot 10^3 + 9 \cdot 10^2 + 3 \cdot 10^1 + 4 \cdot 10^0$ and the sum of the digits is
s = 1 + 9 + 3 + 4, so

$$
\begin{aligned}
x - s &= (1 \cdot 10^3 + 9 \cdot 10^2 + 3 \cdot 10^1 + 4 \cdot 10^0) - (1 + 9 + 3 + 4) \\
&= 1 \cdot (10^3 - 1) + 9 \cdot (10^2 - 1) + 3 \cdot (10^1 - 1) + 4 \cdot (10^0 - 1) \\
&= 1 \cdot 999 + 9 \cdot 99 + 3 \cdot 9 + 4 \cdot 0
\end{aligned}
$$

Since each term in the sum is divisible by 9, then x − s is divisible by 9.
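Proposition 3 is easy to spot-check by machine. The sketch below is illustrative only and is not a substitute for the proof; the function names `digit_sum` and `difference_is_multiple_of_nine` are assumptions introduced here.

```python
def digit_sum(x):
    """Sum of the decimal digits of a positive integer x."""
    return sum(int(ch) for ch in str(x))


def difference_is_multiple_of_nine(x):
    """Check the claim of Proposition 3 for one value: 9 divides x - s."""
    return (x - digit_sum(x)) % 9 == 0


x = 1934
print(digit_sum(x), x - digit_sum(x))
print(all(difference_is_multiple_of_nine(n) for n in range(1, 10_000)))
```

A finite check like this can only fail to find a counterexample; the argument with $10^i - 1$ is what covers every positive integer.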


Proposition 4 A natural number is divisible by 9 if and only if the sum of its 
(decimal) digits is divisible by 9. 
PROOF Since this statement has an "if and only if" form, we must actually prove 
two things: 


• Claim 1: If a natural number X is divisible by 9, then the sum of its digits is
divisible by 9.
Proof Let a natural number X that is divisible by 9 be given. This means that
X = 9K for some integer K. We will use S to denote the sum of the decimal
digits of X. Proposition 3 tells us that X − S is divisible by 9, which means that
X − S = 9L for some integer L. Combining these facts tells us that

$$
\begin{aligned}
S &= X - (X - S) \\
&= 9K - 9L \\
&= 9(K - L)
\end{aligned}
$$

Since K − L is an integer, we can conclude that S is divisible by 9.
• Claim 2: If the sum of the digits of a natural number X is divisible by 9, then X
is divisible by 9.
Proof See the practice problem below.





Practice Problem 3 Prove that if the sum of the digits of a natural number X is 
divisible by 9, then X is divisible by 9. 


Numbers in Other Bases 


We have discussed our usual decimal representation (using the digits 0 through 9)
and the binary representation (using the digits 0 and 1). These ideas can be generalized
to other bases. For example, in base five we would use the digits 0, 1, 2, 3, 4, and
place values would be powers of 5. In general, a natural number x has the base b
representation

$$x = d_n d_{n-1} \cdots d_2 d_1 d_0$$

where each of the digits is from the set {0, 1, 2, ..., b − 1}. The exact meaning of this
place value numeral is

$$x = \sum_{i=0}^{n} d_i \cdot b^i$$


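Example 7 Write the base ten number for each of the following:

1. $(412)_{\text{five}}$
2. $(2046)_{\text{seven}}$
3. $(4011)_{\text{three}}$

SOLUTION
1. $4 \cdot 5^2 + 1 \cdot 5^1 + 2 \cdot 5^0 = 4 \cdot 25 + 1 \cdot 5 + 2 \cdot 1 = 107$
2. $2 \cdot 7^3 + 0 \cdot 7^2 + 4 \cdot 7^1 + 6 \cdot 7^0 = 2 \cdot 343 + 4 \cdot 7 + 6 \cdot 1 = 720$
3. This is not a base three numeral, since
4 is not a legal digit for a base three numeral.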
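The place-value sums in Example 7 can be evaluated mechanically. The sketch below is one possible approach and makes two assumptions: the name `from_base` is hypothetical, and the sum is accumulated by Horner's rule (multiply by b, then add the next digit), which yields the same value as adding up $d_i \cdot b^i$ term by term. Illegal digits are rejected, as in the third part of the example.

```python
def from_base(digits, b):
    """Value of a base-b numeral given as a list of digits, most significant first."""
    value = 0
    for d in digits:
        if not 0 <= d < b:
            raise ValueError(f"{d} is not a legal digit in base {b}")
        value = value * b + d   # Horner's rule: shift left one place, then add d
    return value


print(from_base([4, 1, 2], 5))
print(from_base([2, 0, 4, 6], 7))
```

Calling `from_base([4, 0, 1, 1], 3)` raises `ValueError`, because 4 is not among the digits 0, 1, 2.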








Proposition 5 Let any integer b ≥ 2 be given. Every natural number has a
representation in base b.


PROOF The proof is left as Exercise 23. It is an induction proof almost identical
to that for Proposition 2. ∎


Since the proof for a general base b is almost identical to the proof for base two,
you should not be surprised to find out that the proof also yields an algorithm for
conversion to base b. Moreover, that algorithm is essentially the same, but we divide
by b rather than by 2 in each step.


Example 8 Write the base ten number 1,964 in base eight.


SOLUTION We trace the algorithm (dividing by 8 rather than 2) as follows:
• We begin the algorithm with n = 1,964.
• 1,964 ÷ 8 = 245 with remainder 4, so we write 4, and let n = 245.
• 245 ÷ 8 = 30 with remainder 5, so we write 5, and let n = 30.
• 30 ÷ 8 = 3 with remainder 6, so we write 6, and let n = 3.
• 3 ÷ 8 = 0 with remainder 3, so we write 3, and let n = 0.
• Since n = 0, we quit.


Thus, the numeral $d_3 d_2 d_1 d_0$ formed by the algorithm is 3654, so this is the
base eight numeral. To check, we can compute

$$3 \cdot 8^3 + 6 \cdot 8^2 + 5 \cdot 8^1 + 4 \cdot 8^0 = 1{,}964$$

Practice Problem 4 Write the base ten number 1,992 in base five.


Hexadecimal Numerals in Computer Science 


Binary representations are the most important in computer science, but if you look
at machine-level addresses and values, you will often see mysterious strings of digits
and letters instead of zeroes and ones. When you see these strings, you are looking
at a higher representation of the binary numerals that represent the lowest level
of machine representation. This higher representation is called the hexadecimal, or
base sixteen, system, and we will see shortly how these numbers stand in for binary
numerals.


Example 9 Write the hexadecimal numeral 321 in base ten.
SOLUTION $(321)_{\text{hex}}$ means $3 \cdot 16^2 + 2 \cdot 16 + 1 = 801$ in base ten.


Hexadecimal (base sixteen) is strange in that its base is larger than our decimal
base of ten. This means that the digits from Proposition 5 must be from the set of
numbers from zero to 15. But we cannot use 15 as a single digit, for example, since
$(15)_{\text{hex}}$ represents 1 in the sixteens' place and 5 in the ones' place, which is the decimal
numeral 21. To overcome this difficulty, we introduce some symbols whose use as
digits will not cause this confusion. We will use the letters A, B, C, D, E, and F to
represent the numbers 10, 11, 12, 13, 14, and 15, respectively.


Example 10 Write the decimal numeral 8,940 as a hexadecimal numeral.
SOLUTION We can simply follow the algorithm.
• We begin the algorithm with n = 8,940.
• 8,940 ÷ 16 = 558 with remainder 12, so we write C and set n = 558.
• 558 ÷ 16 = 34 with remainder 14, so we write E (to the left of the previous
digit) and set n = 34.
• 34 ÷ 16 = 2 with remainder 2, so we write 2 (to the left of the previous
digit) and set n = 2.
• 2 ÷ 16 = 0 with remainder 2, so we write 2 (to the left of the previous digit)
and set n = 0.
• Since n = 0, we quit.


We conclude that the hexadecimal numeral 22EC represents the decimal
numeral 8,940.
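The division algorithm works the same way for any base from two to sixteen once the remainders are mapped to digit symbols. This is a minimal sketch, assuming a hypothetical function name `to_base` and a digit string `DIGITS` that uses the letters A through F for the values 10 through 15 as described above.

```python
DIGITS = "0123456789ABCDEF"   # symbol for each digit value 0..15


def to_base(n, b):
    """Return the base-b numeral (2 <= b <= 16) for a natural number n."""
    if not 2 <= b <= len(DIGITS):
        raise ValueError("base must be between 2 and 16")
    if n == 0:
        return "0"
    symbols = []                    # digits are produced right to left
    while n > 0:
        n, r = divmod(n, b)         # new n is the quotient, r the remainder
        symbols.append(DIGITS[r])
    return "".join(reversed(symbols))


print(to_base(1964, 8))    # Example 8
print(to_base(8940, 16))   # Example 10
```

Only the divisor and the digit alphabet change between bases; the loop itself is the one used for base two.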


Practice Problem 5 — Which of the following (decimal) numbers make an English 
word when you write their hexadecimal representations? 


(a) 4013 
(b) 65261 
(c) 700,638 


As we mentioned before, the main benefit of hexadecimal numerals is that they 
actually stand in for binary numerals when the latter would be long and cumbersome. 
Before we continue that discussion, we recap what we know so far about binary 
numerals and hexadecimal digits in Table 2-24. Note that the set of hexadecimal digits 
corresponds exactly to the set of four-bit binary numerals. With current technology, 
a common computer configuration has internal words consisting of 32-bit binary
numerals. If you look at a binary numeral of this length, it can be rather daunting: 





Binary      | 0000 | 0001 | 0010 | 0011 | 0100 | 0101 | 0110 | 0111 | 1000 | 1001 | 1010 | 1011 | 1100 | 1101 | 1110 | 1111
Decimal     | 0    | 1    | 2    | 3    | 4    | 5    | 6    | 7    | 8    | 9    | 10   | 11   | 12   | 13   | 14   | 15
Hexadecimal | 0    | 1    | 2    | 3    | 4    | 5    | 6    | 7    | 8    | 9    | A    | B    | C    | D    | E    | F

Table 2-24 Binary-Hexadecimal Conversion


10110100010010100011001110010111

By breaking it into eight four-bit pieces, it is a little easier to read:

1011 0100 0100 1010 0011 0011 1001 0111

and when we realize that each of these four-bit pieces corresponds to a single
hexadecimal digit, the representation improves even more:

1011 0100 0100 1010 0011 0011 1001 0111
  B    4    4    A    3    3    9    7

Hence, the hexadecimal representation of
$(10110100010010100011001110010111)_{\text{two}}$
is $(\text{B44A3397})_{\text{hex}}$. Since this gives us a quick and easy procedure for changing between
hexadecimal and binary numerals, it is clear why, even when the real representation
of an address or value is stored as a binary numeral, a human being would prefer to
look at the hexadecimal representation.


Example 11 Convert the binary numeral $(111011110110010101111)_{\text{two}}$ to its
hexadecimal representation.
SOLUTION Since every block of four bits in the 21-bit string
111011110110010101111 represents a single hexadecimal digit, we will need
to pad the left side of the binary representation with three leading 0's in order
for the total number of bits to be a multiple of 4. This will not change the value
of the numeral.

0001 1101 1110 1100 1010 1111
  1    D    E    C    A    F

By blocking the bits in fours and converting each block of four bits to the
corresponding hexadecimal digit, we see that the hexadecimal representation
will be 1DECAF.
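The pad-and-block procedure of Example 11 can be written out directly. The following is a sketch only; the names `HEX_DIGITS` and `binary_to_hex` are assumptions made for illustration, and the input is taken to be a string of the characters 0 and 1.

```python
HEX_DIGITS = "0123456789ABCDEF"


def binary_to_hex(bits):
    """Convert a string of bits to hexadecimal by grouping the bits in fours."""
    padding = (-len(bits)) % 4              # leading zeros needed to reach a multiple of 4
    padded = "0" * padding + bits
    blocks = [padded[i:i + 4] for i in range(0, len(padded), 4)]
    # Each four-bit block has a value from 0 to 15, which selects one hex digit.
    return "".join(HEX_DIGITS[int(block, 2)] for block in blocks)


print(binary_to_hex("111011110110010101111"))
```

No division is needed here: because 16 is the fourth power of 2, each hexadecimal digit depends on exactly four bits.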





Practice Problem 6 Convert the hexadecimal numeral $(\text{2FACED})_{\text{hex}}$ to binary
and then to decimal.


Some Mathematical Applications of Binary Numbers
Characterization of the solution to the Josephus problem
Recall the Josephus problem (as described in [32]) from Section 1.1 of this book:


In the Jewish revolt against Rome, Josephus and 39 of his comrades were holding
out against the Romans in a cave. With defeat imminent, they resolved that, like
the rebels at Masada, they would rather die than be slaves to the Romans. They
decided to arrange themselves in a circle. One man was designated as number
one, and they proceeded clockwise killing every seventh man. . . . Josephus was
among other things an accomplished mathematician; so he instantly figured out
where he ought to sit in order to be the last to go. But when the time came, instead
of killing himself he joined the Roman side.





For a binary numeral $b_n b_{n-1} \cdots b_2 b_1 b_0$, we define the "cyclic left shift" to be the
result of moving the leftmost bit $b_n$ to the rightmost place. For example, the cyclic
left shift of the binary numeral 1010011 is the binary numeral 0100111. If we form
Table 2-25 showing the result of performing this operation on the binary numerals 
that represent the first 10 positive integers, a familiar pattern emerges. If possible, 
we would like to find some algebraic “closed formula” that will directly relate the 
numbers in the top row to the numbers in the bottom row of this table. If we think 
about what the cyclic left shift does arithmetically, this is a fairly simple process that 
we can study in a specific example before we state it more generally. Let's use the 
example n = 11 since it would be the next value to be entered into the table above. 




















• $(11)_{\text{ten}} = (1011)_{\text{two}}$

• Removing the leftmost 1 from the binary numeral 1011 would give us 011. This
is the same thing as subtracting from 11 the largest power of 2 that is less than or
equal to 11. In this case, $11 - 2^3 = 3$, and 011 is the binary numeral for 3.

• Placing this 1 on the right of the remaining string is actually done in two steps:

1. First, we shift the string to the left. In our example, 011 shifted one place to the
left would give us 0110. This is equivalent to multiplying the number for 011 by
2. In this case, $2 \cdot 3 = 6$, and 0110 really is the binary numeral representing 6.

2. After shifting these bits one place to the left, we can simply add 1 to put a 1
digit on the rightmost end of the string, to give us 0111. Adding 1 to 6 gives us
7, which is indeed the number represented by the binary numeral 0111.

• Hence, the result of a cyclic left shift on the binary numeral representing 11 is the
binary numeral representing 7.


This example can be generalized to give us a formula for the result of a cyclic
left shift.


Proposition 6 Given a positive integer n, find the largest value of k
for which $2^k \le n$; the number in the bottom row of Table 2-25 will be given by the formula
$2 \cdot (n - 2^k) + 1$.

PROOF Given any positive integer n and the number k as defined above,

• Removing the leftmost 1 from the binary numeral for n is the same thing as
subtracting $2^k$ from n, where $2^k$ is the largest power of 2 that is less than or equal to n.
• Placing this 1 on the right of the remaining string is done by first shifting the
string to the left (which we have seen is done by multiplying the number by 2)
and adding 1.
∎


Value of n                | 1 | 2  | 3  | 4   | 5   | 6   | 7   | 8    | 9    | 10
Binary representation of n | 1 | 10 | 11 | 100 | 101 | 110 | 111 | 1000 | 1001 | 1010
Cyclic left shift          | 1 | 01 | 11 | 001 | 011 | 101 | 111 | 0001 | 0011 | 0101
Decimal after shift        | 1 | 1  | 3  | 1   | 3   | 5   | 7   | 1    | 3    | 5

Table 2-25 Cyclic Left Shift of Binary Numerals


Example 12 Test the formula for n = 10, n = 20, and n = 30, and verify (using
the binary numbers) that it works for these values.


SOLUTION For n = 10, the formula predicts that the cyclic left shift will
yield the number $2 \cdot (10 - 2^3) + 1 = 5$, which is correct according to the table
above. For n = 20, the formula predicts that the cyclic left shift will yield the
number $2 \cdot (20 - 2^4) + 1 = 9$, which is correct because $(20)_{\text{ten}} = (10100)_{\text{two}}$ and
the cyclic left shift on $(10100)_{\text{two}}$ yields $(01001)_{\text{two}} = (9)_{\text{ten}}$. For n = 30, the formula
predicts that the cyclic left shift will yield the number $2 \cdot (30 - 2^4) + 1 = 29$,
which is correct because $(30)_{\text{ten}} = (11110)_{\text{two}}$ and the cyclic left shift on
$(11110)_{\text{two}}$ yields $(11101)_{\text{two}} = (29)_{\text{ten}}$.
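The agreement between the cyclic left shift and the formula $2 \cdot (n - 2^k) + 1$ can be tested for many values at once. This sketch is illustrative: the function names are assumptions, the shift is carried out on the string of bits, and k is obtained from Python's built-in `int.bit_length`, which for a positive n is one more than the largest k with $2^k \le n$.

```python
def cyclic_left_shift(n):
    """Move the leftmost bit of n's binary numeral to the right end; return the new value."""
    bits = format(n, "b")
    return int(bits[1:] + bits[0], 2)


def shift_formula(n):
    """Evaluate 2 * (n - 2**k) + 1, where 2**k is the largest power of 2 not exceeding n."""
    k = n.bit_length() - 1
    return 2 * (n - 2 ** k) + 1


for n in (10, 11, 20, 30):
    print(n, cyclic_left_shift(n), shift_formula(n))
```

For each n printed, the last two columns should match, as Proposition 6 asserts.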














Theorem 7 In the Josephus game with n players in which every second person is
eliminated, the number of the last remaining person is $2 \cdot (n - 2^k) + 1$, where k is the
largest integer for which $2^k \le n$.
PROOF In Example 8 of Section 2.4, we proved by induction that for every
integer m ≥ 0, "In the Josephus game with $2^m$ players in which every second
person is eliminated, the first person passed over (person 1 in the given labeling
of the circle) will be the last one left at the end." Given this result, it is easy to
find a formula for the last person left at the end given any starting number n.
Let a positive integer n be given, and let k be the largest number such that $2^k \le n$.
Eliminate people 2, 4, 6, ... until $2^k$ people remain. You will have eliminated
exactly $n - 2^k$ people. This number must be less than half the total you started
with, so all the eliminated people have even position numbers. Hence, the list of
people eliminated looks like

$$2, 4, 6, \ldots, 2 \cdot (n - 2^k)$$

At this point, it looks like the game starting with $2^k$ people in a circle, and the first
one you are going to let live is the very next one, person $2 \cdot (n - 2^k) + 1$. By the
example cited above, this same person will be the last one left at the end of this
game. ∎








Corollary 8 In the Josephus game with n players in which every second person
is eliminated, the number of the last remaining person can be found by performing a 
cyclic left shift on the binary representation of n. 
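A direct simulation of the game gives an independent check of the corollary. The sketch below makes several assumptions that are not in the text: the function names are hypothetical, players are numbered 1 through n, person 1 is the first one passed over, and the circle is modeled with `collections.deque`, rotating once to pass over a player and then removing the next one.

```python
from collections import deque


def josephus_survivor(n):
    """Simulate eliminating every second person; return the last remaining number."""
    circle = deque(range(1, n + 1))
    while len(circle) > 1:
        circle.rotate(-1)    # the person at the front is passed over and moves to the back
        circle.popleft()     # the next person is eliminated
    return circle[0]


def cyclic_left_shift(n):
    """Move the leftmost bit of n's binary numeral to the right end; return the new value."""
    bits = format(n, "b")
    return int(bits[1:] + bits[0], 2)


print(all(josephus_survivor(n) == cyclic_left_shift(n) for n in range(1, 200)))
```

The simulation takes time proportional to n, while the bit shift needs only the binary numeral of n, which is the practical content of the corollary.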





Perfect shuffles





The so-called perfect shuffle really refers to two separate permutations (rearrangements)
that can be performed on a packet of cards. In order to make this shuffle fairly
easy to do, we will use only one suit from our deck, so assume we have a packet of 13
cards. The basic premise of the perfect shuffle is that the packet is divided in half and
then the cards are perfectly interwoven, with the original top card either staying on
top (called an out-shuffle) or moving to the second position from the top (called an
in-shuffle). See Figure 2-9 for clarification. The only further issue is the meaning of
"divide in half" for a packet containing an odd number of cards. We will follow the
convention of magicians by always dividing our packet so that the resulting shuffle
has the larger half straddling the smaller half.

The perfect shuffle is a component of many card tricks performed by sleight-of-hand
artists, and it is certainly the magic trick most thoroughly studied by mathematicians.
Brent Morris's book [39] offers a rich amalgamation of math results and magic
history. In particular, using the notation I for in-shuffle and O for out-shuffle on a
packet of 13 cards, Morris provides the following basic facts, which can be verified
by the reader on at least the examples shown in Figure 2-9.











Proposition 9 Assume the card positions are labeled 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10,
11, 12, with 0 denoting the top card.


1. The card initially in position p is in position I(p) after the in-shuffle, where

$$I(p) = (2p + 1) \bmod 13 \qquad (24)$$

2. The card initially in position p is in position O(p) after the out-shuffle, where

$$O(p) = (2p) \bmod 13 \qquad (25)$$


Because each of these shuffles corresponds to simple arithmetic operations on
binary numerals, we can use the binary representations of position numbers to predict
the movements of individual cards. As an example, we will show how to obtain
a sequence of shuffles to move the top card of the packet of 13 to any desired
position in the packet. Use $6 = (110)_{\text{two}}$ as an example. As we go through
the shuffles, we will keep track of the original top card's position using its binary
representation.


1. The position $(0)_{\text{two}}$ card (the top) is moved to position $(1)_{\text{two}}$ via an I shuffle.
2. The position $(1)_{\text{two}}$ card is moved to position $(11)_{\text{two}}$ via an I shuffle.
3. The position $(11)_{\text{two}}$ card is moved to position $(110)_{\text{two}}$ via an O shuffle.


Hence, after three appropriate shuffles, the card that was originally on top can
be moved to position six (which is seventh from the top) in the packet.





Figure 2-9 The out- and in-shuffles with an odd number of cards.


Theorem 10 In a packet of 13 cards, to move a card from position 0 (the top)
to position k, one must do in-shuffles (I) and out-shuffles (O) according to the binary
number for k, using an O shuffle for each 0 and an I shuffle for each 1, reading the
binary number from left to right.


PROOF Simply emulate the example above for a given position number k. ∎
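The rule in Theorem 10 can be checked for every position in the packet by using the position formulas (24) and (25). This is a minimal sketch: the function names and the constant `CARDS` are assumptions introduced here, and the code only tracks the position of the original top card rather than rearranging a full deck.

```python
CARDS = 13   # packet size used throughout this discussion


def in_shuffle(p):
    """New position of the card at position p after an in-shuffle, formula (24)."""
    return (2 * p + 1) % CARDS


def out_shuffle(p):
    """New position of the card at position p after an out-shuffle, formula (25)."""
    return (2 * p) % CARDS


def shuffle_sequence(k):
    """Read the binary numeral for k left to right: I for each 1, O for each 0."""
    return ["I" if bit == "1" else "O" for bit in format(k, "b")]


def track_top_card(k):
    """Apply the shuffle sequence for k and return where the original top card ends up."""
    position = 0
    for move in shuffle_sequence(k):
        position = in_shuffle(position) if move == "I" else out_shuffle(position)
    return position


print(shuffle_sequence(6), track_top_card(6))
```

Each intermediate position is a prefix of the binary numeral for k, so it never exceeds 12 and the reduction mod 13 never changes the value along the way.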


Practice Problem 7 (Do a magic trick!) Get a packet of 13 cards of the same suit
with the ace on top. Ask a friend to name a number between 1 and 10. Do the appropriate
sequence of in-shuffles and out-shuffles to move the top card to that position in the
pack. Have your friend take the pack and move cards from top to bottom, counting
each card until his or her chosen number has been reached. The top card on the packet
is now the ace!


Solutions to Practice Problems 


1. $25 = 2 \cdot 12 + 1$, so since $(12)_{\text{ten}} = (1100)_{\text{two}}$, adding a "1"
digit to the right side of this gives us $(25)_{\text{ten}} = (11001)_{\text{two}}$.
$39 = 2 \cdot 19 + 1$, so since $(19)_{\text{ten}} = (10011)_{\text{two}}$,
adding a "1" digit to the right side of this gives us
$(39)_{\text{ten}} = (100111)_{\text{two}}$. $50 = 2 \cdot 25 + 0$, so since
$(25)_{\text{ten}} = (11001)_{\text{two}}$, adding a "0" digit to the right side of
this gives us $(50)_{\text{ten}} = (110010)_{\text{two}}$.

2. 100101 and 1111101.

3. Proof. Let a natural number X be given such that the
sum S of the decimal digits of X is divisible by 9. This
means that S = 9k for some integer k. Proposition 3
tells us that X − S is divisible by 9, which means that
X − S = 9m for some integer m. Combining these facts
tells us that

$X = (X - S) + S = 9m + 9k = 9(m + k)$

Since m + k is an integer, we can conclude that X is
divisible by 9.
4. We trace the algorithm with b = 5 as follows:
• We begin the algorithm with n = 1,992.
• 1,992 ÷ 5 = 398 with remainder 2, so we write 2 and
set n = 398.
• 398 ÷ 5 = 79 with remainder 3, so we write 3 (to the
left of the previous digit) and set n = 79.
• 79 ÷ 5 = 15 with remainder 4, so we write 4 (to the
left of the previous digit) and set n = 15.
• 15 ÷ 5 = 3 with remainder 0, so we write 0 (to the
left of the previous digit) and set n = 3.
• 3 ÷ 5 = 0 with remainder 3, so we write 3 (to the
left of the previous digit) and set n = 0.
• Since n = 0, we quit.
We conclude that $(1992)_{\text{ten}} = (30432)_{\text{five}}$.
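The trace above is a repeated-division loop, and it can be sketched in a few lines of Python. The function name `to_base` is an illustrative choice, not the book's.

```python
def to_base(n, b):
    """Digits of the nonnegative integer n in base b, most significant first.

    Each pass divides by b, records the remainder as the next digit
    (to the left of the previous ones), and continues with the quotient.
    """
    digits = []
    while n > 0:
        n, remainder = divmod(n, b)
        digits.append(remainder)
    # Digits were produced right to left, so reverse them; 0 is written as [0].
    return digits[::-1] or [0]


print(to_base(1992, 5))
```

Evaluating the digits back with powers of 5 gives 3·625 + 0·125 + 4·25 + 3·5 + 2 = 1,992, which confirms the trace.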
5. They all do (sort of)!

6. (a) $(4013)_{\text{ten}}$
(b) $(65261)_{\text{ten}}$
(c) $(700638)_{\text{ten}} = (\text{AB0DE})_{\text{hex}}$
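7. Using the table

Hexadecimal | 2    | F    | A    | C    | E    | D
Binary      | 0010 | 1111 | 1010 | 1100 | 1110 | 1101

and dropping the unnecessary leading zeroes, we see
that

$(\text{2FACED})_{\text{hex}} = (1011111010110011101101)_{\text{two}}$

Using the meaning of the hexadecimal digits, we can
compute

$(\text{2FACED})_{\text{hex}} = 2 \cdot 16^5 + 15 \cdot 16^4 + 10 \cdot 16^3 + 12 \cdot 16^2 + 14 \cdot 16^1 + 13 \cdot 16^0 = (3124461)_{\text{ten}}$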





If you cannot master the perfect shuffle with a small packet, here is a way to fake it. Divide the packet
as evenly as possible and give one pack to your friend. Holding your packets face down, take turns
dealing your cards off the bottom into a single pile on the table. You will have to give some thought to
the division of the packet so that the person with the bigger pack always starts the deal and the deal
results in the correct shuffle, but a little practice should make this possible.








Exercises for Section 2.6


1. Write the following decimal numerals using their binary
representations:
(a) 35
(b) 125
(c) 123
(d) 1,024

2. Write the following decimal numerals using their base
five representations:
(a) 35
(b) 125
(c) 123
(d) 1,024

3. Write the following decimal numerals using their base
eight representations:
(a) 35
(b) 125
(c) 123
(d) 1,024

4. Write the following numerals, each given in the indicated
bases, in their base ten representations:
[I 
O) (25), 
1e) (1010), 

(d) (1024), 

5. Use the shortcut method of Example 11 to convert each
of the following numerals from hexadecimal to binary
or vice versa, whichever is appropriate:

(a) $(\text{DAD})_{\text{hex}}$

(0) FB) 

(©) (EFEFUTO? sn 

(d) $(11001)_{\text{two}}$

(e) $(1011000011011110)_{\text{two}}$

(f) $(11001010111111101111000000001101)_{\text{two}}$

6. The method of Example 11 can be adapted to convert
between base eight (octal) numerals and binary
representations using Table 2-26 for individual octal digits.
Use this adapted method to convert each of the
following numerals from octal to binary or vice versa,
whichever is appropriate:
(a) $(27)_{\text{eight}}$

(b) $(2005)_{\text{eight}}$

(c) $(4601)_{\text{eight}}$

(d) $(11001)_{\text{two}}$

(e) $(1011000011011110)_{\text{two}}$

(f) $(110010101111111011110101)_{\text{two}}$

7. Use the ideas from Exercises 5 and 6 to convert each
of the following numerals from octal to hexadecimal
or vice versa, whichever is appropriate. (Hint: As an
intermediate step, you should produce the equivalent
binary numeral in each case.)

(2) OTa 

(b) $(1245)_{\text{eight}}$

(c) $(13570246)_{\text{eight}}$

(d) $(\text{DAD})_{\text{hex}}$

(©) QD 

(D (IA2B3CAD) ge, 

8. Prove that a decimal number is divisible by 3 if and only
if the sum of its digits is divisible by 3.

9. What can the ones digit be of a perfect square written
in base eight?

10. What can the ones digit be of a perfect square written
in base five?

11. Which of the following decimal numbers is divisible by
9?

(a) 765

(b) 1,234

(c) 123,678

(d) 1,909,876

12. Prove that if a number n is represented as $d_k d_{k-1} \cdots d_1 d_0$
in base b, then the product n · b is represented as
$d_k d_{k-1} \cdots d_1 d_0 0$ in base b.

13. Demonstrate that the following rule is not true of
binary numerals: "If a number is divisible by 3, then the
sum of that number's binary digits is divisible by 3."











Octal  | 0   | 1   | 2   | 3   | 4   | 5   | 6   | 7
Binary | 000 | 001 | 010 | 011 | 100 | 101 | 110 | 111

Table 2-26 Binary-Octal Conversion





Base eight is often called octal.
















14. Demonstrate that the converse of the rule in Exercise
13 is also not true for binary numerals.
15. Demonstrate that the following rule is not true of base
eight (octal) numerals: "If a number is divisible by 3,
then the sum of that number's octal digits is divisible by
3."
16. Demonstrate that the converse of the rule in Exercise
15 is also not true for base eight numerals.
17. Prove the following rule for base eight (octal) numerals:
"A number is divisible by 7 if and only if the sum of its
octal digits is divisible by 7."
18. Refine the proof of Proposition 4 to show that x mod 9 =
s mod 9 when x is any natural number and s is the
sum of the decimal digits of x.
19. Hexadecimal comes up frequently in computer science
applications. For example, colors on a Web page are
typically expressed using RGB values. If you open
"Notepad" in Windows (or whatever text editor your
computer might have) and type the following single line
in a file that is then saved as "colorist.html," you will
see colors when you open this file with your browser:
<BODY BGCOLOR = #777700>
(a) Try using the RGB values FF0000, 00FF00, and
0000FF to see all red, all green, and all blue.
(b) Can you make the background of your screen
yellow?
(c) How many different colors is it possible to express
in this way?
20. Write the following decimal numbers in base twelve.
(You will need to invent symbols for the numbers 10
and 11 for your system.)
(a) 7369
(b)
(c)
(d)
21. Here is a magic trick that uses binary numbers. Choose
any number you see on any of the cards in Figure 2-10.
Notice that most numbers are on several cards. For each
of the five cards, if your number is on the card, circle
the number in the upper left corner of that card. Now
sum the circled numbers and the sum will reveal your
number.
22. Design a magic trick like the one above taking
advantage of base three representations of numbers.
23. Prove Proposition 5.



















Figure 2-10 Binary magic cards.







2.7 Excursion: Modular Arithmetic and Cryptography





In Section 2.2, we examined properties of integers related to divisibility and
remainders. The point there was to provide some familiar mathematical territory as the
starting point for our journey into formal definitions and mathematical writing. It
turns out that the ideas of divisibility and remainders actually can be used to
formulate a whole new system of arithmetic with properties of interest in their own right.
It might be surprising to some that there can be "whole new systems of arithmetic"
out there, but it is nothing to be worried about. The new systems (yes, there are more
than one) have many connections to the arithmetic we are familiar with.⁷

We first establish a new notion of "equivalence" for integers where we use the
notation

$a \equiv_n b$

to denote that a and b have the same remainder on division by the positive integer
n. In the notation of Section 2.2, this means that a mod n = b mod n.

This connects our discussion with the division theorem, but to be honest, that
might not be the most useful way of thinking about this relationship. We give a less
intuitive but more useful definition instead.








Definition For all integers a, b and n with n > 1, $a \equiv_n b$ means that a − b
is divisible by n. In this case, we say that a and b are congruent mod n.











We will first prove that this really does mean the same thing as the statement
above about remainders. In addition to giving us two ways to think about this sort of
equivalence, this theorem will give us more practice in mathematical writing.





Theorem 1 For integers a, b and n with n > 1, a mod n = b mod n if and only if
n divides a − b.


PROOF To establish this, we must prove each of the following propositions:

1. Proposition. If a and b have the same remainder on division by n, then n
divides a − b.
Proof Let integers a, b and n > 1 be given, and suppose that a and b have the
same remainder r on division by n. This means that for some integers $q_1$ and
$q_2$, we have $a = n \cdot q_1 + r$ and $b = n \cdot q_2 + r$. But this means that

$a - b = (n \cdot q_1 + r) - (n \cdot q_2 + r) = n \cdot (q_1 - q_2)$

which means that n divides a − b.

2. Proposition. If n divides a − b, then a and b have the same remainder on
division by n.


⁷ The general study of these systems is the starting point for the mathematical field of abstract algebra.




Proof Let a, b and n > 1 be given, and suppose that n divides a − b. By the
division theorem (Theorem 8 of Section 2.2), there are integers $q_1$ and $r_1$, $q_2$
and $r_2$ such that

$a = n \cdot q_1 + r_1$, with $0 \le r_1 < n$
$b = n \cdot q_2 + r_2$, with $0 \le r_2 < n$

So we have

$a - b = (n \cdot q_1 + r_1) - (n \cdot q_2 + r_2) = n \cdot (q_1 - q_2) + (r_1 - r_2)$

Now the fact that a − b is divisible by n implies from this last equation that $r_1 - r_2$
is divisible by n. Because each of $r_1$ and $r_2$ is from the set {0, 1, 2, ..., n − 1},
this can only happen if $r_1 - r_2 = 0$. This means that a and b have the same
remainder on division by n.
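Theorem 1 says the two descriptions of congruence always agree. As an informal sanity check (not a substitute for the proof), the following Python sketch compares them over a small range of values; the helper names are made up for this illustration.

```python
def same_remainder(a, b, n):
    """Remainder description: a mod n equals b mod n.

    For positive n, Python's % operator returns a remainder in 0..n-1 even
    when a is negative, matching the division theorem.
    """
    return a % n == b % n


def divides_difference(a, b, n):
    """Definition used in the text: n divides a - b."""
    return (a - b) % n == 0


# Compare the two descriptions for every pair in a small window of integers.
for n in range(2, 13):
    for a in range(-30, 31):
        for b in range(-30, 31):
            assert same_remainder(a, b, n) == divides_difference(a, b, n)

print(same_remainder(20, 2, 6), divides_difference(20, 2, 6))
```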








The right way to think of this is to imagine the n being fixed in advance, and
having the $\equiv_n$ equivalence applied to pairs of integers. In this way, the integers sort
themselves into coherent cohorts as the following example illustrates.





Example 1 Let n = 6. All numbers in the same row of the table below are
congruent mod 6. For example, 20 and 2 are in the same row because both have remainder
2 on division by 6, and because 20 − 2 = 18 is evenly divisible by 6. Notice that every
integer will appear in one of the six rows because the division theorem guarantees that
every integer when divided by 6 will have a remainder from {0, 1, 2, 3, 4, 5}, and there
is one row of numbers for each of these values.











..., −18, −12, −6, 0, 6, 12, 18, ...
..., −17, −11, −5, 1, 7, 13, 19, ...
..., −16, −10, −4, 2, 8, 14, 20, ...
..., −15, −9, −3, 3, 9, 15, 21, ...
..., −14, −8, −2, 4, 10, 16, 22, ...
..., −13, −7, −1, 5, 11, 17, 23, ...
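The rows of this table can be regenerated by sorting a window of integers by remainder. The sketch below assumes the window −18 to 23 shown above; `residue_row` is a hypothetical helper name.

```python
def residue_row(r, n=6, lo=-18, hi=23):
    """All integers x with lo <= x <= hi whose remainder on division by n is r."""
    return [x for x in range(lo, hi + 1) if x % n == r]


# One row per possible remainder 0, 1, ..., 5.
for r in range(6):
    print(r, residue_row(r))
```

Every integer in the window appears in exactly one row, as the division theorem guarantees.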








In this example, we consider all the integers in the same row as being equivalent
in this new sense. This is certainly a different meaning of equivalence from the usual
one, but we still have not seen how this constitutes a new system of arithmetic. Let us
experiment a bit with the table above to see what "mod 6 arithmetic" is all about.














Example 2 We will refer to each row in the previous table by the remainder
associated with the numbers in that row. Investigate the result of adding or multiplying
numbers, one from row 2 and the other from row 3.

SOLUTION Since there are many numbers in row 2 and many numbers
in row 3, we should try addition and multiplication with different choices of
representatives to see what difference it makes. A few such trials are shown in
Table 2-27. It seems that no matter which two representatives from row 2 and
row 3 we choose, their sum will be in row 5 and their product will be in row
0. Thus, we can sensibly say that "row 2 plus row 3 equals row 5" and "row 2
times row 3 equals row 0," or more simply, $2 + 3 \equiv_6 5$ and $2 \cdot 3 \equiv_6 0$.
This gives us our new system of arithmetic on the numbers {0, 1, 2, 3, 4, 5}.

Choice from Row 2 | Choice from Row 3 | Sum? | Row? | Product? | Row?
8                 | 9                 | 17   | 5    | 72       | 0
8                 | −15               | −7   | 5    | −120     | 0
2                 | −15               | −13  | 5    | −30      | 0
14                | 3                 | 17   | 5    | 42       | 0

Table 2-27 Solution for Example 2
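The observation in Example 2 can be tested on many more representatives than fit in a table. This Python sketch (the variable names are illustrative) takes a range of numbers from row 2 and row 3 and collects the rows in which their sums and products land.

```python
# Representatives of row 2 and row 3: numbers of the form 2 + 6t and 3 + 6t.
row2 = [2 + 6 * t for t in range(-5, 6)]
row3 = [3 + 6 * t for t in range(-5, 6)]

# The "row" of a number is its remainder on division by 6.
sum_rows = {(x + y) % 6 for x in row2 for y in row3}
product_rows = {(x * y) % 6 for x in row2 for y in row3}

print(sum_rows, product_rows)
```

Both sets contain a single row number, which is what makes "row 2 plus row 3" and "row 2 times row 3" meaningful.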


We can extend this example to build entire addition and multiplication tables 
for our new mod 6 arithmetic. 


Example 3 Figure 2-11 shows the addition and multiplication tables for mod 6
arithmetic.
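Tables like these are easy to generate for any modulus. The following Python sketch builds them as lists of rows; the function names are hypothetical.

```python
def addition_table(n):
    """Entry [i][j] is (i + j) mod n, for i and j in 0..n-1."""
    return [[(i + j) % n for j in range(n)] for i in range(n)]


def multiplication_table(n):
    """Entry [i][j] is (i * j) mod n, for i and j in 0..n-1."""
    return [[(i * j) % n for j in range(n)] for i in range(n)]


for row in multiplication_table(6):
    print(*row)
```

Calling `multiplication_table(5)` or `multiplication_table(7)` gives tables to compare against the mod 6 case.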


Practice Problem 1 Give the complete multiplication table for mod 5 arithmetic.
In what way are the rows in your table different from the rows in the mod 6
multiplication table in Example 3?


The key to these tables being well-defined is the observation in Example 2 that
the row number of the "output" of a sum or product does not depend on which
representatives we choose from the two rows. The next theorem makes this precise.


Theorem 2 Consider an integer n ≥ 2. For all integers a, b, c, and d, if $a \equiv_n b$
and $c \equiv_n d$, then

1. $(a \cdot c) \equiv_n (b \cdot d)$, and
2. $(a + c) \equiv_n (b + d)$.





PROOF Let a, b, c, d and n ≥ 2 be given, and assume that $a \equiv_n b$ and $c \equiv_n d$.
(We will prove the first conclusion here and leave the proof of the second
conclusion to Exercise 2.) By Theorem 1, both a − b and c − d are divisible by n. This



















































































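+ | 0 1 2 3 4 5        · | 0 1 2 3 4 5
0 | 0 1 2 3 4 5        0 | 0 0 0 0 0 0
1 | 1 2 3 4 5 0        1 | 0 1 2 3 4 5
2 | 2 3 4 5 0 1        2 | 0 2 4 0 2 4
3 | 3 4 5 0 1 2        3 | 0 3 0 3 0 3
4 | 4 5 0 1 2 3        4 | 0 4 2 0 4 2
5 | 5 0 1 2 3 4        5 | 0 5 4 3 2 1

Figure 2-11 Addition and multiplication tables for mod 6 arithmetic.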







means that a − b = K · n and c − d = L · n for some integers K and L. Rewriting
these equations as a = b + K · n and c = d + L · n, respectively, allows us to
substitute and expand

$a \cdot c = (b + K \cdot n)(d + L \cdot n) = b \cdot d + (b \cdot L + d \cdot K + n \cdot K \cdot L) \cdot n$

This means that a · c − b · d = (an ugly integer) · n, so a · c − b · d is divisible by
n. We conclude (using Theorem 1 again) that $(a \cdot c) \equiv_n (b \cdot d)$.


This fact alone can be used for some amusing math puzzles. A little later on we
will see a more practical side to these ideas.

Example 4 What is the ones digit of the rather large number $3^{205}$?

SOLUTION This is equivalent to asking, "What is the remainder when $3^{205}$
is divided by 10?" or equivalently, "What number from the set {0, 1, 2, ..., 9}
is congruent to $3^{205}$ mod 10?" To answer the latter question, we first observe
a pattern in Table 2-28.

n | $3^n$ | Remainder mod 10
1 | 3     | 3
2 | 9     | 9
3 | 27    | 7
4 | 81    | 1

Table 2-28 Small Powers of 3 in mod 10 Arithmetic

Proposition 3 uses mathematical induction to show that for every integer
n ≥ 1, $3^{4n} \equiv_{10} 1$. From this it follows that $3^{204} \equiv_{10} 1$, and so

$3^{205} = 3^{204} \cdot 3 \equiv_{10} 1 \cdot 3 = 3$, by Theorem 2

Hence, $3^{205}$ has a 3 in the ones digit.
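The pattern behind Example 4 can be confirmed numerically. The sketch below uses Python's built-in three-argument `pow`, which computes a power modulo a number without forming the huge power first. The exponent 205 is an assumption taken from a reading of the example; any exponent that is one more than a multiple of 4 gives the same ones digit.

```python
# Remainders of the first few powers of 3 on division by 10 (Table 2-28).
small_powers = [pow(3, n, 10) for n in range(1, 5)]
print(small_powers)

# The remainders repeat with period 4 because 3**4 = 81 leaves remainder 1.
cycle = [pow(3, n, 10) for n in range(1, 13)]
assert cycle == small_powers * 3

# Ones digit of 3**205, assuming 205 is the intended exponent.
ones_digit = pow(3, 205, 10)
print(ones_digit)
```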
Here is the promised proof of the assertion from the previous example.

Proposition 3 For all positive integers n, $3^{4n} \equiv_{10} 1$.








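PROOF Consider the statement "$3^{4n} \equiv_{10} 1$." The table in Example 4 illustrates
that the first statement, "$3^4 \equiv_{10} 1$," is true. Now let an integer m > 1 be given
such that all the statements up to "$3^{4(m-1)} \equiv_{10} 1$" have been checked to be true.
Now consider the next statement:

$3^{4m} = 3^{4(m-1)} \cdot 3^4 \equiv_{10} 1 \cdot 1 = 1$

by Theorem 2 and the facts that $3^{4(m-1)} \equiv_{10} 1$ and $3^4 \equiv_{10} 1$.

This proves the statement "$3^{4m} \equiv_{10} 1$," completing the induction.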


Practice Problem 2 Use mathematical induction to show that for all positive
integers n, $2^{4n} \equiv_{10} 6$.








We will now get back to the general idea of a "new system of arithmetic." In
arithmetic, once we have a way to add and multiply, it is only natural to ask whether
we can subtract and divide. Since our modular relationship is originally defined on
all integers (including the negative ones), the notion of subtraction works out as one
would expect: namely, subtracting a from b is the same as adding −a to b. Division
is more complicated, however. We will take the idea that division should amount to
multiplying by a "reciprocal" and see how far we can get.





Example 5 In Example 4, we saw that $3^3 \equiv_{10} 7$ and $3^4 \equiv_{10} 1$. A particular side
effect of this is the fact that

$3 \cdot 7 \equiv_{10} 1$

Just as 2 and 1/2 are reciprocals in normal "rational number arithmetic" because their
product is 1, it makes sense that we can call 3 and 7 reciprocals in mod 10
arithmetic. Hence, dividing by 3 is the same as multiplying by 7 when doing arithmetic
mod 10.





This fact has the practical advantage of allowing us to do some algebra in our
new system of arithmetic.

Example 6 Solve the equation $3 \cdot x \equiv_{10} 4$ for x.

SOLUTION By Theorem 2, multiplying on both sides of the equation by 7
gives us

$7 \cdot 3 \cdot x \equiv_{10} 7 \cdot 4$ or $1 \cdot x \equiv_{10} 8$

So any integer $x \equiv_{10} 8$ solves the given equation.
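For small moduli, reciprocals can be found by simply trying every candidate. The Python sketch below does this and then repeats the algebra of the example; `reciprocal` is an illustrative helper, and the brute-force search is a choice made here for clarity, not a method given in the text.

```python
def reciprocal(a, n):
    """Return b in 1..n-1 with (a * b) mod n == 1, or None if no such b exists."""
    for b in range(1, n):
        if (a * b) % n == 1:
            return b
    return None


# 3 and 7 are reciprocals in mod 10 arithmetic.
inv3 = reciprocal(3, 10)

# Solve 3 * x = 4 (mod 10) by multiplying both sides by the reciprocal of 3.
x = (inv3 * 4) % 10
print(inv3, x, (3 * x) % 10)

# Not every number has a reciprocal mod 10.
print(reciprocal(6, 10))
```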


Practice Problem 3 For each of the following, find an integer for x that satisfies
the equation, if possible:

(a) $2 \cdot x \equiv_{25} 3$
(b) $6 \cdot x \equiv_{15} 4$
(c) $4 \cdot x \equiv_{71} 3$





The second equation in the previous practice problem shows that one cannot 
always do division mod n. In this example, this happens because 6 has no reciprocal 
mod 10. This is equivalent to saying, “No multiple of 6 can have a remainder of 1 
when divided by 10,” which can be proved simply in terms of even and odd numbers. 
This problem arises from the fact that 6 and 10 have a common divisor other than 1.

To avoid this issue, we can just stick to arithmetic mod p for prime numbers
p, and then all nonzero numbers have reciprocals, which allows us to do division.
Therefore, in the exercises we will look mostly at mod p arithmetic for prime numbers
p.

The fact that reciprocals always exist in arithmetic mod p can be proved in many
ways. We prefer to see it as a corollary to the following useful theorem. This theorem
is not proved here, but it is a consequence of the fact that every number can be 
factored into a product of prime numbers in a unique way. 


Theorem 4 Given a prime number p and integers a and b, if a -b is a multiple 
of p, then a is a multiple of p or b is a multiple of p. 




As our final proof of this section, we will show that the property of prime numbers
given in the previous theorem does, in fact, ensure that all numbers have
multiplicative reciprocals in mod p arithmetic.


Proposition 5 Let p be prime and a be a number from {1, 2, ..., p − 1}. It
follows from Theorem 4 that there is a number b in {1, 2, ..., p − 1} such that $b \cdot a \equiv_p 1$.

PROOF For the given prime p and number a from {1, 2, ..., p − 1}, we can form
the following list of numbers:

$1 \cdot a,\ 2 \cdot a,\ 3 \cdot a,\ \ldots,\ (p - 1) \cdot a$

Theorem 4 guarantees that none of the numbers in this list is divisible by p. That
is, none of the numbers in this list is congruent to 0 mod p. Exercise 12 at the
end of this section will show that this implies that no two numbers in this list are
congruent to each other mod p.

Now the conclusion follows if we think about the situation carefully. The list
above contains p − 1 values congruent (mod p) to different numbers in {1, 2, ..., p − 1},
and so each of the values 1, 2, ..., p − 1 is represented once in the list. In particular,
one of the numbers in the list is congruent to 1 mod p.

Since all numbers in the list are of the form b · a, this means that some number
of the form b · a is congruent to 1 mod p, completing the proof.
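The heart of this proof is that the multiples of a run through every nonzero remainder exactly once. The following Python sketch illustrates this for the sample values p = 7 and a = 3, which are chosen here only for illustration.

```python
def multiples_mod_p(a, p):
    """Remainders mod p of the list 1*a, 2*a, ..., (p-1)*a."""
    return [(i * a) % p for i in range(1, p)]


p, a = 7, 3
residues = multiples_mod_p(a, p)
print(residues)

# Every value 1, 2, ..., p-1 shows up exactly once, so 1 must be among them.
assert sorted(residues) == list(range(1, p))

# The multiplier b that produced the remainder 1 is the reciprocal of a.
b = residues.index(1) + 1
print(b, (b * a) % p)
```

The argument depends on p being prime: with p = 6 and a = 2, the list of remainders contains 0 and repeats values.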














This proof might be unsatisfying in that it logically concludes something exists 
(the mod p reciprocal of a in this case) without giving any clue at all about how 
you might find it! There are several effective ways to find a reciprocal, one of which 
derives from the following classic result of number theory. 





Theorem 6 (Fermat) If p is a prime number, then for any positive integer
a < p, $a^{p-1} \equiv_p 1$.





PROOF Let prime p and positive integer a < p be given. In the proof of
Proposition 5, we said that every value in the list

$1 \cdot a,\ 2 \cdot a,\ 3 \cdot a,\ \ldots,\ (p - 1) \cdot a$

is congruent mod p to a different value from the set {1, 2, ..., p − 1}, so the
product of the numbers in the list is congruent mod p to the product of the numbers
in {1, 2, ..., p − 1}. That is,

$(1 \cdot a)(2 \cdot a)(3 \cdot a) \cdots ((p - 1) \cdot a) \equiv_p (p - 1)!$

which is the same thing as

$(p - 1)! \cdot a^{p-1} \equiv_p (p - 1)!$

Since $(p - 1)! \not\equiv_p 0$, we can multiply on each side of this equivalence by the
reciprocal of (p − 1)! to get the desired conclusion, $a^{p-1} \equiv_p 1$.







Corollary 7 Given a prime number p and a number a from {1, 2, ..., p − 1}, the
number $a^{p-2} \bmod p$ is the mod p reciprocal of a.


Example 7 Letting p = 17 and a = 4, we can compute

$a^{p-2} \bmod p = 4^{15} \bmod 17 = 1{,}073{,}741{,}824 \bmod 17 = 13$

And it is easy to check that $4 \cdot 13 = 52 \equiv_{17} 1$, so 13 is the reciprocal of 4 in mod 17
arithmetic.
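Corollary 7 translates directly into code. The sketch below uses Python's built-in three-argument `pow` to compute the power mod p; the function name `fermat_reciprocal` is made up for this illustration, and the function assumes (without checking) that p is prime.

```python
def fermat_reciprocal(a, p):
    """Reciprocal of a in mod p arithmetic, computed as a**(p-2) mod p.

    Assumes p is prime and a is one of 1, 2, ..., p-1.
    """
    return pow(a, p - 2, p)


print(4 ** 15, fermat_reciprocal(4, 17))

# Check the defining property of a reciprocal for every nonzero value mod 17.
for a in range(1, 17):
    assert (a * fermat_reciprocal(a, 17)) % 17 == 1
```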
RSA Cryptosystem 


"Encryption" means to make a message readable only to an intended receiver. For
years, the science of cryptography was associated only with military secrets and
national security. However, cryptography is now a part of everyday life. There is
some level of encryption in most computer systems, from credit card transactions to
simple everyday e-mail. Consequently, many cryptologists continue to develop new
encryption schemes that are effective but easily implemented.

In the remainder of this section we will discuss a popular encryption scheme
and see the mathematics behind it. The RSA system of cryptography is named for
Ronald Rivest, Adi Shamir, and Leonard Adleman, who published a paper about it
in 1977. It uses the strange idea that the information necessary to encrypt a message
can be made public while the information needed to decrypt the message remains
private. Somewhere at the heart of such a system, there must be a mathematical
operation that is easy to compute but difficult to reverse. Surprisingly, the operation
in the RSA system is simply the multiplication of large numbers.

We will first give a general description of the encryption/decryption process, and
then we will look at the mathematics behind the system.


























1. We first must translate our message into a number. We will use the following
simple system:
• Break the message into strings of four letters each. (The last string might
be smaller.) For example, the message "PLEASE COME HERE" would be
broken down into the strings PLEA, SECO, MEHE, and RE.
• Each small word is changed into a number using the simple concatenation of
numbers from Table 2-29.






































A  | B  | C  | D  | E  | F  | G  | H  | I  | J  | K  | L  | M
01 | 02 | 03 | 04 | 05 | 06 | 07 | 08 | 09 | 10 | 11 | 12 | 13

N  | O  | P  | Q  | R  | S  | T  | U  | V  | W  | X  | Y  | Z
14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26
































Table 2-29 Changing Letters to Numbers 




• So the string PLEA becomes the number 16,120,501 (i.e., the concatenation of
the number 16 for P, 12 for L, 05 for E, and 01 for A), and similarly,
The string SECO becomes the number 19,050,315,
The string MEHE becomes the number 13,050,805, and
The string RE becomes the number 1,805.
2. Each of these four numbers will be encrypted separately and be sent in order, so
the receiver will need to decrypt them, change them back into strings, and then
put the strings together to form the original message.
3. Prime numbers p and q are chosen so that their product n = pq is more than
eight digits long, and hence each number to be encrypted is between 0 and n. For
this example we will use the primes p = 5,021 and q = 25,013, giving us a product
of n = 125,590,273.
4. The value of n is made public, but the values of p and q are eventually erased,
shredded, and burned, because the security of the encyphering scheme is based on
the fact that it is hard (i.e., it takes a long time) to recover p and q from knowing
n. Imagine how long it would take you to factor the value n = 125,590,273 into the
product of primes p and q by hand. If our p and q values had each been over 100
digits long, as they are in secure systems, then even the fastest computers would
struggle.
5. Before p and q are destroyed, we need them for a couple of other tasks to make
our encryption scheme ready to use:
• First we compute the number k = (p − 1) · (q − 1). This also will have to be
destroyed a bit later. In our example, k = 5,020 · 25,012 = 125,560,240.

• Next we need another prime number, this time between 1 and k, that is not
a factor of k. We will call this number e, and it will be the publicly available
encryption key. For our example, let's pick an easy number like e = 7.





























• We will use the k and e to find positive integers d and v so that the equation

$d \cdot e - v \cdot k = 1$

is true. This is the only computationally difficult step, and it has to do with what
is called the Euclidean algorithm. We will not discuss this in this course, but
rather we will give you these values when you need them. In our example, with
k = 125,560,240 and e = 7, the equation is

$107{,}623{,}063 \cdot 7 - 6 \cdot 125{,}560{,}240 = 1$

That is, d = 107,623,063 and v = 6.

6. Only the numbers n, e, and d are kept; the others are disposed of. If this is to be
your public key code, you should publish your value of n and e for the general
public. The value of d is your decryption key, and you should not share it with
anyone. Notice that if anyone were able to factor n, they would be able to compute
k and then find d just as you did above. The security of the whole system depends
on n being hard to factor.

7. The point is that now if I have a number M to send you as a message, I can encrypt
it by computing $C = M^e \bmod n$, since your values of n and e are publicly available.











M          | $M^7 \bmod 125590273$
16,120,501 | 106,118,249
19,050,315 | 108,681,457
13,050,805 | 30,461,159
1,805      | 59,000,375

Table 2-30 Plaintext to Cyphertext


When you receive the encrypted message C, you compute $C^d \bmod n$, and you will
see M. No one else knows d, and to find it, they would have to be able to factor
the rather large number n.
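The key setup in steps 3 through 6 can be sketched in Python. The text does not describe how d and v are found beyond naming the Euclidean algorithm, so the recursive extended Euclidean routine below is one standard way to do it and is an assumption of this sketch, as are the function and variable names.

```python
def extended_gcd(a, b):
    """Return (g, x, y) with a*x + b*y == g, where g is gcd(a, b)."""
    if b == 0:
        return a, 1, 0
    g, x, y = extended_gcd(b, a % b)
    # g == b*x + (a % b)*y and a % b == a - (a // b)*b, so regroup the terms.
    return g, y, x - (a // b) * y


p, q, e = 5021, 25013, 7          # the primes and encryption key from the example
n = p * q                         # public modulus
k = (p - 1) * (q - 1)             # kept secret, then destroyed

g, x, _ = extended_gcd(e, k)
assert g == 1                     # e and k share no common factor
d = x % k                         # smallest positive d with d*e - v*k == 1
v = (d * e - 1) // k

print(n, k, d, v)
```

Anyone who could factor n could rerun these few lines to recover d, which is why p, q, and k must not be published.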


Example 8 Let's see how I would send you the message "PLEASE COME
HERE" once I have looked you up in my public key book to find out that your public
keys are n = 125,590,273 and e = 7.

SOLUTION

1. First the message is broken down into the four numbers to be sent:
16,120,501; 19,050,315; 13,050,805; and 1,805.

2. For each of these numbers M, I compute $M^7 \bmod 125590273$ with a
computer, as shown in Table 2-30.

3. I send you the encrypted numbers: 106,118,249; 108,681,457; 30,461,159;
59,000,375.

To decrypt my message, you must compute $C^{107623063} \bmod 125590273$ for each
of these four numbers C, using your secret decryption key d = 107,623,063 and a
friendly computer, as shown in Table 2-31.

Notice that the numbers we got back in the end are precisely the original
ones before encryption. The method works!
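The whole exchange in Example 8 can be sketched in Python using the keys from the example. The helper names `encode` and `decode` are illustrative, and the code assumes messages contain only the letters A to Z and spaces. No cyphertext values are hard-coded; the sketch only confirms that decryption undoes encryption.

```python
import string


def encode(message, block=4):
    """Split the letters into strings of four and concatenate the two-digit
    codes A=01, ..., Z=26 for each string."""
    letters = [c for c in message.upper() if c in string.ascii_uppercase]
    chunks = [letters[i:i + block] for i in range(0, len(letters), block)]
    return [int("".join(f"{ord(c) - ord('A') + 1:02d}" for c in chunk))
            for chunk in chunks]


def decode(numbers):
    """Turn each number back into letters, two digits at a time."""
    text = []
    for m in numbers:
        digits = str(m)
        if len(digits) % 2:          # restore a leading zero lost by int()
            digits = "0" + digits
        text.append("".join(chr(int(digits[i:i + 2]) + ord('A') - 1)
                            for i in range(0, len(digits), 2)))
    return "".join(text)


n, e, d = 125590273, 7, 107623063    # public n and e, private d

plain = encode("PLEASE COME HERE")
cipher = [pow(m, e, n) for m in plain]       # C = M^e mod n
recovered = [pow(c, d, n) for c in cipher]   # M = C^d mod n

print(plain)
print(cipher)
print(decode(recovered))
```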








The Mathematics Behind RSA

We will now see why this public-key cryptography works. The basis for the RSA
system is the following consequence of Theorem 6, called Fermat's little theorem:

Theorem 8 For any prime p and any positive integer a < p,

$a^p \bmod p = a$

In order to show that the RSA scheme really works, we need to show that with
all the values selected as in the example, we will always get back the original message
M after it has been encrypted and then decrypted.





Proposition 9 For n, e, and d selected as in the previous discussion of the RSA
system, we will have for any message M (between 1 and n),

$(M^e)^d \bmod n = M$


C           | $C^{107623063} \bmod 125590273$
106,118,249 | 16,120,501
108,681,457 | 19,050,315
30,461,159  | 13,050,805
59,000,375  | 1,805

Table 2-31 Cyphertext to Plaintext





PROOF Fermat's theorem tells us that $M^{p-1} \bmod p = 1$, from which it is
straightforward to infer that $M^k \bmod p = 1$ since

$M^k \bmod p = M^{(p-1)(q-1)} \bmod p = (M^{p-1})^{q-1} \bmod p = 1^{q-1} = 1$

Similarly, we know that $M^k \bmod q = 1$, and so (with a bit more thought) since
n = p · q, we know that

$M^k \bmod n = 1$  (2.6)

By applying this to the encryption/decryption scheme, we find that

$(M^e)^d = M^{ed} = M^{1 + vk} = M \cdot (M^k)^v$

But we know from (2.6) that $M^k \bmod n = 1$, so

$M \cdot (M^k)^v \bmod n = M \cdot 1 = M$

So successively encrypting and decrypting will return a message unchanged.
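The two facts the proof relies on can be checked numerically for the keys in the running example. This Python sketch uses M = 1,805 (the number for the string RE); the check of (2.6) applies to messages that share no factor with n, which the code confirms for this M with `math.gcd`.

```python
from math import gcd

p, q = 5021, 25013
n, k = p * q, (p - 1) * (q - 1)
e, d, v = 7, 107623063, 6

M = 1805
assert gcd(M, n) == 1            # M shares no factor with n

# The exponent identity used in the proof: e*d = 1 + v*k.
assert e * d == 1 + v * k

# Equation (2.6): M^k mod n = 1.
print(pow(M, k, n))

# Encrypting and then decrypting returns M.
print(pow(pow(M, e, n), d, n))
```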






Solutions to Practice Problems

1. Figure 2-12 shows the mod 5 multiplication. One way in
which this differs from the mod 6 multiplication table is
that the one here has no 0 entries except for those that
come from multiplying by zero.

· | 0 1 2 3 4
0 | 0 0 0 0 0
1 | 0 1 2 3 4
2 | 0 2 4 1 3
3 | 0 3 1 4 2
4 | 0 4 3 2 1

Figure 2-12 Multiplication in mod 5 arithmetic.

2. Consider the statement "$2^{4n} \equiv_{10} 6$." It is easy to check
that the first statement "$2^4 \equiv_{10} 6$" is true. Suppose we
have checked all the statements up to the statement
"$2^{4(m-1)} \equiv_{10} 6$," for some given integer m ≥ 2. Then it
follows that

$2^{4m} = 2^{4(m-1)} \cdot 2^4 \equiv_{10} 6 \cdot 6 = 36 \equiv_{10} 6$, since $2^4 \equiv_{10} 6$

This shows that $2^{4m} \equiv_{10} 6$, completing the induction.

3. The second equation has no solution.

(a) $2 \cdot x \equiv_{25} 3$ for any $x \equiv_{25} 14$; in particular, x = 14 will
do it.

(b) $6 \cdot x \equiv_{15} 4$ has no solution for x because 6x − 4 cannot
be divisible by 15 since it cannot be divisible by
3. (Look at the remainder when 6x − 4 is divided by
3.)

(c) $4 \cdot x \equiv_{71} 3$ for any $x \equiv_{71} 54$; in particular, x = 54 will
do it.










Exercises for Section 2.7


1. Each of the following statements is false. Provide a
counterexample to each.
(a) Ia -b= 0, then a =, orb =, 0. 
(b) For every bæ, 0, there is an integer a such that 
i. 

0, then a? =, 0. 

à) Ifa =, 1 and b=, 1, then a-b =m. 1. 

(€) We will say that Sis a perfect square mod n if there 
is some integer a such that a? =, 5. If 5 is a perfect 
square mod m, then 5 must divide n(n — 1). 

4, then either a =, Qora =, 

sn b, then 2” e, 2°, 

2. Fill in the missing details in the following proofs:

(a) Proposition For any integer n ≠ 0 and for all
integers a, b, c, and d, if $a \equiv_n b$ and $c \equiv_n d$, then
$a + c \equiv_n b + d$.

Proof Let a, b, c, d, and n be given, and assume
that $a \equiv_n b$ and $c \equiv_n d$. By Theorem 1, both a − b
and c − d are divisible by n.





























Since (a + c) − (b + d) is divisible by n, we conclude
(by Theorem 1 again) that $a + c \equiv_n b + d$.

(b) Proposition For every integer n ≥ 0, if n is
divisible by 4, then $2^n \equiv_5 1$.
Proof by induction. Let P(k) be the statement
"$2^{4k} \equiv_5 1$." We first check the first few statements:
• P(0) states ______, which is true.
• P(1) states ______, which is true.
• P(2) states ______, which is true.

Now assume that for some given m ≥ 1, we
have established all the statements P(0), P(1), ...,
P(m − 1), and we are now considering P(m).


16 
1-1 by Theorem 
andP(m — 1) 

1 








Since $2^{4m} \equiv_5 1$, this establishes P(m), completing
the induction.

(c) Proposition For every integer n, $n^3 \equiv_3 n$.
Proof Let the integer n be given. By the division
theorem, either $n \equiv_3 0$, $n \equiv_3 1$, or $n \equiv_3 2$, so we can
simply address each possible case:
© Ene. 0, then n = 





by Theorem 








10. 


n" 


1 


nu. 


by Theorem 





9 dino 2 then n’ by Theorem. 


Hence, in every possible case, $n^3 \equiv_3 n$.

3. Make complete multiplication and addition tables for
mod 7 arithmetic.

4. Make complete multiplication and addition tables for
mod 8 arithmetic.

5. Find reciprocals of each of the values 1, 2, 3, ..., 10 in
mod 11 arithmetic.

6. Find reciprocals of each of the values 1, 2, 3, ..., 12 in
mod 13 arithmetic.

What is the remainder when 2!" ig divided by 197 
(Hint: First find a so that $2^a \equiv_{19} 1$.)

8. Let x be any positive integer and $s_x$ be the sum of the
(base ten) digits of x. Prove that $x \equiv_9 s_x$. (Hint: Use
Proposition from Section 2.6.) 

Suppose that the number 24 is written out and its 
digits are summed, then this resulting number is writ- 
ten out and its digits are summed, and this process is 
repeated until there is only a one-digit number result. 
What is this single digit? (Hint: Use the previous exercise.)
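The digit-sum process in the last two exercises is easy to experiment with. The sketch below assumes the modulus in the digit-sum exercise is 9; the helper names `digit_sum` and `repeated_digit_sum` are hypothetical. A finite check like this is evidence for the claim, not a proof of it.

```python
def digit_sum(x):
    """Sum of the base-ten digits of a positive integer x."""
    return sum(int(ch) for ch in str(x))


def repeated_digit_sum(x):
    """Sum the digits again and again until a single digit remains."""
    while x >= 10:
        x = digit_sum(x)
    return x


# x and its digit sum leave the same remainder when divided by 9.
for x in (2006, 987654321, 10**12 + 7):
    print(x, digit_sum(x), x % 9 == digit_sum(x) % 9)
```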

Find all solutions for x in each of the following equa- 
tions: 





(28-5250 

Find all primes p less than 100 for which the equation $x^2 + 1 \equiv_p 0$ has an integer solution. Make a conjecture about what these values of p all have in common.
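For a single small prime, the equation can be tested by trying every remainder. This is only a brute-force sketch; the function name `roots_of_minus_one` is an assumption made for illustration.

```python
def roots_of_minus_one(p):
    """All x in 0..p-1 for which x*x + 1 is divisible by p."""
    return [x for x in range(p) if (x * x + 1) % p == 0]


print(roots_of_minus_one(5))  # 2*2 + 1 = 5 and 3*3 + 1 = 10 are multiples of 5
print(roots_of_minus_one(7))  # no remainder works for 7
```

Running the same function over the remaining primes is one way to gather data for the conjecture.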
Show that if none of the numbers in the list $a, 2a, 3a, \ldots, (p-1) \cdot a$ are congruent to 0 mod p, then no two numbers in the list are congruent to each other mod p. (Hint: If two of the numbers are congruent to each other, what do you know about their difference?)
Prove that for each natural number n, $n^5$ has the same ones digit as n does.

If we want to send a message one letter at a time, we can use the RSA system with small primes like p = 11 and q = 7, and just send the letters A through Z as numbers 1 through 26, respectively.





15. 


16. 


[A 


(a) If I use the encryption key e = 43, what is the smallest positive number you can use for the decryption key?

(b) Using public keys n = 11 · 7 = 77 and e = 43, I send you the following message:

41 26 26 69 01 69 41 6&

Decipher the message using the value of d you found in the previous exercise.

(c) Give two reasons why you would not want to send a sensitive message using this scheme.
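The key computations in these RSA exercises can be sketched in a few lines. The example below uses different toy values (p = 3, q = 11, e = 7) so that it does not answer the exercises. It assumes the decryption key d is taken to be the reciprocal of e modulo (p − 1)(q − 1); a smaller exponent may also work in some cases, so treat the result as one valid key rather than necessarily the smallest. The function name `decryption_key` is hypothetical.

```python
from math import gcd


def decryption_key(p, q, e):
    """Return d with (e * d) % ((p - 1) * (q - 1)) == 1."""
    phi = (p - 1) * (q - 1)
    if gcd(e, phi) != 1:
        raise ValueError("e must share no factor with (p - 1)(q - 1)")
    return pow(e, -1, phi)  # modular inverse, Python 3.8+


p, q, e = 3, 11, 7
n = p * q
d = decryption_key(p, q, e)

message = 4                    # a letter encoded as a number below n
cipher = pow(message, e, n)    # encrypt: message^e mod n
plain = pow(cipher, d, n)      # decrypt: cipher^d mod n
print(d, cipher, plain)
```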





Suppose I want to send a message one letter at a time using the RSA system with the primes p = 5 and q = 7, and just send the letters A through Z as numbers 1 through 26, respectively.

(a) If I use the encryption key e = 11, what is the smallest positive number you can use for the decryption key?

(b) If I use the encryption key e = 13, what is the smallest positive number you can use for the decryption key?

(c) If I use the encryption key e = 17, what is the smallest positive number you can use for the decryption key?

(d) Using public keys n = 5 · 7 = 35 and e = 17, I send you the following message:

24 10 33
10 17
23 10 20 24 12 21 04 23 23

Decipher the message using the value of d you found in part (c).
Here is a message to you in which I am using an RSA system with public key n = 2,773 and encryption key e = 487.
0245 2040 
1979 1130 








1698 1439 1364 1758 0946 0i 


I have broken my original message into pairs of characters and converted these pairs to numbers as we did in the text. For example, the word "MATH" would be broken into "MA" and "TH" that would be converted to the numbers 1,301 and 2,008, respectively, and encrypted. Figure out the decryption key d, decrypt the message, and answer the question that it asks.





Define a k-pseudoprime as a number n > 1 for which $k^{n-1} \equiv_n 1$, and define a Carmichael number as a number n > 1 that is a k-pseudoprime for all positive k which have no factors greater than 1 in common with n.

(a) Find all three-digit 2-pseudoprimes that are not 
prime. 

(b) Find all three-digit 3-pseudoprimes that are not 
prime. 





18. 


19. 


27 Excursion: Modular Arithmetic and Cryptography 177 


(c) Find all three-digit 5-pseudoprimes that are not prime.
(d) Find the smallest Carmichael number. 
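The pseudoprime condition is a single modular exponentiation, so it is cheap to test. The sketch below assumes the defining congruence is $k^{n-1} \equiv_n 1$ and checks it on the two-digit number 91 = 7 · 13, which lies outside the range asked about above. The function names are hypothetical.

```python
def is_k_pseudoprime(k, n):
    """True when k^(n-1) leaves remainder 1 on division by n."""
    return n > 1 and pow(k, n - 1, n) == 1


def is_prime(n):
    """Trial division; adequate for the small numbers in these exercises."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True


print(is_k_pseudoprime(3, 91), is_prime(91))
print(is_k_pseudoprime(2, 91))
```

Here 91 passes the test for k = 3 even though it is not prime, while it fails the test for k = 2.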


The next few exercises involve systems of more than one equation of modular arithmetic.
A single number satisfies many different congruences using modular arithmetic. For example, the number 16 satisfies all the following:





16 





Some of these are redundant. For example, if a number n satisfies $n \equiv_6 4$, we know that n = 6k + 4 for some integer k. From this form (specifically since n = 2(3k + 2)), we can see that if n is divided by 2, there will be a 0 remainder, and also (since n = 3(2k + 1) + 1) that if n is divided by 3, there will be a remainder of 1. Hence, we have proved the following:


























Proposition If $n \equiv_6 4$, then $n \equiv_2 0$ and $n \equiv_3 1$.
Now you prove each of the following:
(a) If $n \equiv_6 5$, then $n \equiv_2 1$ and $n \equiv_3 2$.
(b) Iz =x 7,thenn =, 3 and n =s 2, 
(© In mss 11, then n = 1 „andn =s d 
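Before writing a proof of a statement of this shape, it can help to test it numerically. The sketch below checks the proposition about $n \equiv_6 4$ over a range of integers; the function name `check_implication` and the search limit are assumptions made for illustration, and a finite check does not replace the proof.

```python
def check_implication(m, r, consequences, limit=1000):
    """Test: whenever n % m == r, every (modulus, remainder) pair also holds.

    Python's % returns a nonnegative remainder for a positive modulus,
    which matches the remainder in the division theorem.
    """
    return all(
        n % modulus == remainder
        for n in range(-limit, limit)
        if n % m == r
        for modulus, remainder in consequences
    )


print(check_implication(6, 4, [(2, 0), (3, 1)]))
```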





Another interesting fact is that each of the above state- 
ments also has a converse that is true. In other words, 
complete information about mod 2 and mod 3 deter- 
mines mod 6 information for any given number. Let's 
see how to prove this: 
Proposition If $n \equiv_2 0$ and $n \equiv_3 2$, then $n \equiv_6 2$.
Proof Let an integer n be given, and assume that $n \equiv_2 0$ and $n \equiv_3 2$. The division theorem leaves us only six possibilities for the remainder when n is divided by 6:
• Case 0: Suppose $n \equiv_6 0$. Then reasoning as in (a) above, we can conclude that $n \equiv_2 0$ and $n \equiv_3 0$. The latter of these conditions contradicts the assumption that $n \equiv_3 2$.





























• Case 1: Suppose $n \equiv_6 1$. Then reasoning as in (a) above, we can conclude that $n \equiv_2 1$ and $n \equiv_3 1$. Both these conditions contradict the assumptions that $n \equiv_2 0$ and $n \equiv_3 2$.

• Case 2: Suppose $n \equiv_6 2$. Then reasoning as in (a) above, we can conclude that $n \equiv_2 0$ and $n \equiv_3 2$. This does not create a contradiction with the given information.

• Case 3: Suppose $n \equiv_6 3$. Then reasoning as in (a) above, we can conclude that $n \equiv_2 1$ and $n \equiv_3 0$. Both these conditions contradict the assumptions that $n \equiv_2 0$ and $n \equiv_3 2$.

• Case 4: Suppose $n \equiv_6 4$. Then reasoning as in (a) above, we can conclude that $n \equiv_2 0$ and $n \equiv_3 1$. The













78 Chapter 2 / A Primer of Mathematical Writing 


latter of these conditions contradicts the assumption that $n \equiv_3 2$.

• Case 5: Suppose $n \equiv_6 5$. Then reasoning as in (a) above, we can conclude that $n \equiv_2 1$ and $n \equiv_3 2$. The first of these conditions contradicts the assumption that $n \equiv_2 0$.

The only case that does not create a contradiction is Case 2. Hence, it must be the case that $n \equiv_6 2$.
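The case analysis in this proof amounts to tabulating, for each remainder mod 6, the pair of remainders mod 2 and mod 3. A small sketch of that table follows; the function name `residue_pairs` is hypothetical.

```python
def residue_pairs(m1, m2):
    """For each remainder r mod m1*m2, record the pair (r mod m1, r mod m2)."""
    return {r: (r % m1, r % m2) for r in range(m1 * m2)}


pairs = residue_pairs(2, 3)
for r, pair in pairs.items():
    print(f"n = {r} (mod 6)  ->  n = {pair[0]} (mod 2), n = {pair[1]} (mod 3)")

# Only one remainder mod 6 matches the assumptions n = 0 (mod 2), n = 2 (mod 3).
matches = [r for r, pair in pairs.items() if pair == (0, 2)]
print(matches)
```

The six pairs are all different, which is why the mod 2 and mod 3 information together pins down the remainder mod 6.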


For each of the following, find the smallest positive integer x that satisfies all the given congruences. If you believe that no such integer exists, explain why.


Chapter 2 Summary


) r2 3and x es 1 
Tand x 2,3 





(e) xm 2,1 m 3, and x m 1 
(D x254,x 2, 2 and x =y 18 

20. The Chinese remainder theorem states that when n and m have no positive common divisors other than 1, the two congruences $x \equiv_n a$ and $x \equiv_m b$ can be satisfied by the same number x. Use facts from this section to prove this is true in the special case when m and n are prime numbers.
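Systems of congruences like the ones in the last two exercises can be explored by direct search. The sketch below relies on the fact that if x is a solution, then so is x plus any common multiple of the moduli, so it is enough to search from 1 up to their least common multiple. The function name `smallest_solution` is an assumption, and `math.lcm` requires Python 3.9 or later.

```python
from math import lcm


def smallest_solution(congruences):
    """Smallest positive x with x % m == r % m for every (m, r), or None."""
    bound = lcm(*[m for m, _ in congruences])
    for x in range(1, bound + 1):
        if all(x % m == r % m for m, r in congruences):
            return x
    return None  # the congruences contradict one another


print(smallest_solution([(3, 2), (5, 3)]))   # moduli with no common divisor
print(smallest_solution([(2, 1), (4, 0)]))   # odd and divisible by 4: impossible
```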











2.1 Mathematical Writing
Terms and concepts

• You should be familiar with implications and the circumstances in which an implication is false.

• You should be familiar with the term counterexample, and you should be able to find counterexamples to false implications about simple properties of numbers.

• You should be able to identify the hypothesis and conclusion of an implication, even when the statement is written informally.

• You should be able to write the contrapositive of an implicational statement, and you should understand that it is equivalent to the original statement.

• You should know the formal definitions of even and odd as they will be used in proofs in this text.

Proofs

• You should be able to write a proof for a simple implication involving basic properties of numbers like "even" and "odd."

• You should be able to prove an implication by forming its contrapositive, then proving that contrapositive implication.

• You should be able to trace a proof for a simple implication involving basic properties of numbers.


2.2 Proofs About Numbers
Terms and concepts

• You should know the formal definition of divisible by as it will be used in proofs in this text.

• You should understand the division theorem and be able to identify quotients and remainders for any division problem.

• You should know the definition of rational number as it will be used in proofs in this text.

• You should be able to use the mod operation on integers.

Proofs

• You should be able to write a proof for a statement whose hypothesis or conclusion states that a particular integer is divisible by another.

• You should be able to correctly use the division theorem to define cases for a proof.

• You should be able to write a proof for a statement whose conclusion states that a particular integer has a particular form.

• You should be able to write a proof for a statement whose hypothesis or conclusion states that a particular real number is rational.

2.3 Mathematical Induction
Terms and concepts





• You should be familiar with a predicate over the natural numbers, and you should be able to write statements like P(3) or P(m − 1) given a specific statement of the form P(n). There is more discussion of predicates in general in Section 1.4.

• You should be comfortable with sigma notation. There is more discussion of sigma notation in Section 1.2.

• You should understand the principle of mathematical induction.

Proofs

• You should be able to give an informal "table-oriented" proof by mathematical induction, for these situations:

  - Given a recursive description of a sequence of integers, you should be able to prove that the terms of the sequence also satisfy a given closed formula.

  - Given a summation (whether in sigma notation or not), you should be able to prove that the summation satisfies a given closed formula.

• You should also be able to write these proofs by mathematical induction using the formal textbook style.


2.4 More About Induction 
Skills and proofs 


• You should gain exposure to using mathematical induction in a variety of contexts.

• In particular, you should be familiar with the use of mathematical induction in some of these settings:

  - Given a summation, write a recursive description for $s_n$, the sum of the first n terms of the sum, and prove a given formula for $s_n$ using induction. This ties together the two types of problems in Section 2.3.

  - Write proofs concerning the Fibonacci numbers and other second order recurrences.

  - Write induction proofs involving inequalities satisfied by recursively defined sequences.

  - Use mathematical induction to establish divisibility properties.

  - Prove statements that revisit some problems from Section 1.1.


2.5 Contradiction and the Pigeonhole Principle
Terms and concepts

• You should understand the idea of proof by contradiction and how it differs from direct proof and from proof of the contrapositive statement.

• You should be familiar with some classic proofs by contradiction:

  - The real number $\sqrt{2}$ is irrational.
  - There are an infinite number of primes.

• You should be familiar with the various forms of the pigeonhole principle: the general version, the basic version, and perhaps the average version which is introduced in the exercises.

Skills

• For a given implication, you should be able to identify how a proof by contradiction would begin.

• You should be able to correctly negate an existence statement. This revisits a topic first appearing in Section 1.4.

• You should be able to recognize instances where a contradiction proof can be converted to a direct proof or to a proof of the contrapositive statement.

• You should be able to apply the pigeonhole principle to appropriate problems.

Proofs

• You should be able to write a proof by contradiction for a statement about basic properties of numbers.

• You should be able to prove that a particular real number is irrational using either contradiction or contraposition.

• You should be able to prove the various versions of the pigeonhole principle using either contradiction or contraposition.


2.6 Excursion: Representation of 
Numbers 


Terms, concepts, and skills 

• You should be familiar with the terms binary, octal, decimal and hexadecimal as relating to various bases for numerals.

• You should be able to convert numbers between decimal and any other base.

• You should be able to convert directly between binary, octal and hexadecimal without converting to decimal as an intermediate step.

• You should be able to prove simple statements about representations of numbers such as rules for divisibility.

• At the end of this section we learn about some applications of binary numerals to specific problems like the Josephus problem and perfect shuffles of cards. You should be able to follow these examples.


2.7 Excursion: Modular Arithmetic 
and Cryptography 


Terms, concepts, and skills

• You should be able to do basic computations using the mod n equivalence, denoted $\equiv_n$ in the section.

• You should be able to construct the addition and multiplication tables for mod n arithmetic for any positive integer n.

• You should know that multiplicative inverses (of nonzero elements) do not always exist in mod n arithmetic unless n is prime.

• You should understand the statement and proof of Fermat's little theorem.

• You should understand how to encrypt and decrypt simple messages using the RSA system.
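Several of the skills listed for Section 2.7 can be practiced with a few lines of code. The sketch below builds mod n tables and lists which nonzero values have multiplicative inverses; all function names are assumptions made for illustration.

```python
def addition_table(n):
    """Row a, column b holds (a + b) mod n."""
    return [[(a + b) % n for b in range(n)] for a in range(n)]


def multiplication_table(n):
    """Row a, column b holds (a * b) mod n."""
    return [[(a * b) % n for b in range(n)] for a in range(n)]


def elements_with_inverses(n):
    """Nonzero a for which some b gives (a * b) mod n == 1."""
    return [a for a in range(1, n)
            if any((a * b) % n == 1 for b in range(1, n))]


for row in multiplication_table(5):
    print(row)

print(elements_with_inverses(7))  # prime modulus
print(elements_with_inverses(8))  # composite modulus
```

With the prime modulus 7 every nonzero value has an inverse, while with the composite modulus 8 only some do.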








Sets and Boolean Algebra


In the previous chapter, we introduced the fundamentals of mathematical proof-writing, and we practiced both writing about and reasoning with numbers. In this chapter, we study sets and their generalization to a more abstract structure, Boolean algebra. In addition to providing another setting in which to practice mathematical reasoning and writing, these topics will illustrate an important attribute of higher mathematics. Moving from the concrete to the abstract can often make computational problems easier for those willing to adapt to the more abstract setting.

The chapter begins by introducing the general operations on, and properties of, sets. After we become comfortable with the formal terminology and notation, we will practice our proof techniques using sets as a new context for mathematical writing. The concept of "set" naturally leads to the abstract concept of a Boolean algebra, which resolves some connections among set operations, logical connectives from the first chapter, and the study of circuits in computer science. The chapter concludes with a practical technique for simplifying circuits that is based on the abstract properties of Boolean algebras.


3.1 Set Definitions and Operations 





Sets are among the fundamental building blocks of mathematics. They also have a simple logical structure, so they provide good practice to strengthen our reasoning and writing abilities. We will work with the informal definition of a set as a collection of objects, called the members or elements of the set. It has been











Chapter 3 / Sets and Boolean Algebra


established that this loose definition can lead to paradoxes.* To avoid this, we will take the pragmatic point of view of most mathematicians. In practice, we will work only within a small number of well-understood sets, so we will not worry about the paradoxical set constructions that are better discussed in a course on the foundations of mathematics.








Definition Here are some of the common sets of numbers we will use:

• $\mathbb{N}$ is the set of natural numbers; these are numbers that can answer counting problems ($\mathbb{N} = \{0, 1, 2, 3, \ldots\}$).

• $\mathbb{Z}$ is the set of integers ($\mathbb{Z} = \{\ldots, -3, -2, -1, 0, 1, 2, 3, \ldots\}$).

• $\mathbb{Q}$ is the set of rational numbers; these are characterized as ratios of integers.

• $\mathbb{R}$ is the set of real numbers; these can be thought of as decimal numbers with possibly unending strings of digits after the decimal point.








At times variations of these basic sets are used, so it's good to adopt consistent notation for the most common circumstances:

• $\mathbb{R}^+$ is the set of positive real numbers.
• $\mathbb{R}^{\geq 0}$ is the set of nonnegative real numbers.
• $\mathbb{Q}^+$ is the set of positive rationals.
• $\mathbb{Q}^{\geq 0}$ is the set of nonnegative rationals.
• $\mathbb{Z}^+$ is the set of positive integers.
• $\mathbb{Z}^{\geq 0}$ is the same as $\mathbb{N}$.














Example 1 List three numbers that are:

1. Integers but not natural numbers.
2. Rational numbers but not integers.
3. Real numbers but not rational numbers.

SOLUTION

1. −100, −5, and −1 are all integers, but none are natural numbers.


2. $, SH, and 2.12 (since 2.12 = 8) are all rational numbers, but none arè | 
integers. 


3. In Section 2.5 we proved (between examples and exercises) that the real numbers $\sqrt{2} \approx 1.414213562\ldots$, $\sqrt{3} \approx 1.732050808\ldots$, and the golden ratio $\frac{1+\sqrt{5}}{2} \approx 1.618033988\ldots$ are not rational numbers.








a 


* The most famous of these is Russell's paradox. See Exercise 33 for details. 








3.1 Set Definitions and Operations 183


Subsets 


The sets in the above definition are clearly related to one another. For example, every 
rational number is also a real number. To make relationships like this an integrated 
part of our mathematical language, we need some new terminology and notation: 











Definition 


1. The notation $x \in A$ means "x is an element of A," which means that x is one of the members of the set A.

2. A is a subset of B (written $A \subseteq B$) if every element in A is also an element in B. Formally, this means that for every x, if $x \in A$, then $x \in B$.

3. A is equal to B (simply written A = B) means that A and B have exactly the same members. This is expressed formally by saying, "$A \subseteq B$ and $B \subseteq A$."

4. A set that contains no elements is called an empty set, and is denoted by $\{\,\}$ or $\emptyset$.

5. For any given discussion, all the sets will be subsets of a larger set called the universal set or universe, for short. We commonly use the letter U to denote this set.
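The formal definition of subset is a "for every x" statement, and it translates almost word for word into code. The following is only an illustrative sketch using Python's built-in `set` type; the function name `is_subset` is an assumption (Python also provides the `<=` operator for the same test).

```python
def is_subset(A, B):
    """A is a subset of B when every x in A is also in B."""
    return all(x in B for x in A)


def are_equal(A, B):
    """A = B means A is a subset of B and B is a subset of A."""
    return is_subset(A, B) and is_subset(B, A)


A = {2, 4}
B = {1, 2, 3, 4, 5}
empty = set()

print(3 in B)               # membership
print(is_subset(A, B))      # every element of A is in B
print(is_subset(B, A))      # 1 is in B but not in A
print(is_subset(empty, A))  # no element of the empty set can fail the test
```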











For example, since 2 is a natural number, we can write $2 \in \mathbb{N}$. Since 2 is also a positive real number, we can also write $2 \in \mathbb{R}^+$. With this notation, we can express relationships between some of the common sets of numbers described above. For example,

• $\mathbb{N} \subseteq \mathbb{Z}$ conveys the fact that every natural number is also an integer.

• $\mathbb{Z} \subseteq \mathbb{Q}$ conveys the fact that every integer is also a rational number since, for example, $3 = \frac{3}{1}$.

• $\mathbb{Q} \subseteq \mathbb{R}$ conveys the fact that every rational number is also a real number.








Since many problems involve sets of numbers of one type or another, frequent choices for the universal set are $\mathbb{R}$, the set of all real numbers, or $\mathbb{Z}$, the set of all integers. We will be explicit about the universal set whenever it is not clear from the context of a problem.

To describe small sets, we can simply list all the elements within braces. This is 
called the roster method for set description. 











Example 2

1. $\{1, 2, 3, 4, 5\}$ is a set containing exactly five positive integers. Notice that $\{1, 2, 3, 4, 5\} \subseteq \mathbb{N}$.

2. These three sets are all the same: the set $\{1, 2, 3, 4, 5\}$, the set $\{2, 4, 1, 5, 3\}$, and the set $\{1, 1, 2, 2, 3, 3, 4, 4, 5, 5, 5, 5\}$. Each consists of the numbers 1, 2, 3, 4, and 5. The order in which we list the members is irrelevant. Listing the elements more than once does not change the set.











a 




3. $\{2, 4\} \subseteq \{1, 2, 3, 4, 5\}$ is true, but $\{1, 2, 3, 4, 5\} \subseteq \{2, 4\}$ is false. 1 is a counterexample to the statement "If $x \in \{1, 2, 3, 4, 5\}$, then $x \in \{2, 4\}$."

4. $\emptyset \subseteq \{1, 2, 3\}$ is true, since there is no counterexample to the statement "If $x \in \emptyset$ then $x \in \{1, 2, 3\}$." The empty set is a subset of every set.

5. {Joe, Tom, Sue, Mary} is a set containing four names. (Here the universal set might be the set of all names of people.)

6. $\{(1, 3), (2, 5), (3, 7)\}$ is a set of ordered pairs such as one might plot on graph paper in an algebra class.

7. $\{\{3, 4\}, \{5, 6, 7\}\}$ is a strange-looking set, but it is completely legitimate. This set contains two members, each of which is itself a set. This is really no more shocking than having a box containing two smaller boxes or an envelope containing two smaller envelopes. We will see more examples like this in the next section.
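Several points from Example 2 can be observed directly with Python's built-in sets, which also ignore order and repetition. This is a sketch for illustration only; the particular nested set used here is an assumption, and `frozenset` is needed because Python requires the members of a set to be immutable.

```python
s1 = {1, 2, 3, 4, 5}
s2 = {2, 4, 1, 5, 3}
s3 = {1, 1, 2, 2, 3, 3, 4, 4, 5, 5, 5, 5}
print(s1 == s2 == s3)  # order and repeats do not matter

pairs = {(1, 3), (2, 5), (3, 7)}  # a set of ordered pairs
print(len(pairs))

# A set whose two members are themselves sets.
nested = {frozenset({3, 4}), frozenset({5, 6, 7})}
print(len(nested))
print(frozenset({3, 4}) in nested)  # the inner set is a member
print(3 in nested)                  # the number 3 is not
```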





Practice Problem 1 Which of these statements are true? 


(a) {abba, aabba, ababa} is a legal set.
(b) (4,1,2,3) € (1,2,3,4,3,2, 1) 

(c) $\{6, 8, 10\} \subseteq \{1, 2, 3, 4, 5, 6, 7, 8, 9, 10\}$.
(d) $\mathbb{Q} \subseteq \mathbb{N}$.

(e) Ø (abba, aabba, ababa). 





It is often impossible, or at best impractical, to list every element in a set. To 
describe larger sets, we usually use set builder notation, which actually has two some- 
what different forms. Here are some examples to illustrate the first of these ideas. 


Example 3

1. To describe the even natural numbers, we can write $\{x : x \in \mathbb{N} \text{ and } x \text{ is even}\}$. This is read, "The set of x such that x is an element of $\mathbb{N}$ and x is even." Here are two other ways to describe the same set:

• $\{x \in \mathbb{N} : x \text{ is even}\}$.
• $\{x \in \mathbb{N} : x = 2k \text{ for some } k \in \mathbb{N}\}$.

2. $\{x \in \mathbb{R} : -2 < x \leq 2\}$ can also be written as the interval* $(-2, 2]$.

3. $\{n \in \mathbb{N} : n \text{ has exactly two positive divisors}\}$ is better known as the set of prime numbers.

4. $\{x \in \mathbb{R} : x^2 + 1 = 0\}$ contains no elements at all, so $\{x \in \mathbb{R} : x^2 + 1 = 0\} = \{\,\}$.


In each of these examples, there is a universe U given and the description of the set has the form

$\{x \in U : x \text{ is } \_\_\_\_\}$

* See Exercises 20 and 21 at the end of this section for more on interval notation.










which is read, "The set of all x in U such that ____." Since the large blank contains some property of x that determines whether a particular element of U is or is not to be included in the set, we will refer to these as property descriptions for set-builder notation.
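A property description has a close analogue in Python's set comprehensions: pick elements from a universe and keep those that pass a test. Because a program cannot list an infinite universe, the sketch below assumes a finite stand-in, the natural numbers from 0 to 20; the variable names are hypothetical.

```python
U = range(0, 21)  # finite stand-in for the natural numbers

# {x in U : x is even}
evens = {x for x in U if x % 2 == 0}

# {n in U : n has exactly two positive divisors}
primes = {n for n in U
          if sum(1 for d in range(1, n + 1) if n % d == 0) == 2}

# {x in U : x*x + 1 = 0} has no members
no_solutions = {x for x in U if x * x + 1 == 0}

print(sorted(evens))
print(sorted(primes))
print(no_solutions == set())
```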








Practice Problem 2 Write each of the following in set-builder notation using a 
property description: 


(a) The set of even integers
(b) The set of real numbers bigger than 10
(c) The set of rational numbers whose square is less than 2


In the first example above, we see that the defining "property" for x is really a matter of the form of x. For example, when describing the set of even natural numbers, the property "is even" is the same as "has the form 2k for some $k \in \mathbb{N}$." Hence, we can describe this set based on the form of its elements as follows:

$\{2k : k \in \mathbb{N}\}$

We read this, "All numbers of the form 2k where k is from $\mathbb{N}$." We will refer to this as a form description within our set-builder notation.


Example 4 Write each of the following sets in set-builder notation using a form description:

1. The set of integers that are multiples of 3
2. The set of perfect square integers
3. The set of natural numbers that end with a 1
4. $\mathbb{Q}$

SOLUTION

1. $\{3k : k \in \mathbb{Z}\}$
2. $\{m^2 : m \in \mathbb{N}\}$ or $\{m^2 : m \in \mathbb{Z}\}$ (They describe the same set.)
3. $\{10k + 1 : k \in \mathbb{N}\}$
4. $\{\frac{a}{b} : a \in \mathbb{Z} \text{ and } b \in \mathbb{Z}^+\}$
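A form description also maps onto a set comprehension, this time with the formula on the left and the source of the parameter on the right. The sketch below uses small finite ranges in place of $\mathbb{N}$ and $\mathbb{Z}$, which is an assumption forced by the fact that a program cannot enumerate an infinite set.

```python
# {3k : k in Z}, with k restricted to -3..3
multiples_of_3 = {3 * k for k in range(-3, 4)}

# {m^2 : m in N} and {m^2 : m in Z}, with |m| at most 5
squares_from_N = {m * m for m in range(0, 6)}
squares_from_Z = {m * m for m in range(-5, 6)}

# {10k + 1 : k in N}, with k restricted to 0..4
ends_in_1 = {10 * k + 1 for k in range(0, 5)}

print(sorted(multiples_of_3))
print(squares_from_N == squares_from_Z)  # negative m adds nothing new
print(sorted(ends_in_1))
```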


Practice Problem 3 Each of the following sets is written using a property description. For each one, write five elements of the set and then give a "form description" of the whole set.

(a) $\{x \in \mathbb{Z} : x = 5k \text{ for some } k \in \mathbb{Z}\}$
(b) $\{y \in \mathbb{N} : y \text{ is one more than a perfect square}\}$
(© Ir €Q:r = gi forsomea e Zand ke N) 













New Sets from Old 


Everyone is familiar with the many ways a scientific calculator can operate on a number or numbers to produce some other related number. For example, the + button takes two numbers and returns the sum of these numbers, and the $e^x$ button takes a single number x and returns the value of the constant e raised to the power x. Similarly, there are many ways to build new sets from given sets. Just as we learn simple arithmetic before more complicated numerical functions, in this section we will begin with the simplest set operations, and proceed to more complex operations in the next section.





Definition Given two sets A and B of elements from a universal set U,

1. The intersection of A and B (written $A \cap B$; think of "n for intersection") is the set that contains those elements common to both A and B. In set-builder notation, we write

$A \cap B = \{x \in U : x \in A \text{ and } x \in B\}$

2. The union of A and B (written $A \cup B$; think of "U for Union") is the set that contains those elements in either set A or B. In set-builder notation, we write

$A \cup B = \{x \in U : x \in A \text{ or } x \in B\}$

3. The difference of A and B (written $A - B$) is the set that contains those elements in A which are not in B. In set-builder notation, we write

$A - B = \{x \in U : x \in A \text{ and } x \notin B\}$











Example 5 Let $U = \mathbb{N}$. For the sets $A = \{1, 2, 3, 4, 5\}$, $B = \{2, 4, 6, 8, 10\}$, and $C = \{8, 10, 12\}$, find (1) $A \cap B$, (2) $A \cup B$, (3) $A - B$, (4) $B - A$, and (5) $A \cap C$.

SOLUTION

1. $A \cap B = \{2, 4\}$
2. $A \cup B = \{1, 2, 3, 4, 5, 6, 8, 10\}$
3. $A - B = \{1, 3, 5\}$
4. $B - A = \{6, 8, 10\}$
5. $A \cap C$ does not contain any elements; that is, $A \cap C = \{\,\}$.
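The three operations just defined correspond to the `&`, `|`, and `-` operators on Python's built-in sets. The short sketch below recomputes Example 5 this way; it is offered as a way to check hand calculations, not as part of the text's development.

```python
A = {1, 2, 3, 4, 5}
B = {2, 4, 6, 8, 10}
C = {8, 10, 12}

print(A & B)  # intersection: elements in both A and B
print(A | B)  # union: elements in A or in B
print(A - B)  # difference: elements of A that are not in B
print(B - A)
print(A & C)  # prints set(), Python's way of showing the empty set
```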


The last example illustrates an important relationship between sets. Since A and C have no common elements (i.e., $A \cap C = \{\,\}$), we can refer to them as being nonoverlapping, mutually exclusive, or disjoint. We will use the latter of these terms later when we discuss counting and probability, so it is worth making it a formal definition now.













Definition Sets A and B are disjoint if $A \cap B = \emptyset$.














£ 


Explore more on 
the Web. (1,2 





Practice Problem 4 Let U 
4,3] 


|. For the sets A— (1,3), B— 9. and C = 





(a) What is $A \cap B$?
(b) What is $A \cup B$?
(c) What is $A \cup C$?
(d) Is A a subset of B?
(e) Is B a subset of A?
(f) Give an example of two disjoint sets D and E whose union is C.





As we mentioned earlier, to avoid some well-known paradoxes we will work only within well-understood sets. Although we do not always mention it, there is always a "universal set" so that every set we are discussing is a subset of that universal set. (We usually use the notation U for this universal set.) Under this assumption, for a given set A it makes sense to talk about the set containing those elements that are not in A. We call this set the complement of A.








Definition Given a set A with elements from the universe U, the complement of A (written $A'$) is the set that contains those elements of the universal set U which are not in A. That is, $A' = U - A$.











Example 6 For the universal set $U = \{1, 2, 3, 4, 5, 6, 7, 8, 9, 10\}$ and the sets $A = \{1, 2, 3, 4, 5\}$ and $B = \{2, 4, 6, 8, 10\}$, find each of the following:

1. $A'$
2. $B'$
3. $A \cap B'$
4. $U'$

SOLUTION

1. $A' = \{6, 7, 8, 9, 10\}$
2. $B' = \{1, 3, 5, 7, 9\}$
3. $A \cap B' = \{1, 2, 3, 4, 5\} \cap \{1, 3, 5, 7, 9\} = \{1, 3, 5\}$. This is exactly the same thing as $A - B$.
4. $U' = \emptyset$. Since U is the entire universal set, "the set that contains those elements of the universal set U which are not in U" is a set with no elements in it.


o 
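Since the complement is defined as $A' = U - A$, it can be computed as a set difference once the universe is written down explicitly. The sketch below recomputes Example 6; the function name `complement` is an assumption made for illustration.

```python
U = set(range(1, 11))
A = {1, 2, 3, 4, 5}
B = {2, 4, 6, 8, 10}


def complement(S, universe):
    """Elements of the universe that are not in S."""
    return universe - S


print(complement(A, U))
print(complement(B, U))
print(A & complement(B, U) == A - B)  # intersecting with B' removes B
print(complement(U, U))               # the empty set
```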
















Practice Problem 5 For the universal set $U = \{1, 2, 3, 4, 5, 6, 7, 8, 9, 10\}$ and the sets $A = \{1, 2, 3, 4, 5\}$ and $B = \{2, 4, 6, 8, 10\}$, calculate the following:

(a) $(A \cap B)'$
(b) $A' \cup B'$. Is this the same as $(A \cap B)'$?

Each of the notational conventions for describing sets has a particular form 
when it comes to expressing the result of these set operations. As an example, we 
look specifically at set-builder notation below. 


Example 7 Given the sets A (the natural numbers that are multiples of 3) and B (the natural numbers that are prime), express each of the following using set-builder notation:

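1. $A \cap B$
2. $A \cup B$
3. $A - B$
4. $B - A$

SOLUTION

1. $A \cap B = \{x \in \mathbb{N} : x = 3\}$ ($= \{3\}$ via the roster method)
2. $A \cup B = \{z \in \mathbb{N} : z \text{ is prime or a multiple of 3}\}$
3. $A - B = \{x \in \mathbb{N} : x \neq 3 \text{ and } x \text{ is a multiple of 3}\}$ (or $\{3k : k \in \mathbb{N} \text{ and } k \geq 2\}$)
4. $B - A = \{x \in \mathbb{N} : x \neq 3 \text{ and } x \text{ is prime}\}$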





o 


Properties and Venn Diagrams 


When we study operations on numbers, we take advantage of many properties of and relationships between these operations to allow us to write expressions in different ways. For example, the distributive property allows us to factor 5x + 15 as 5 · (x + 3), and we know that this is useful in the algebraic context of solving equations. Properties like the commutative property, the associative property, and the distributive property are clearly important for performing algebraic manipulations and generally understanding numerical functions.

The set operations that we have just introduced have many of these same properties, as well as some properties that the numerical operations definitely do not have.


Proposition 1 For sets A, B, and C, the following properties hold:

Commutative property for ∩: $A \cap B = B \cap A$
Commutative property for ∪: $A \cup B = B \cup A$
Associative property for ∩: $(A \cap B) \cap C = A \cap (B \cap C)$
Associative property for ∪: $(A \cup B) \cup C = A \cup (B \cup C)$
Distributive property of ∩ over ∪: $A \cap (B \cup C) = (A \cap B) \cup (A \cap C)$
Distributive property of ∪ over ∩: $A \cup (B \cap C) = (A \cup B) \cap (A \cup C)$
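The six identities of Proposition 1 can be spot-checked on randomly chosen sets. The sketch below is an experiment, not a proof: passing many random trials only suggests that the identities hold. The function names, the universe of 0 through 11, and the fixed seed are all assumptions made for illustration.

```python
import random


def random_subset(universe, rng):
    """Include each element of the universe with probability 1/2."""
    return {x for x in universe if rng.random() < 0.5}


def proposition_1_holds(A, B, C):
    """Check all six identities of Proposition 1 for these three sets."""
    return (
        A & B == B & A
        and A | B == B | A
        and (A & B) & C == A & (B & C)
        and (A | B) | C == A | (B | C)
        and A & (B | C) == (A & B) | (A & C)
        and A | (B & C) == (A | B) & (A | C)
    )


rng = random.Random(0)  # fixed seed so the run is repeatable
universe = range(12)
trials = [
    proposition_1_holds(*(random_subset(universe, rng) for _ in range(3)))
    for _ in range(200)
]
print(all(trials))
```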







We will compose a more extensive list of these properties in Theorem 6 of Section 3.3, and we will see some striking similarities between the set properties given in that theorem and the properties for elementary logic given in Theorem 2 of Section 1.3. In the section on Boolean algebra, we will discover that this is not a coincidence.

In the remainder of the current section, we will discuss how we might assess a new statement about sets whose truth is in question. To do this, we will use a visualization technique called a Venn diagram.

A Venn diagram is simple. We use a large square to represent the universe U, and then add circles within the square to represent sets. We imagine that elements of the universe in a set S fall inside the circle for S and elements not in the set S fall outside of that circle. When the sets are small, we can actually draw the whole picture without taxing our imaginations too much.














Example 8 Let $U = \{1, 2, 3, \ldots, 14, 15, 16\}$, $A = \{1, 3, 5, 7, 9, 11, 13, 15\}$, $B = \{2, 3, 5, 7, 11, 13\}$, and $C = \{1, 4, 9, 16\}$. Draw the Venn diagram for these sets showing all elements of the universe U.

SOLUTION The rectangle in Figure 3-1 denotes the universe U, so all 16 elements from the universe are shown within the rectangle. Numbers within the labeled circles indicate the elements of the respective sets. Notice that the picture has empty regions where there are no set elements. So, for example, the fact that $B \cap C = \{\,\}$ is evident from the picture.
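The contents of each region of the diagram in Example 8 can be worked out with the set operations from earlier in the section. The sketch below is one way to organize that bookkeeping; the variable names are hypothetical.

```python
U = set(range(1, 17))
A = {1, 3, 5, 7, 9, 11, 13, 15}
B = {2, 3, 5, 7, 11, 13}
C = {1, 4, 9, 16}

outside_all = U - (A | B | C)  # inside the rectangle, outside every circle
only_A = A - B - C
only_B = B - A - C
only_C = C - A - B

print(sorted(A & B))
print(sorted(A & C))
print(B & C)  # prints set(): this overlap region is empty
print(sorted(outside_all))
print(sorted(only_A), sorted(only_B), sorted(only_C))
```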











Figure 3-1 Venn diagram for Example 8.


While this visualization tool is helpful for many aspects of the study of sets, we 
will use it mainly to analyze statements about sets like those in Proposition 1. To 
investigate a statement about generic sets using Venn diagrams, we try to draw the 
most general situation for two or three sets so that we can easily see how a proposition 
might fail. Since the sets in such a statement are not specifically given, we use shading 
to indicate regions in the picture that correspond to the results of the set operations 
within the proposed statement. This process is best illustrated with an example. 


Example 9 Use Venn diagrams to illustrate the truth of the distributive property, $A \cap (B \cup C) = (A \cap B) \cup (A \cap C)$.

SOLUTION We do this by considering the two sides of the equation separately, in each case thinking about how to "build up" the set described. On the left-hand side, the set $A \cap (B \cup C)$ is built by first computing $B \cup C$ and then $A \cap (B \cup C)$. Figure 3-2 shows how to draw these two steps.

On the right-hand side, the set $(A \cap B) \cup (A \cap C)$ is built by first computing $A \cap B$ and $A \cap C$, and then forming $(A \cap B) \cup (A \cap C)$. Figure 3-3 shows how to draw these three steps. Since the diagram for $A \cap (B \cup C)$ is the same as the diagram for $(A \cap B) \cup (A \cap C)$, we have our visual evidence that the statement is true.
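A three-set Venn diagram has eight regions, one for each way an element can be inside or outside A, B, and C. Shading a region corresponds to asking whether an element with that membership pattern belongs to the set. The sketch below compares two set expressions region by region; the function name `same_on_all_regions` is an assumption, and the expressions are written with `and`, `or`, and `not` in place of ∩, ∪, and "not in".

```python
from itertools import product


def same_on_all_regions(lhs, rhs):
    """Compare two membership rules on all 8 regions of a 3-set diagram.

    Each rule takes three booleans (in A?, in B?, in C?) and returns
    whether an element of that region belongs to the set described.
    """
    return all(
        lhs(a, b, c) == rhs(a, b, c)
        for a, b, c in product([False, True], repeat=3)
    )


# A n (B u C)  versus  (A n B) u (A n C)
distributive = same_on_all_regions(
    lambda a, b, c: a and (b or c),
    lambda a, b, c: (a and b) or (a and c),
)
print(distributive)
```

When the two rules agree on every region, the two shaded diagrams are identical.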


Practice Problem 6 Draw a Venn diagram to illustrate each side of the statement $A - (B \cup C) = (A - B) \cap (A - C)$. Use your diagram to decide if the statement is true.


Venn diagrams do not exactly constitute formal proofs since they are based less 
on deductive logic than on visual intuition. They are also limited since they cannot 
be easily applied to situations involving more than three sets. These diagrams can, 





[Figure 3-2 Venn diagram for the left-hand side of Example 9. Panels: $B \cup C$; $A \cap (B \cup C)$.]

[Figure 3-3 Venn diagram for the right-hand side of Example 9. Panels: $A \cap B$; $A \cap C$; $(A \cap B) \cup (A \cap C)$.]

[Figure 3-6 A Venn diagram for two sets.]


however, be quite useful in determining whether we believe a proposition, and in 
finding counterexamples for false statements. 


Example 10 Find a counterexample to the following statement:
A ∪ (B − C) = (A ∪ B) − C


SOLUTION On the left-hand side, the set A ∪ (B − C) is built up from computing
B − C and then A ∪ (B − C), as shown in Figure 3-4.

On the right-hand side, the set (A ∪ B) − C is built up from computing
A ∪ B and then (A ∪ B) − C, as shown in Figure 3-5. We notice that in the two
final diagrams, the A ∩ C region is shaded in the diagram for A ∪ (B − C)
but not shaded in the diagram for (A ∪ B) − C. Hence, choosing sets A, B,
and C so that A ∩ C is nonempty should provide a counterexample. Letting
A = {1, 2}, B = {3, 4}, and C = {2, 5}, we see that (A ∪ B) − C = {1, 3, 4} and
A ∪ (B − C) = {1, 2, 3, 4}, which are not equal sets.
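
The counterexample in Example 10 is easy to confirm by machine. The short sketch below is our own illustration using Python's built-in set type; the variable names `left` and `right` are assumptions made for this example.

```python
A = {1, 2}
B = {3, 4}
C = {2, 5}

left = A | (B - C)    # A ∪ (B − C)
right = (A | B) - C   # (A ∪ B) − C

print(sorted(left))          # [1, 2, 3, 4]
print(sorted(right))         # [1, 3, 4]
print(left == right)         # False
print(sorted(left - right))  # [2], the single element of A ∩ C
```

The element that separates the two sides is exactly the one lying in A ∩ C, which is the region where the two final diagrams disagree.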





It should be noted that if only sets A and B are involved in a statement about
sets, we can form a "two-set Venn diagram" by simply leaving out the set C in the
diagram. For example, the Venn diagram for A′ ∩ B is shown in Figure 3-6.








[Figure 3-4 Venn diagram for the left-hand side of Example 10. Panels: B − C and A ∪ (B − C).]

[Figure 3-5 Venn diagram for the right-hand side of Example 10. Panels: A ∪ B and (A ∪ B) − C.]





Practice Problem 7 Draw a two-set Venn diagram for each side of the following
equation. Use the result to give a specific example to illustrate that the statement is false.


(A ∪ B) − B = A


The Inclusion-Exclusion Principle 


Because sets provide a common structure for grouping together mathematical
objects, a natural question for a given set A is "How many elements does A have?" We
will see in later chapters that this simple question is central to some fairly advanced
mathematics. In the present chapter, we will merely address some basic questions so
that we can build on the answers in the pages to come.











Definition If A is a finite set, we use the notation n(A) to indicate the
number of elements in the set A.


Example 11 Find each of the following:

1. n({15, 16, 17, ..., 22})
2. Let A = {m ∈ Z⁺ : m < 200 and m is a multiple of 7}. Find n(A).
3. Let A = {2k : k ∈ Z⁺ and k ≤ 15} and B = {3k : k ∈ Z⁺ and k ≤ 10}. Find
n(A ∪ B).


SOLUTION

1. n({15, 16, 17, ..., 22}) = 8

2. n(A) = 28. To see this, we calculate 200/7 ≈ 28.57, so the largest multiple
of 7 that is less than 200 is 28 · 7 = 196, and A = {1 · 7, 2 · 7, ..., 28 · 7}.

3. Clearly, n(A) = 15 and n(B) = 10. A little thought tells us that

A ∩ B = {6k : k ∈ Z⁺ and k ≤ 5} = {6, 12, 18, 24, 30}

so if we form A ∪ B by simply combining the elements of A and B (via the
roster method)

A ∪ B = {2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30,
3, 6, 9, 12, 15, 18, 21, 24, 27, 30}

we will find that the only elements listed twice are those in A ∩ B. Hence,

n(A ∪ B) = n(A) + n(B) − n(A ∩ B) = 15 + 10 − 5 = 20

The Venn diagram in Figure 3-7 illustrates this relationship among A, B,
A ∩ B, and A ∪ B.

[Figure 3-7 Venn diagram for Example 11.]
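
The counts in Example 11 can be double-checked with a few lines of code. This is only a sketch for illustration; the names `sevens`, `both`, and `union_size` are our own and do not appear in the text.

```python
# Part 2: positive multiples of 7 that are less than 200.
sevens = {m for m in range(1, 200) if m % 7 == 0}
print(len(sevens))                 # 28

# Part 3: A = {2k : 1 <= k <= 15} and B = {3k : 1 <= k <= 10}.
A = {2 * k for k in range(1, 16)}
B = {3 * k for k in range(1, 11)}
both = A & B                       # the elements that would be listed twice

union_size = len(A) + len(B) - len(both)
print(len(A), len(B), len(both))   # 15 10 5
print(union_size, len(A | B))      # 20 20
```

The last line compares the value given by the counting formula with a direct count of A ∪ B.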


The last example illustrates an important principle about the size of the union 
of sets known as the inclusion-exclusion principle. We state the two-set and three-set 
versions as a theorem, and we leave the more general statement to be explored by 
the reader in the exercises. 


Theorem 2 (The Inclusion-Exclusion Principle) Let sets A, B, and C
be given.


1. n(A ∪ B) = n(A) + n(B) − n(A ∩ B)
2. n(A ∪ B ∪ C) = n(A) + n(B) + n(C) − n(A ∩ B) − n(A ∩ C) − n(B ∩ C)
+ n(A ∩ B ∩ C)





The following informal proof for the two-set case will be sufficient for our pur- 
poses: When you add n(A) and n(B), anything that is in both sets has been counted 
twice, so you need to subtract n(A N B) to adjust for this overcount. At the end of this 
section, Exercise 24 asks you to give a similar explanation of the three-set version. 
Exercise 25 then asks you to discover the correct four-set version of the principle.
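
Before explaining the three-set version, it can be reassuring to test it numerically. The sketch below is our own experiment, not the authors' method: the function name `three_set_count`, the fixed random seed, and the universe {0, 1, ..., 19} are all arbitrary assumptions, and agreement on random sets is evidence rather than proof.

```python
import random

def three_set_count(a, b, c):
    """Right-hand side of the three-set inclusion-exclusion formula."""
    return (len(a) + len(b) + len(c)
            - len(a & b) - len(a & c) - len(b & c)
            + len(a & b & c))

rng = random.Random(0)  # fixed seed so the experiment is repeatable
trials = []
for _ in range(1000):
    # Three random subsets of {0, 1, ..., 19}, each of a random size.
    a, b, c = (set(rng.sample(range(20), rng.randint(0, 20)))
               for _ in range(3))
    trials.append(three_set_count(a, b, c) == len(a | b | c))

print(all(trials))
```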


Practice Problem 8 For A = {n ∈ N : 200 ≤ n ≤ 500 and n is a multiple of 7} and
B = {n ∈ N : 200 ≤ n ≤ 500 and n is a multiple of 5}, calculate the following:


(a) n(A)
(b) n(B)
(c) n(A ∩ B)
(d) n(A ∪ B)





Solutions to Practice Problems 


1 (a) True. The universal set for this example might be the
set of all strings consisting of the letters a and b.

(b) True. Both sets contain exactly the numbers 1, 2, 3,
and 4, so these sets are equal. Any set is a subset of
itself.

(c) True. Every number in the set {6, 8, 10} is also in the
set {1, 2, 3, 4, 5, 6, 7, 8, 9, 10}.

(d) False. The number x = 1/2 provides a counterexample,
since it is in the set Q but not in the set

(e) True. It is impossible to find a counterexample, since
a counterexample would have to be an element of
the set on the left side, and ∅ has no elements.
Hence, the empty set is a subset of any set.

2 There are many correct ways to represent each set. We
give only one for each.

(a) The set of even integers is {x ∈ Z : x = 2y for some y ∈ Z}.

(b) The set of real numbers bigger than 10 is {x ∈ R :
x > 10}.

(c) The set of rational numbers whose square is less than
2 is {x ∈ Q : x² < 2}.

3 (a) {0, 5, 10, −5, −10, ...} = {5k : k ∈ Z}
(9) (1,5, 10, 101, 1025, ...) = (€ +1: ke NJ

Fed = igiziaeZ.keN)

4 (a) Since B contains no elements, there cannot be any
elements that A and B have in common, so A ∩ B = ∅;
that is, A ∩ ∅ = ∅.

(b) {1, 3}. Observe that A ∪ ∅ = A.

(c) {1, 2, 3, 4, 5}. A ∪ C = C because A ⊆ C.

(d) No, because the element 1 is in A but not in B.

(e) Yes. There are no possible counterexamples. The
empty set is a subset of every set.

(f) There are several ways to do this. One is to choose
D = {1} and E = {2, 3, 4, 5}. Another is to choose
D = {1, 2, 3, 4, 5} and E

5 (a) (A ∩ B)′ = {2, 4}′ = {1, 3, 5, 6, 7, 8, 9, 10}

(b) A′ ∪ B′ = {6, 7, 8, 9, 10} ∪ {1, 3, 5, 7, 9} = {1, 3, 5, 6,
7, 8, 9, 10}. Yes, this is the same as (A ∩ B)′.




















(c) ∅′ = U (since ∅ has nothing in it, "the set that contains
those elements of the universal set U which are
not in ∅" contains all the elements of U).

6 On the left-hand side, the set A − (B ∪ C) is built up
from first computing B ∪ C, and then A − (B ∪ C).
Figure 3-8 shows how to draw these two steps.

On the right-hand side, the set (A − B) ∩ (A − C)
is built up from first computing A − B and A − C, and
then forming (A − B) ∩ (A − C). Figure 3-9 on page 194
shows how to draw these three steps.

Since the diagram for A − (B ∪ C) is the same as
the diagram for (A − B) ∩ (A − C), we believe that the
statement is true.

7 If we let A = {1, 2} and B = {1, 3}, then A ∪ B = {1, 2, 3}
and so
(A ∪ B) − B = {2} ≠ A

8 (a) n({203, 210, ..., 497}) = n({29 · 7, 30 · 7, ..., 71 · 7}) = 43
(b) n({200, 205, ..., 500}) = n({40 · 5, 41 · 5, ..., 100 · 5}) = 61
(c) n({n ∈ N : 200 ≤ n ≤ 500 and n is a multiple
of 35}) = n({210, 245, ..., 490}) = n({6 · 35, 7 · 35, ...,
14 · 35}) = 9
(d) 43 + 61 − 9 = 95

[Figure 3-8 Left-hand side of Practice Problem 6. Panels: B ∪ C and A − (B ∪ C).]


Exercises for Section 3.1

1. Given the universal set U = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10}
and sets A = {2, 4}, B = {1, 2, 8}, and C = {1, 2, 5, 6,
10}, find each of the following:
(a) A ∩ C
(b) A ∪ B
wc
(d) B − C
(cop

2 Let = (x €N: x = Mforsomek € N), B = [y € Z 
2y is a perfect square}, and C = [z € Z: 2 < 1,000) 
List five elements in each of the following sets: 
(a) AU(BNC) 











[Figure 3-9 Right-hand side of Practice Problem 6. Panels: A − B, A − C, and (A − B) ∩ (A − C).]


©) (AUB)nC 
(9 An(BUC) 
(@ An(auc) 
. Fill in the blanks to make each sentence true. 
/ (a) The set of even integers is ve: 
z= 
(b) The set of integers that are powers of 2 is [x € N: 
x= j- 
(© The set of odd natural numbers is 
i= n 
(d) The set of odd perfect squares is [x €N: 
—— 
. List five elements of each of the following sets: 
(a) nine n] 
(b) (r:meZand-5 < m <3} 
(© B-2r:reQand0 <r <5} 
(D G-2reg:xeR and 05x55] 
graph of y — 3— 2r + x? might help.) 
"Let A=(Gr+1ixeN}, B= (2x+1ixeZ*), and 
C = (6x +1 : x € Z). List five elements in each of the 
following sets: 
(a) ANB 
b auc 
(B-A 
A-C 
i, Let U — (0, 1.2.3.4, 5.6}. Match each of the sets on 
the left with the appropriate set on the right. Not every 
set on the right will be used. 
G) Qke2:ke U) (A) (-2,2,610.14,18,22] 





ke 








(Hr: A 








(b) (2m:meU) (B) (0,4, 8, 12. 16, 20, 24) 
(©) (nin-2-4Lfor (C) 148,12,16, 20,24,28) 
some L € U} 
(d) (4p:peu) (D) (0,2, 4,6,8, 10,12) 
(E) (2.6, 10, 14, 18, 22, 26} 
(F) (2.4.6.8, 10,12,14) 


7 





10. 


Fill in the blanks to make each sentence true. 

(a) The set of even integersis (2x : x € 

(b) The set of perfect cubes is rr eZ). 

(©) The set of natural numbers that end with a 7 is 

ire . 

(d) The set of rational numbers between —1 and 1 is 
(eo: <a< . 

Fill in the blanks with either the symbol C or the sym- 

bol £ (which denotes “is not a subset of") to make the 

statement true. If you believe the first set is nor a subset 

of the second set, give an example of an element that 

supports your answer. 


fa) N. z 
-z Qc 
wQ. E 
(R r” 
ez Q 
OR Q 


in each of the following problems, three sets are de- 
scribed. One of the sets is not the same as the other 
‘two. In each case, find the set that is not like the oth- 





C=Re+4:zeZ) 
Write each of the following sets using a “form descrip- 
tion” instead of a “property description.” (For example, 
the property description [x € Z x is even] can be writ- 
ten using the form description 2k: k € ZI.) 
(a) (x €Z: xis odd] 


b keg: 





2" for somem e Z} 








(6) [e € N xis twice a perfect square) 

(@ (x € Z:xisthe productof woconsecutiveintegers} 
1L. List five elements of each of the following sets In each 
F case, also state which of the sets N, Z, Q, or R would be 

an appropriate universe forthe set. 

(a) (5 :a €Z,bEZ) 

6) (1:26 N] 

(© (a:a e Z*,beZ) 

(à) (à +P a EN, b eN) 

(© (Va: EN) 

12. For each of the following, determine what the given el 
ments have in common and then write the sct-builder 
notation description of a set that includes them. (Note 
that there are many correct answers, but the best an- 
swers are the simplest ones to describe) 
t2) 11.3,5,7.9, 0 
(b) (11, 33, 55, 77. 
(6) (1,9,17,25, 
à) (1,2, 4,8, 16,..) 
(9 0.35.1 ad 

13. Decide if each of the following statements about finite
sets is true or false. For any false ones, provide a specific
counterexample to back up your answer. (You do not
need to prove the true statements.)

(a) IfA C B, then n(A) < n(B). 
(b) I£ n(A) < n(B), then A C B. 
(©) IfA C B and A # B, then n(A) < n(B). 
(d) It n(A) < n(B) then A € B. 

14. Sometimes you will see the notation A ⊂ B or A ⊊ B to
denote the fact that A is a subset of B but A ≠ B. (We
read A ⊂ B as "A is a proper subset of B.") Write a few
sentences explaining how to remember the difference
between the set notation A ⊂ B and A ⊆ B, using an
analogy to the numerical notation x < y and x ≤ y.

15. For the universal set U = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10}
and sets A = {1, 3, 5}, B = {1, 2, 3, 4}, and C =
{1, 2, 5, 6, 10}, verify that the following set properties
hold by calculating each side of the equation separately.
(a) The distributive property of ∩ over ∪:
A ∩ (B ∪ C) = (A ∩ B) ∪ (A ∩ C)
(b) The distributive property of ∪ over ∩:
A ∪ (B ∩ C) = (A ∪ B) ∩ (A ∪ C)
(c) One of DeMorgan's laws:
(A ∪ B)′ = A′ ∩ B′
(d) The other one of DeMorgan's laws:
(A ∩ B)′ = A′ ∪ B′
(e) One of the absorption properties:
A ∩ (A ∪ B) = A
(f) The other absorption property:
A ∪ (A ∩ B) = A
16. Use Venn diagrams to verify each of the following
properties of our set operations:
(a) The distributive property of ∪ over ∩:
A ∪ (B ∩ C) = (A ∪ B) ∩ (A ∪ C)
(b) One of DeMorgan's laws:
(A ∪ B)′ = A′ ∩ B′
(c) The other one of DeMorgan's laws:
(A ∩ B)′ = A′ ∪ B′
(d) One of the absorption properties:
A ∩ (A ∪ B) = A
(e) The other absorption property:
A ∪ (A ∩ B) = A


17. Use Venn diagrams to check whether or not you believe
these properties are true. For those that are not
true, give a specific counterexample.

(a) A ∩ (B ∪ C) = (A ∩ B) ∪ C

(b) (B ∪ C) − A = (B − A) ∪ (C − A)

(c) A − (B ∩ C) = (A − B) ∪ (A − C)

(d) (A − B) ∪ (B − C) ⊆ (A − C)

(e) (B ∩ C) − A = B ∩ (C − A)

(f) If A ⊆ B, then A − B = B − A. (Hint: Since A ⊆ B,
draw the circle for A inside the circle for B.)

(g) If A ⊆ B, then A = A ∪ B.

18. Given A = {2x : x ∈ Z} and B = {3y : y ∈ Z}, describe
each of the following using the simplest set-builder
notation possible:

(a) A ∩ B
(b) A − B
(c) B − A
(d) A ∪ B
(e) Z − A













A partial sum of the harmonic series is a sum of the form 
Le Le Je for some positive integer n. A= 
{x € Q: x (in lowest terms) has an even denominator], 
B = (x e Q: x isa partial sum of the harmonic series], 
andC = (x €Q:1 <x < 2} are given. 

(a) What does it mean in English if B C A? 

(b) What does it mean in English if B C C? 

(c) What does it mean in English if AC = 9? 

(d) Write in notation involving A, B, or C and set oper- 
ations the following claim: “Every rational number 
between 1 and 2 has an even denominator (when in. 
lowest terms)” 

(e) Write in notation involving A, B, or C and set op- 
erations the following claim: “If a rational number 
between 1 and 2 has an even denominator (when in 
lowest terms), then that number is a partial sum of 
the harmonic series.” 

The notation [a, 6] (called interval notation) is used 

to describe a closed interval of real numbers. That is, 

[a.b] = (x € R: a <x < b). Rewrite these setsusingei- 

ther interval notation, set-builder notation, or a simple 

list of elements. 

(@) [-5.32]nZ 

() [-61,4]n [1.72] 

(©) [0.2] [4.10] 

‘There are several other variations on the interval nota- 

tion from Exercise 20. 

=(reR:a<x<b) 

(eeR:a<x <b} 

freR:a<x<b) 

IxeR:a <x) 

9 (-20, b]=(reRix sh) 

Rewrite the following sets using either interval nota 

tion, set-builder notation, or a familiar name of the se 

9) (0, 10] — [3.12] 

6) (0.19] — [5. 10) 

(©) (75. 00) u (70e. 2] 

(4) (75.2) (20. 0] 

(©) Qno. 20) 

(D Zn [0. c) 

. Express cach of the following sets as an interval or the. 

union of intervals: 

(a) Brix eoa 

(by (32: :x e [0.4]) 

(©) B=2e42:x €[0.6]) (HINT A graph of 
3— 2r +? might help.) 

@ [r ER:3— 2r +x? € [3.6]) 
y=3— 2r +x? might help.) 



































(Hinr: A graph of 














Figure 3-10 Sets for Exercises 23 and 24. 


23. See Figure 3-10.
(a) Calculate n(A), n(B), and n(C), where

A = {2, 6, 7, 9, 10, 12, 13, 16},
B = {2, 3, 4, 8, 9, 11, 12, 13, 14},
C = {1, 4, 5, 7, 8, 9, 11, 12, 13, 14, 15, 16}

(b) For the sets in part (a), find the following, using
(where appropriate) the two-set inclusion-exclusion
counting principle.

i. n(A ∩ B) and n(A ∪ B)
ii. n(A ∩ C) and n(A ∪ C)
iii. n(B ∩ C) and n(B ∪ C)
iv. n(A ∩ B ∩ C)
v. n(A ∪ B ∪ C)

24. See Figure 3-10 for the Venn diagram of the sets A, B,
and C in Exercise 23. Based on the two-set inclusion-exclusion
principle, you might expect that n(A ∪ B ∪ C)
would be n(A) + n(B) + n(C) − n(A ∩ B) − n(A ∩ C)
− n(B ∩ C) (that is, add up the three sets and then
subtract the overlaps). However, this does not match
your answer in Exercise 23(b). Why is this formula not
correct? In particular, how many times does this formula
count each element in A ∩ B ∩ C?

25. State the four-set version of the inclusion-exclusion 
principle, and invent an example to illustrate that you 











< 1,000). 
(a) Use the inclusion-exclusion property to find the size
of the set

{n ∈ T : n is a multiple of 2 or 3}

(b) Use the inclusion-exclusion property to find the size
of the set

{n ∈ T : n is a multiple of 2 or 3 or 5}

(c) Use your answer to the previous part to find the size
of the set

{n ∈ T : n is a multiple of neither 2 nor 3 nor 5}





27. A certain club is forming a recruitment committee
consisting of five of its members. They have calculated that
there are 8,568 different ways to form such a committee.
The club has two members named Jack and Jill. They
have calculated that 2,380 of the potential committees
have Jack on them, 2,380 have Jill, 1,820 have Jack but
not Jill, 1,820 have Jill but not Jack, and 560 have both
Jack and Jill.

(a) How many committees have either Jack or Jill?

(b) How many committees have neither Jack nor Jill?

(c) Jack and Jill are car-pooling, so they insist that if
either one is on the committee, the other person
must also be on the committee. How many committees
meet this condition?

(d) Jack and Jill have had a fight. Jack says, "If Jill is
on the committee, I won't be." Jill says, "Likewise."
How many of the committees meet this condition?

28. In Mrs. Smith's science class, there are a total of 25 students.
Of these, 15 are female, and 15 are honor students.
Draw a Venn diagram for a specific situation consistent
with this information, and then answer the following
questions:

(a) What is the smallest possible number of female students
who are on the honor roll?

(b) What is the largest possible number of female students
who are on the honor roll?

29. In Mr. Jones' math class, all 40 students must complete
activities at three stations during the current marking
period. So far, 6 students have completed stations A and
B, 10 students have completed stations A and C, and 15
students have completed stations B and C. Everyone in
class has completed at least one station, and for each
of the three stations, five students have completed only
that one station. Draw a Venn diagram for this situation,
and use it to determine how many students have
completed all three stations.

30. Students at Apollo Elementary School receive ribbons
at the end of each year at a school-wide awards ceremony.
This year 120 students receive gold stars for
perfect attendance, 180 receive certificates for participating
in the science fair, and 80 students receive blue
ribbons for outstanding grades. Of these, 40 students
who receive the attendance star receive no other award,
50 students who receive the science fair certificate receive
no other award, and 10 students who receive the
blue ribbon receive no other awards. In addition, 10 students
receive all three awards and 65 students receive
no awards. Draw a Venn diagram for this situation, and
determine how many students attend the school this
year.

31. Students at Apollo Elementary School receive ribbons
at the end of each year at a school-wide awards ceremony.
Last year 100 students received gold stars for
perfect attendance, 200 received certificates for participating
in the science fair, and 50 students received blue
ribbons for outstanding grades. In addition, 10 students
received all three awards and 80 students received no
awards. Draw a Venn diagram for a specific situation
consistent with this information, and then answer the
following questions:

(a) What was the smallest possible number of children
in the school last year?

(b) What was the largest possible number of children
in the school last year?

(c) If in addition we know that a total of 100 children
received exactly one of the three awards, what was
the exact number of students in the school last year?

32. For each of the following, determine if the statement
is true or false. If false, give a specific counterexample.
If true, explain informally (using Venn diagrams if you
wish) why you think so.

(a) If A and B are disjoint, then A − B = A.

(b) If A and B are disjoint, then n(A ∪ B) = n(A) +
n(B).

(c) If A and B are disjoint, then n(B − A) = n(B) −
n(A).

(d) If A ⊆ B, then n(B − A) = n(B) − n(A).

(e) If A ⊆ B, then A − B = B − A.

33. Discuss the following paradox attributed to Bertrand
Russell. Let S be defined as the set of all sets X for
which X ∉ X.

(a) Is S ∈ S?

(b) Is S ∉ S?

(c) If C denotes the collection of all possible sets, is C
itself a set? (Hint: The previous parts are related
to this question.)







3.2 More Operations on Sets 





So far we have seen how to build new sets by combining or restricting the elements 
in existing sets. For example, if A and B are sets of numbers, then using any of the 
operations discussed so far, the resulting set will also be a set of numbers. We will 
now see that there are other ways to build new sets from old that involve creating 
completely new types of objects. 


Cartesian Products 


We first consider sets of pairs of objects. Anyone who has drawn a graph for an algebra
class is already familiar with ordered pairs of numbers. In algebra, we sometimes plot
individual points and sometimes graph "smooth" lines or curves. In either case, we
are creating a visual representation of a set of ordered pairs, as the following example
illustrates.


Example 1 In Figure 3-11, the graph on the left illustrates the set of points
{(1, 2), (2, 5)}, while the graph on the right illustrates the set of all ordered pairs (x, y)
of real numbers, where the y-coordinate is always one less than three times the
x-coordinate.


These examples show that we need notation and some language for working
with sets of ordered pairs.





Definition Given sets A and B, we define A × B = {(a, b) : a ∈ A and b ∈ B}.
(We read A × B as "A cross B" and call (a, b) "the ordered pair a, b.")
We often refer to A × B as the Cartesian* product of A and B. In the common
special case that both coordinates are taken from the same set, we often write
A² instead of A × A.
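
For readers who like to experiment, the definition translates almost word for word into code. The following sketch is only an illustration with sets of our own choosing; it uses `itertools.product` from the Python standard library, and tuples play the role of ordered pairs.

```python
from itertools import product

A = {1, 2, 3}
B = {"x", "y"}

# A × B: every ordered pair (a, b) with a in A and b in B.
A_cross_B = set(product(A, B))
print(len(A_cross_B))          # 6
print((1, "x") in A_cross_B)   # True
print(("x", 1) in A_cross_B)   # False: order matters in an ordered pair

# A² is shorthand for A × A.
A_squared = set(product(A, repeat=2))
print(len(A_squared))          # 9
```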

















With this notation, the set of points on the right in Example 1 could be described
by

{(x, y) ∈ R² : y = 3x − 1}

which is closer to the way we usually would describe the line in an algebra class.
When it is possible to draw a coordinate axis as in Example 1 with the horizontal
axis labeled with elements of A and the vertical axis labeled with elements of B, 
then we can produce a picture like the one in that example. In particular, this is 
always possible to do when the elements of A and B are numbers and when A and 
B are small sets. We will refer to the picture we get as the Cartesian graph of the 
set of ordered pairs. We will use the adjective “Cartesian” faithfully because of an 
unfortunate confluence of math terminology that can lead to confusion in Chapter 
nothing to worry about at the moment. 











* The adjective "Cartesian" pays homage to René Descartes (1596-1650), a French mathematician and
philosopher who greatly contributed to the notion that geometry could be studied using coordinates.














[Figure 3-11 The Cartesian graph of a set of ordered pairs. Left panel: {(1, 2), (2, 5)}.
Right panel: {(x, 3x − 1) : x ∈ R}.]


Practice Problem 1 Draw the Cartesian graph for each of the following sets of
ordered pairs:


(a) {(1, 2), (1, 3), (2, 2), (2, 4), (3, 1)}

(b) ((x,y) € R? : y 2x — 1} (It is only possible to draw a portion of the graph for 
this set.) 

© (aye 
set) 





= x°) (It is only possible to draw a portion of the graph for this 





Of course, sets of ordered pairs are relevant beyond the confines of a high school
algebra course. In particular, we have already seen them (in spirit at least) in some of
our earlier work in this book, and we will continue to bump into them in the future.





Example 2 Express each of the sets described below using efficient set notation.


1. In Chapter 1, when encountering two inhabitants A and B of the Island of Liars
and Truthtellers, we found ourselves considering all possibilities for what type of
person each was. That is, we formed (by way of rows in a truth table) the set of all
pairs of the form

(type of A, type of B)

2. In Chapters 1 and 2, we described a sequence like

2, 4, 8, 16, 32, ...

using the notation a₁ = 2, a₂ = 4, a₃ = 8, and so forth. To avoid the subscripted letter
a, we could have used ordered pairs where the first coordinate tells the position of
a term in the sequence and the second coordinate describes the value of that term:

(1, 2), (2, 4), (3, 8), (4, 16), (5, 32), ...

3. In Chapter 5, we will develop procedures for counting the size of increasingly
complex sets. An early example asks how many ways one can order a sandwich and
a drink from a restaurant that numbers sandwich choices 1, 2, 3, and labels drink
choices A, B, C, D. The set of all possible choices includes ordered pairs like these:

(1, A), (1, C), (2, A), (3, D), ...





SOLUTION

1. If we use V = {Truthteller, Liar}, then the given set can be simply expressed
as V × V = V².
2. {(n, 2ⁿ) : n ∈ Z⁺}
3. {1, 2, 3} × {A, B, C, D}


Before moving on to look at other set structures, we will investigate the size of 
the Cartesian product of two sets. This is obviously relevant to the last part of the 
preceding example. 


Example 3 Let A = {2, 4, 6, 8} and B = {1, 2, 3, 4, 5}. List all elements in A × B
in an orderly way. Will B × A have more elements than A × B? Are they the same
set? Draw the Cartesian graph of both sets.
SOLUTION In each case, we list the elements of the new set in a tabular
format that allows us to see a pattern. Table 3-1 shows A × B followed by
B × A. Because a 4 × 5 table and a 5 × 4 table have the same size, we can see
that A × B and B × A are sets of the same size. The Cartesian graph, shown
in Figure 3-12, makes the relationship between the size of A × B and the size
of B × A even more apparent. To see that A × B and B × A are not the same
set, we simply note that there are elements of A × B, like (2, 1) and (8, 5), for
example, that are not elements of B × A.

Table 3-1 A × B and B × A in Example 3

Elements of A | Elements of B →
              |   1       2       3       4       5
      2       | (2, 1)  (2, 2)  (2, 3)  (2, 4)  (2, 5)
      4       | (4, 1)  (4, 2)  (4, 3)  (4, 4)  (4, 5)
      6       | (6, 1)  (6, 2)  (6, 3)  (6, 4)  (6, 5)
      8       | (8, 1)  (8, 2)  (8, 3)  (8, 4)  (8, 5)

Elements of B | Elements of A →
              |   2       4       6       8
      1       | (1, 2)  (1, 4)  (1, 6)  (1, 8)
      2       | (2, 2)  (2, 4)  (2, 6)  (2, 8)
      3       | (3, 2)  (3, 4)  (3, 6)  (3, 8)
      4       | (4, 2)  (4, 4)  (4, 6)  (4, 8)
      5       | (5, 2)  (5, 4)  (5, 6)  (5, 8)

[Figure 3-12 Cartesian graph for Example 3.]











Practice Problem 2 For A = {1, 2} and B = {2, 4, 6, 8}, use the table approach
above to list all the elements of the set A × B. What is n(A × B)?


The table and Cartesian graphs lead to an obvious observation about the size 
of the Cartesian product of any two sets A and B. We will step through one possible 
proof of this fact in the exercises. 


Theorem 1 For all finite sets A and B,
n(A × B) = n(A) · n(B)
PROOF See Exercise 20 at the end of this section.
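
The observations of Example 3 and the count in Theorem 1 can be confirmed with a small computation. This is a sketch for one particular pair of sets, not a proof; the names `AxB` and `BxA` are our own.

```python
from itertools import product

A = {2, 4, 6, 8}
B = {1, 2, 3, 4, 5}

AxB = set(product(A, B))   # pairs (a, b)
BxA = set(product(B, A))   # pairs (b, a)

print(len(AxB), len(BxA), len(A) * len(B))   # 20 20 20
print(AxB == BxA)                            # False
print((2, 1) in AxB, (2, 1) in BxA)          # True False
```

The two products have the same size, but (2, 1) belongs only to A × B because 1 is not an element of A.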


We have seen that the Cartesian product of sets A and B is a set of ordered pairs.
There is no reason to restrict ourselves to pairs of elements when it might be natural
to have more than two objects tied together in this way. We can easily modify the 
language of ordered pair to include ordered triples or ordered quadruples, and the 
analogy is fairly clear. At some point, we will no longer know common English words 
for describing the number of elements we would like to convey, so mathematicians 
adopt a convention that is as descriptive as it is funny to say. 





Definition For any integer n ≥ 3, the structure (x₁, x₂, ..., xₙ) is called an
n-tuple.†








This generalization of ordered pair leads to the obvious generalization of the 
Cartesian product. 





Definition Given sets S₁, S₂, ..., Sₙ, we define the Cartesian product

S₁ × S₂ × ⋯ × Sₙ

as the set of all n-tuples (x₁, x₂, ..., xₙ) such that x₁ ∈ S₁, x₂ ∈ S₂, and so on.
As before, in the event that the sets S₁, S₂, and so on are actually all the same
set, we will often use the notational shorthand

Sⁿ = S × S × ⋯ × S














Example 4 Express each of the sets described below using efficient set notation.

1. In Chapter 1, when analyzing a proposition with four propositional variables p, q,
r, and s, we examined (via rows in a truth table) all possible quadruples of the form

(truth value of p, truth value of q, truth value of r, truth value of s)

2. The familiar 12 × 12 multiplication table we learn in elementary school conveys
a relationship among three pieces of information, two factors and their product.
Hence, instead of using the table format, we could describe the information in the
table as a list (2, 3, 6), (5, 1, 5), and so on, where the third number is the product of the
first two numbers.


SOLUTION
1. This set can be expressed as {T, F} × {T, F} × {T, F} × {T, F} or equivalently
as {T, F}⁴.

2. If we use S to denote the set {1, 2, 3, ..., 11, 12}, then we can express the
12 × 12 multiplication table using the "property description"

{(a, b, c) ∈ S × S × N : a · b = c}

or the "form description"

{(a, b, a · b) : (a, b) ∈ S × S}


† You can pronounce this as "tupple" or "toople." In either case, mathematicians will know what you
mean, while nonmathematicians will just crack up.
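
The two descriptions of the multiplication table can be compared directly. The sketch below is our own illustration; the names `form_version` and `property_version` are assumptions, and the third coordinate is searched only up to 144 because no product in the table is larger.

```python
S = range(1, 13)  # the set {1, 2, ..., 12}

# "Form description": build each triple directly from a pair (a, b).
form_version = {(a, b, a * b) for a in S for b in S}

# "Property description": keep the triples (a, b, c) that satisfy a * b == c.
# Every product is between 1 and 144, so that range of c is enough here.
property_version = {(a, b, c)
                    for a in S for b in S for c in range(1, 145)
                    if a * b == c}

print(form_version == property_version)  # True
print(len(form_version))                 # 144
print((2, 3, 6) in form_version)         # True
```

The form description is cheaper to compute because it never has to test triples that fail the property.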


Practice Problem 3 List five elements of each of the following sets: 
(a) (T. FP. 
(b) The set
S = {s ∈ {0, 1}⁵ : s consists of three 0's and two 1's in some order}
(c) At a restaurant that serves sandwiches from the set {1, 2, 3, 4, 5}, side orders from
the set {x, y, z}, and drinks from the set {A, B, C, D}, the set of all possible ways
to order one of each type of item can be described as
{1, 2, 3, 4, 5} × {x, y, z} × {A, B, C, D}


To finish this discussion and to follow up on our earlier work on the size of the 
Cartesian product of sets, we present the following generalization of Theorem 1. 


Theorem 2 For any finite sets S₁, S₂, ..., Sₙ,

n(S₁ × S₂ × ⋯ × Sₙ) = n(S₁) · n(S₂) ⋯ n(Sₙ)

PROOF See Exercise 23 at the end of this section.


Sets of Sets 


We will now look at another common construction with sets. Namely, we will consider
the possibility that the elements of a set can themselves be sets. For example, the
notation {1, {3, 4}, {1, 2}} simply describes a set with three elements, one of which is
a number (namely, 1) and the other two of which are sets (namely, {3, 4} and {1, 2}).
If this seems strange, imagine a school supply box containing three objects: a pencil
and two pencil boxes, with each containing pencils. The notion that a set can contain
other sets as elements is really no different than the idea that a box can contain other
boxes. However, many people confuse the basic set terminology like "element" and
"subset" when these types of sets are used. The following example is intended to
help clarify some of these potential difficulties.








Example 5 Answer each of the following questions about sets:

1. Is 1 ∈ {{1, 2}, {1, 3, 4}}?
2. Is {1, 2} ∈ {{1, 2}, {1, 3, 4}}?
3. What is n({{1, 2}, {2, 4}, {1, 3, 4}})?

SOLUTION

1. No. The number 1 is an element of the set {1, 2}, but the two elements of
{{1, 2}, {1, 3, 4}} are {1, 2} and {1, 3, 4}. While these sets themselves contain
1 as an element, there is no doubt that neither of these sets is actually equal
to 1.

2. Yes. As mentioned above, the set {1, 2} is an element of the set
{{1, 2}, {1, 3, 4}}.

3. n({{1, 2}, {2, 4}, {1, 3, 4}}) = 3 since the set {{1, 2}, {2, 4}, {1, 3, 4}} contains
three elements, each of which happens to be a set.
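
The distinction between "element of" and "element of an element" shows up clearly in code. The following sketch is only an illustration: Python's ordinary sets cannot contain other ordinary sets, so we use the built-in `frozenset` type for the inner sets, and the names `outer` and `three_sets` are our own.

```python
# The set {{1, 2}, {1, 3, 4}}, with frozensets as its two elements.
outer = {frozenset({1, 2}), frozenset({1, 3, 4})}

print(1 in outer)                  # False: 1 is not one of the two elements
print(frozenset({1, 2}) in outer)  # True: {1, 2} is an element of the set

# The set {{1, 2}, {2, 4}, {1, 3, 4}} has three elements, each a set.
three_sets = {frozenset({1, 2}), frozenset({2, 4}), frozenset({1, 3, 4})}
print(len(three_sets))             # 3
```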


Practice Problem 4 The set S = {1, 2, 3, {1, 2}, {1, 3, 4}} is strange since some of
its elements are numbers and some of its elements are sets. List the five elements of S.
What element of S is also a subset of S?


Once we accept the legitimacy of a set of sets, it makes sense to define an 
operation that produces the set of all subsets of a given set. 





Definition P(A) = {S : S ⊆ A}. (We read P(A) as "the power set of A.")











Example 6 Let A = {1, 2, 3, 4}, B = {2, 4, 6, 8}, and C = {1, 2, 3, 4, 5}.

1. List all elements in P(A) in an orderly way.
2. How much larger do you expect P(B) to be?
3. How much larger do you expect P(C) to be?


SOLUTION 


1. Table 3-2 illustrates a systematic way to build up the answer one row at a
time. Each row contains everything in the previous row, plus all the new sets
(i.e., those with the new element). To get the new sets, we simply insert the
new element into each of the old sets.





P({1})          | ∅, {1}
P({1, 2})       | ∅, {1}, {2}, {1, 2}
P({1, 2, 3})    | ∅, {1}, {2}, {1, 2},
                | {3}, {1, 3}, {2, 3}, {1, 2, 3}
P({1, 2, 3, 4}) | ∅, {1}, {2}, {1, 2}, {3}, {1, 3}, {2, 3}, {1, 2, 3},
                | {4}, {1, 4}, {2, 4}, {1, 2, 4}, {3, 4}, {1, 3, 4}, {2, 3, 4}, {1, 2, 3, 4}





Table 3-2 Building the Power Set Recursively 


2. We expect that P({2, 4, 6, 8}) will have the same size as P({1, 2, 3, 4}) since
the number of subsets of a four-element set should be the same regardless
of what the four elements are.

3. Using this same pattern, we expect P({1, 2, 3, 4, 5}) to contain 32 subsets:
the 16 we already have from P({1, 2, 3, 4}), and each of those same 16 subsets
but with the element 5 inserted.







The solution to Example 6 suggests a method for a proof of the following theorem.


Theorem 3 For any finite set A, if k = n(A), then n(P(A)) = 2^k.
PROOF See Exercise 24 at the end of this section.
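
The row-by-row construction of Table 3-2 can be sketched in a few lines of Python. This is only an illustration under our own assumptions: the function name power_set is ours, and the input is assumed to list distinct elements. Each pass of the loop keeps every old subset and adds a copy of each with the new element inserted, which is why the count doubles each time.

```python
def power_set(elements):
    """Return all subsets of `elements` as a list of frozensets,
    built one element at a time as in Table 3-2."""
    subsets = [frozenset()]  # the only subset of the empty set
    for x in elements:
        # Keep every old subset, and add a copy of each with x inserted.
        subsets = subsets + [s | {x} for s in subsets]
    return subsets


A = [1, 2, 3, 4]
P_A = power_set(A)
print(len(P_A))  # the number of subsets found
print(sorted(sorted(s) for s in P_A))
```

Running the function on a four-element list and then on a five-element list is a quick way to see the doubling that Theorem 3 describes. It is an experiment, not a proof.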


Before moving on, let's look at one more practice problem that combines the 
two main ideas from this section. 





Practice Problem 5 Let A = {1, 2, 3} and B = {1, 3, 5}.


1. Which is the larger set, P(A ∪ B) or P(A) ∪ P(B)?
2. Let C = {x, y, z}. Which is the larger set, C × (A ∩ B) or (C × A) ∩ (C × B)?





Partitions of a Set 


There are many instances in mathematics where a particular type of set whose ele- 
ments are sets is relevant. The basic idea is even present in problems being solved 
every day by children on playgrounds everywhere. 





Example 7 Alison, Billie, Chris, David, Ellen, and Fred want to play three-on-
three basketball. How can we address the question “How many ways can they divide
themselves into teams?” using the idea of sets of sets?
SOLUTION A basketball game can be represented as a set containing two 
basketball teams, each of which is a set of three players. Here are three examples 
of basketball games represented in this way: 


• {{A, B, C}, {D, E, F}}
• {{A, B, F}, {D, E, C}}
• {{F, C, B}, {D, E, A}}





Each of the examples above is indeed a set of sets, but they are special in a
couple of ways. First, all six children want to play, so all the letters A, B, C, D, E, F
must be used. Second, no child can be on both teams. These two properties are the
defining characteristics of what we will call a partition of the set of children.











Definition For a set A, a partition of A is a set S = {S_1, S_2, S_3, ...} of
subsets of A (each set S_i is called a part of S) such that

1. For all i, S_i ≠ ∅. That is, each part is nonempty.
2. For all i and j, if S_i ≠ S_j, then S_i ∩ S_j = ∅. That is, different parts have
nothing in common.
3. S_1 ∪ S_2 ∪ S_3 ∪ ··· = A. That is, every element in A is in some part.
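
For finite sets, the three conditions in this definition can be checked mechanically. The following Python sketch is our own illustration (the name is_partition and the way the sets are passed in are assumptions, not notation from the text). It tests each condition in turn for a proposed collection of parts.

```python
def is_partition(parts, A):
    """Check the three conditions of the definition for a finite set A."""
    parts = [frozenset(p) for p in parts]
    A = frozenset(A)
    # Condition 1: each part is nonempty.
    nonempty = all(len(p) != 0 for p in parts)
    # Condition 2: different parts have nothing in common.
    disjoint = all(p == q or not (p & q) for p in parts for q in parts)
    # Condition 3: the union of the parts is exactly A.
    covers = frozenset().union(*parts) == A
    return nonempty and disjoint and covers


children = "ABCDEF"
print(is_partition([{"A", "B", "C"}, {"D", "E", "F"}], children))
print(is_partition([{"A", "B", "C"}, {"C", "D", "E", "F"}], children))
```

The first call describes one of the basketball games from Example 7. The second puts child C on both teams, so it violates condition 2.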











Practice Problem 6 The six children in Example 7 want to form three teams of
two children each for a doubles table tennis tournament. Use partitions to list all the
ways they can do this.


Example 8 For each of the following requirements, find a partition of the set
{1, 2, 3, 4, 5, 6} that satisfies it:

1. Every part has the same size.
2. No two parts have the same size.
3. There are as many parts as possible.
4. There are as few parts as possible.


SOLUTION For the first two parts, there is more than one answer. 
1. {{1, 3}, {2, 4}, {5, 6}}
2. {{2}, {3, 6}, {4, 1, 5}}
3. {{1}, {2}, {3}, {4}, {5}, {6}}
4. {{1, 2, 3, 4, 5, 6}}





We can also have partitions of our (infinite) universal sets Z, Q, and so on. Some 
of these partitions are connected to concepts we have already studied, and some are 
important for higher mathematics. 





Example 9 Verify that each of the following is a partition of the given set:

1. Let A = {2k : k ∈ Z} and B = {2k + 1 : k ∈ Z}. Show that {A, B} is a partition
of Z.

2. Let A = {3k : k ∈ Z}, B = {3k + 1 : k ∈ Z}, and C = {3k + 2 : k ∈ Z}. Show that
{A, B, C} is a partition of Z.

3. For a rational number r, let P_r = {(a, b) ∈ Z × Z⁺ : a/b = r}. Show that {P_r : r ∈
Q} is a partition of Z × Z⁺.


SOLUTION In each case, we verify the three properties of a partition given 
by the definition: 


1. (1) Both A and B are nonempty since 0 ∈ A and 1 ∈ B. (2) A ∩ B = ∅ since
any number x in both A and B would have to satisfy x = 2K and x = 2L + 1
for some K ∈ Z and L ∈ Z, which in turn would imply that K − L = 1/2, which
is impossible for integers K and L. (3) The division theorem (for division by
2) from Section 2.2 tells us that any integer must be in A or B.

2. (1) All three sets are nonempty since 0 ∈ A, 1 ∈ B, and 2 ∈ C. (2) A ∩ B = ∅
since any number x in both A and B would have to satisfy x = 3K and x =
3L + 1 for some K ∈ Z and L ∈ Z, which in turn would imply that K − L = 1/3,
which is impossible for integers K and L. A similar argument shows that
A ∩ C = ∅ and B ∩ C = ∅. (3) The division theorem (for division by 3) from
Section 2.2 tells us that any integer must be in A, B, or C.

3. (1) Let r ∈ Q be given. Since r = a/b for some a ∈ Z, b ∈ Z⁺ by definition of
Q, the pair (a, b) is an element of Z × Z⁺ that belongs to P_r. Hence, P_r is
nonempty. (2) Let r ≠ s ∈ Q be given. Then P_r ∩ P_s = ∅, since any pair (a, b)
in both sets would have to satisfy r = a/b and s = a/b, which would in turn
mean that r = s, a contradiction. (3) For any (a, b) ∈ Z × Z⁺, we know that
a/b ∈ Q, so it follows that (a, b) ∈ P_(a/b). Hence, every element of Z × Z⁺
is in one of the parts.
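
A computer cannot examine all of Z, but the three partition properties for the sets in part 2 can at least be spot-checked on a finite window of integers. The sketch below is an illustration only; the window from −9 to 9 and the variable names are our own choices, and a finite check like this does not replace the argument above.

```python
window = set(range(-9, 10))  # a finite window of Z, chosen only for illustration

# The three sets from part 2 of Example 9, restricted to the window.
A = {3 * k for k in range(-10, 11)} & window
B = {3 * k + 1 for k in range(-10, 11)} & window
C = {3 * k + 2 for k in range(-10, 11)} & window

nonempty = all(len(part) != 0 for part in (A, B, C))
disjoint = not (A & B) and not (A & C) and not (B & C)
covers = (A | B | C) == window
print(nonempty, disjoint, covers)
```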








Another common way to define a partition of a set A is to describe which 
elements of A should belong to the same part of the partition. This is difficult to 
write down with our set-builder notation right now, but we will come back to this 
issue in Chapter 4 when we have a richer vocabulary for writing the description. 
The danger in the type of description below is that it is easy to accidentally describe 
something that is not actually a partition.


Example 10 For each description, write down the partition of the set A =
{1, 2, ..., 8, 9} described, if possible.

1. Elements a, b ∈ A are in the same part if and only if a − b is even.

2. Elements a, b ∈ A are in the same part if and only if a² and b² have the same digit
in the ones place.

3. Elements a, b ∈ A are in the same part if and only if a + b is a prime number.


SOLUTION 
1. The partition is {{1, 3, 5, 7, 9}, {2, 4, 6, 8}}.
2. The partition is {{1, 9}, {2, 8}, {3, 7}, {4, 6}, {5}}.
3. There is no such partition. To see why, consider the numbers 2, 3, and 5.
According to the description, 2 and 3 should be in the same part (since
2 + 3 = 5 is prime), 2 and 5 should be in the same part (since 2 + 5 = 7 is
prime), but 3 and 5 should not be in the same part (since 3 + 5 = 8 is not
prime). This is impossible.
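
The reasoning in Example 10 can be imitated by a short program. For each element a, we collect everything the description says should share a part with a, and then check that all members of that collection agree on what the part is. The following Python sketch rests on our own assumptions: the names partition_from_rule and is_prime are hypothetical, and we treat every element as automatically sharing a part with itself, as the solution above implicitly does.

```python
def is_prime(n):
    """Trial division; adequate for the small sums used here."""
    if n in (0, 1):
        return False
    return all(n % d != 0 for d in range(2, int(n ** 0.5) + 1))


def partition_from_rule(A, same_part):
    """Return the set of parts described by same_part, or None when the
    description is inconsistent (so that no such partition exists)."""
    A = list(A)
    # The part that each element a is supposed to belong to.
    part_of = {a: frozenset(b for b in A if b == a or same_part(a, b)) for a in A}
    for a in A:
        # Every member of a's part must describe exactly the same part.
        if any(part_of[b] != part_of[a] for b in part_of[a]):
            return None
    return set(part_of.values())


A = range(1, 10)
print(partition_from_rule(A, lambda a, b: (a - b) % 2 == 0))
print(partition_from_rule(A, lambda a, b: (a * a) % 10 == (b * b) % 10))
print(partition_from_rule(A, lambda a, b: is_prime(a + b)))
```

The third call returns None for the reason given in the solution: the part containing 2 would have to contain both 3 and 5, yet 3 and 5 may not share a part.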


More on the Size of Sets 


We close this section with an important connection between two of the structures we
have discussed. This connection will be developed much more in Chapter 5.
If you have been paying attention, you may have noticed that the results about
the size of Cartesian products and the size of the power set are identical in some
cases.

• For A = {1, 2, ..., k}, Theorem 3 tells us that n(P(A)) = 2^k.
• For S = {0, 1}, Theorem 2 tells us that n(S^k) = 2^k.

Could this be a coincidence? Of course not. To see the connection, we will list
the elements in the two relevant sets when k has a very manageable value of 3.


Example 11 Find a direct connection between the sets P({1, 2, 3}) and {0, 1} ×
{0, 1} × {0, 1}.
SOLUTION In Table 3-3, each set on the left describes which coordinates
are 1's in the corresponding ordered triple on the right.

P({1, 2, 3}) | {0, 1}^3
∅            | (0, 0, 0)
{3}          | (0, 0, 1)
{2}          | (0, 1, 0)
{2, 3}       | (0, 1, 1)
{1}          | (1, 0, 0)
{1, 3}       | (1, 0, 1)
{1, 2}       | (1, 1, 0)
{1, 2, 3}    | (1, 1, 1)

Table 3-3 Correspondence for Example 11


Practice Problem 7 Table 3-4 maintains the same correspondence that we saw in
Example 11. Fill in the missing entries.


In some books, this correspondence is taken one step further, leading the authors
to use the notation 2^A instead of P(A) for the power set of A. Given the above
correspondence, this choice of notation is very appropriate.
Pt.23,4n | (2.4) | {1,3} | | 
tar | 0.1.0.0 [arao [a101 


Table 3-4 Complete for Practice Problem 7 
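
The correspondence of Example 11 can also be written as a pair of small conversion routines, one in each direction. This Python sketch is illustrative only; the function names subset_to_tuple and tuple_to_subset are our own, and k denotes the size of the underlying set {1, 2, ..., k}.

```python
from itertools import product


def subset_to_tuple(S, k):
    """Coordinate i is 1 exactly when i is an element of S."""
    return tuple(1 if i in S else 0 for i in range(1, k + 1))


def tuple_to_subset(t):
    """Collect the positions (counted from 1) whose coordinate is 1."""
    return {i for i, bit in enumerate(t, start=1) if bit == 1}


# List every ordered triple next to the subset of {1, 2, 3} it describes.
for t in product([0, 1], repeat=3):
    print(sorted(tuple_to_subset(t)), t)
```

Because each routine undoes the other, every subset is matched with exactly one tuple, which is why the two sets have the same size.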


Solutions to Practice Problems 


1 Figure 3-13 on page 208 shows the three Cartesian graphs in order from left to right.

Figure 3-13 Cartesian graphs for Practice Problem 1.

2 A × B = {(1, 2), (1, 4), (1, 6), (1, 8), (2, 2), (2, 4), …, (2, 8), …}, so n(A × B) = …

3 (a) {T, F}^3 = {(T, T, T), (T, T, F), (T, F, T), (T, F, F), (F, T, T), ...}
(b) S = {(0, 0, 0, 1, 1), (0, 0, 1, 0, 1), (1, 1, 0, …), (0, 1, 0, 1, 0), (1, 0, 0, 0, 1), ...}
(c) (1, x, A), (1, y, D), (2, x, A), and (3, y, B) are a few of the elements.

4 {1, 2} is an element of the set {1, 2, 3, {1, 2}, {1, 3, 4}}. Since 1 and 2 are also
elements of {1, 2, 3, {1, 2}, {1, 3, 4}}, the set {1, 2} is a subset of
{1, 2, 3, {1, 2}, {1, 3, 4}} as well.

5 Let A = {1, 2, 3} and B = {1, 3, 5}.
(a) P(A) ∪ P(B) consists of all sets that are subsets of A or subsets of B, while
P(A ∪ B) consists of all subsets of A ∪ B. The latter includes more items since,
for example, {1, 2, 5} is a subset of A ∪ B while not being a subset of either
A or B itself.
(b) Let C = {x, y, z}. C × (A ∩ B) and (C × A) ∩ (C × B) both describe all ordered
pairs with the first element from C and the second from {1, 3}, so they are
identical sets.

6 The complete list of all 15 partitions is given below:

{{A, B}, {C, D}, {E, F}}   {{A, B}, {C, E}, {D, F}}   {{A, B}, {C, F}, {D, E}}
{{A, C}, {B, D}, {E, F}}   {{A, C}, {B, E}, {D, F}}   {{A, C}, {B, F}, {D, E}}
{{A, D}, {B, C}, {E, F}}   {{A, D}, {B, E}, {C, F}}   {{A, D}, {B, F}, {C, E}}
{{A, E}, {B, C}, {D, F}}   {{A, E}, {B, D}, {C, F}}   {{A, E}, {B, F}, {C, D}}
{{A, F}, {B, C}, {D, E}}   {{A, F}, {B, D}, {C, E}}   {{A, F}, {B, E}, {C, D}}

List for Practice Problem 6.

7 Table 3-5 has been completed.


PUL234 || (34 | a 


| as | 62 |24 





(0,1 


[oron | @1,0.0) | a.1,0 [0.109 10.100 


Table 3-5 Solution for Practice Problem 7 


Exercises for Section 3.2


1. Given the universal set U = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10}
and sets A = {2, 4}, B = {1, 2, 8}, and C = {1, 2, 5, 6, 10},
find each of the following:

(a) A × B
(b) (A × B) − (A × A)
(c) P(B)
(d) P(B ∩ C)
(e) P(B) − P(B ∩ C)

2. In each of the following problems, three sets are described.
In each case, decide which set is not like the others.


wA 
B 


= (3) ix € NI 
1/3. y) : y€ NI 
c 13:-3)::e2*] 
(by A — (Gy ce y) x ve zi 
B=((x,y.4—y):4,y €Z) 
- d yrty)in yen) 
3. Determine the size of each of the following sets: 






(2) (5 € P(11,2.3) n5) = 2 
€) (S € P(U,2,3)) :$0 1.210] 
(© IS € P(0.2,3.4]) $n (1,4) = 0) 





4. List five elements of each of the following sets:
(a) {(X, Y) ∈ P({1, 2, 3}) × P({1, 2, 3}) : X ⊆ Y}
(b) {(X, Y) ∈ P({1, 2, 3}) × P({1, 2, 3}) : X ∩ Y = ∅}
(c) {(X, Y) ∈ P({1, 2, 3}) × P({1, 2, 3}) : n(X) = n(Y)}

5. Let A = {4k : k ∈ Z}, B = {4k + 1 : k ∈ Z}, and C = {4k + 2 : k ∈ Z}.
Show that {A, B, C} is not a partition of Z.

6. Let A = {2k : k ∈ Z}, B = {3k + 1 : k ∈ Z}, and C = {6k + 5 : k ∈ Z}.
Show that {A, B, C} is not a partition of Z.























l. Let A= Dk: ke Zl, B=[3k+1:ke Z), C= He 
2:keZ}, and D=(Sk+3:keZ}. Show that 
(A. B.C. D) is not a partition of Z 

Let A= Bk:ke Z), B= [Mee V: ke Z), C o He 
3:keZ| and D=[6k+1:keZ). Show that 
|A. B.C, D] is not a partition of Z. 

9. Let A = {2k : k ∈ Z}, B = {4k + 1 : k ∈ Z}, and C = {4k + 3 : k ∈ Z}.
Explain why {A, B, C} is a partition of Z.

10. Each of these statements is false. Find a counterexample for each.

(a) For all sets A, B, and C, A × (B ∪ C) = (A × B) ∩ (A × C).
(b) For all sets A and B, A × (B − A) = A × B.
(c) For all sets A and B, P(A) ∪ P(B) = P(A ∪ B).
(d) For all sets A and B, if A − B = ∅, then A …

11. Identify which of the following statements is true. For each false statement,
provide a counterexample. You do not have to prove the true statements.

(a) For all sets A, B, and C, if A ⊆ B, then (A × C) ⊆ (B × C).
(b) For all sets A and B, (A ∪ B) × (A … B) = A² − …
(c) For all sets A, B, and C, A × (B × C) = (A × B) × C.

12. Which is typically larger, P(A × B) or P(A) × P(B)? Give an example of sets
A and B for which these are the same size.

13. Calculate the size of these sets, given A = {1, 3, 5} and B = {2, 4, 6, 8}. This
can be done using only the facts that n(A) = 3 and n(B) = 4. You do not need
to list all elements of the new sets.

(a) P(A)
(b) P(B)
(c) A × B
(d) P(A × B)
(e) P(A) × P(B)






















14. Generalize the previous problem. That is, using the letters k = n(A) and
m = n(B), find an expression for each of the following in terms of k and m:

(a) n(P(A))
(b) n(A × B)
(c) n(P(A × B))
(d) n(P(A) × P(B))

15. How many different basketball games can be played in Example 7?

16. Which of the following are partitions of the set A = {1, 2, 3, 4, 5, 6, 7, 8}?
For those that are not, explain why not.

(a) S = {{1, 2, 3, 4, 5}, {6, 7, 8}}
(b) P = {{1, 5}, {6, 7, 2}, {4, 3, 5}, {8}}
(c) T = {{1, 8}, {3, 5}, {7, 2}}
(d) T = {{4, 2, 3}, {5, 1, 8}, {6, 7}}

17. Give a partition of A = {1, 2, 3, 4, 5, 6, 7, 8} satisfying
each of the following criteria:

(a) Every part has the same size.
(b) There are exactly three parts, all having different sizes.
(c) Even numbers are all in the same part.
(d) Prime numbers are all in the same part.

18. Using the definition of ×, explain why A × ∅ = ∅ for
any given set A.

19. Let A = {1, 2, 3, 4, 5}. Use mathematical induction to
prove that for all integers n ≥ 1, if B = {1, 2, ..., n},
then the number of elements in A × B is 5 · n.

20. Let A = {1, 2, ..., k}. Use mathematical induction to
prove that for all integers n ≥ 1, if B = {1, 2, ..., n},
then the number of elements in A × B is k · n.

21. The sets A × B × C and (A × B) × C are very similar
but not equal. Give an example of these sets for a specific
choice of A, B, and C, and explain the differences
between the two sets.

22. Explain informally why (for k ≥ 3) the sets S_1 × S_2 × ··· × S_(k−1) × S_k
and (S_1 × S_2 × ··· × S_(k−1)) × S_k have the same size.

23. Use the previous exercise along with Theorem 1
to prove by induction on k ≥ 2 that for all sets
S_1, S_2, ..., S_k,

n(S_1 × S_2 × ··· × S_k) = n(S_1) · n(S_2) ··· n(S_k).

24. Use mathematical induction to prove that for every
n ≥ 1, if a set has n elements, then its power set has
2^n elements.




3.3 Proving Set Properties 





The set operations union, intersection, and complement have many important prop-
erties, some of which were illustrated in the examples and exercises of the previous
section. Some of these properties are “obvious.” For example, if you think about
what intersection of sets (∩) means, then it is not hard to see that A ∩ B ⊆ A and
that A ∩ B = B ∩ A. Our main purpose in this section is to learn how to create
careful proofs of theorems about sets.

In particular, in this section we will learn how to prove that one set is a subset of 
another set, and how to prove that two sets are equal. The techniques we use apply 
equally well to “obvious” properties as they do to the less intuitive properties. 





Element-Wise Proofs 


We begin by learning how to prove that one set is a subset of another set, using a
proof technique called an element-wise proof. Here is a simple concrete example of
an element-wise proof.


Example 1 For the sets B = {2, 4, 6, 8, 10} and C = {2, 4}, explain why C ⊆ B
is true.
SOLUTION The only elements of C are 2 and 4, and each of those elements
is also an element of B.


To prove that C is a subset of B, you must convince the READER that each element
of C has to be an element of B. The terminology element-wise comes from the fact
that the proof deals with the individual elements of the sets.


To show that one set is a subset of another, we establish that
every element of the first set must also be an element of the
second set.


Here is a slightly more abstract example. The sets are too large for us to list all 
the elements, so we are forced to think instead about the properties the elements 
have. 





Example 2 Let A be the set {0, 10, 20, 30, 40, ...} and let B be the
set {..., −6, −4, −2, 0, 2, 4, 6, ...}. That is, A = {10k : k ∈ N} and B = {k ∈ Z :
k is even}. Explain why A ⊆ B is true.








SOLUTION Because the sets are infinite, we cannot list all the elements of 
A and verify that each is in B. Instead, we imagine a READER who is trying to 
find a counterexample to the if, then statement “If x € A, then x € B,” and we 
write this letter: 


Dear READER, 


As you search for a counterexample, you are choosing numbers that make
the hypothesis true and hoping to discover that the conclusion is false. To make
the hypothesis true, you must choose an element of A; let's agree to call it x.
Since x ∈ A, I know that x has the form 10k for some k ∈ N.

To convince you that this value cannot make the conclusion false, I need
to show that x is in B. To show x ∈ B, I need to show x is even. Since you agree
that x = 10 · k, I know I can write x = 2 · (5 · k). Since 5 · k is also an integer,
this shows that your x is even, which means that your x is in the set B.

This is why there is no counterexample to the if, then statement, and hence
you must agree that the statement A ⊆ B is true.












Hoping we're still friends, 


AUTHOR 
o 


Our next example is more abstract. We will not just prove a statement about
particular sets, but we will prove that a property holds for any choice of sets. We give
two versions of the proof, the first a “letter to the READER,” and the second a more
concise version of the same proof.


Proposition 1 A ∩ B ⊆ A.
PROOF (Informal version)


Dear READER,


No matter how you choose your sets A and B, I'll be able to show that
there is no counterexample to the statement “If x is an element of A ∩ B, then
x is an element of A.”

If your choice of A and B makes A ∩ B = ∅, then there can be no coun-
terexample because you cannot make the hypothesis true. So we will focus on
the case where A ∩ B ≠ ∅.

Choose an element of A ∩ B, and let's agree to call that element x. Since
we agree that this x is in A ∩ B, by the definition of the intersection operation ∩,
I know that two things are true: x is in A and x is in B. Notice that in particular,
we have agreed that this x is in the set A.

Therefore, no matter which element of A ∩ B you choose, I will know it
is also an element of A. But this is exactly what we mean when we say that
A ∩ B ⊆ A.


Looking forward to the next challenge,
AUTHOR


In general, when we prove the subset relationship S ⊆ T, we will not separately
address the case where S = ∅. The above reasoning establishes the general fact that
the empty set is a subset of any set; hence, there is no need to address this possibility
in every proof. We see this approach right away with the formal proof of Proposi-
tion 1.

PROOF (Formal version)

Let sets A and B be given. Let x ∈ A ∩ B be given. By the definition of ∩,
we know that x ∈ A and x ∈ B. In particular, we know that x ∈ A. Thus, every
element of A ∩ B is also an element of A, so A ∩ B ⊆ A.



















In a way, this proof simply explains in detail how our careful set definitions
logically support the obvious fact that A ∩ B ⊆ A. Intuitively, everything that is in
the overlap of sets must be in the first set, so this result is not surprising. The proof
simply explains the result using careful mathematical language and the deductive
argument structure that we learned about in the previous chapter.


Practice Problem 1 Another obvious property of any sets A and B is that A ⊆
A ∪ B.


(a) Write a letter to the READER to convince her that this property is true.
(b) Write a more formal proof of the property.
(c) Explain what the property says in nonmathematical terms.


The following proposition gives us more practice with the technique of element-
wise set proofs, but it differs from the previous proposition in a crucial way. The
READER is not choosing just any given sets A and B as before, but instead she is
choosing sets that satisfy the particular property A ∪ B ⊆ B. When proving this type
of statement, we will be using even more of the ideas about proofs that we learned
about in the previous chapter.





Proposition 2 If (A ∪ B) ⊆ B, then A ⊆ (A ∩ B).
PROOF (Detailed version) To help make the proof clear, we will number the
paragraphs.


1. Suppose that the READER has chosen sets A and B satisfying the hypothesis
of the proposition (i.e., (A ∪ B) ⊆ B). We must convince her that A ⊆ (A ∩ B).
This means we must demonstrate that every element of A is also an element
of A ∩ B.

2. To do this, we invite the READER to choose any element of A, and we agree to
call that element x.

3. Since we know that x ∈ A, the definition of union tells us that x ∈ A ∪ B.
(Remember that to be in the union, an element just has to be in either one of
the sets.)

4. From paragraph 3 we know that x ∈ A ∪ B. From paragraph 1 we know that
(A ∪ B) ⊆ B. By the definition of subset, everything in A ∪ B is also in B.
Therefore, x ∈ B.

5. From paragraph 2 we know that x ∈ A. From paragraph 4 we know that x ∈ B.
So x ∈ A ∩ B, by the definition of intersection.

6. We have demonstrated that every element of A is also an element of A ∩ B,
so the proof is complete.


PROOF (Less detailed version) Let sets A and B be given satisfying A ∪ B ⊆ B,
and let x ∈ A be given. (We must show that x ∈ A ∩ B.) Since x ∈ A, clearly
x ∈ A ∪ B. But since A ∪ B ⊆ B, this implies that x ∈ B. Thus, x ∈ A and also
x ∈ B, so x ∈ A ∩ B. Therefore, A ⊆ A ∩ B.
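
A statement like Proposition 2 can be tested by playing the part of the READER with a computer: try every pair of sets drawn from a small universe and look for a counterexample. The Python sketch below does this for subsets of {1, 2, 3}. The universe and the helper name subsets are our own choices. Finding no counterexample among these 64 pairs is evidence, not a proof; the proof is the element-wise argument above.

```python
from itertools import combinations


def subsets(universe):
    """All subsets of a finite set, as frozensets."""
    items = sorted(universe)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


U = {1, 2, 3}  # a small universe, chosen only for illustration

# A counterexample would make the hypothesis true and the conclusion false.
counterexamples = [
    (A, B)
    for A in subsets(U)
    for B in subsets(U)
    if (A | B).issubset(B) and not A.issubset(A & B)
]
print(counterexamples)
```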







Practice Problem 2 Show that if A ⊆ B and B ⊆ C, then A ⊆ C. Be as formal or
informal in your writing as you would like, but be sure to include every logical detail.


The next proposition and practice problem together establish an important prop-
erty of intersection and union operations, the “distributive property”:
A ∩ (B ∪ C) = (A ∩ B) ∪ (A ∩ C). (Notice the similarity to the familiar distributive
property for the multiplication and addition operations on numbers:
a · (b + c) = (a · b) + (a · c).)

In order to prove this property, we first need a formal definition of the meaning
of “=” for sets. The intuitive meaning of two sets being equal is that they contain the
same elements. In order to build on the terms and proof techniques that we already
know, we will use the following informal strategy to prove set equality:














To show that two sets are equal, we show
that each is a subset of the other.
This often requires two separate
proofs.


Why is this? Well, sets being equal means that they contain the same elements. So
every element of the first must be an element of the second, and every element of
the second must be an element of the first. We take this explanation as our formal
definition of set equality.


Definition Two sets A and B are equal, written A = B, if A ⊆ B and
B ⊆ A.





From this point of view, the distributive property for sets, written

A ∩ (B ∪ C) = (A ∩ B) ∪ (A ∩ C),

is actually equivalent to the two separate properties

A ∩ (B ∪ C) ⊆ (A ∩ B) ∪ (A ∩ C) and (A ∩ B) ∪ (A ∩ C) ⊆ A ∩ (B ∪ C).

These properties will be proven separately, one as the proposition below and the
other as the practice problem that immediately follows. Notice that the proof of
the proposition uses the “division into cases” technique that we first encountered in
Section 2.2.


Proposition 3 A ∩ (B ∪ C) ⊆ (A ∩ B) ∪ (A ∩ C).
PROOF Let sets A, B, and C be given, and let x ∈ A ∩ (B ∪ C) be given. By the
definition of intersection (∩), we know that x ∈ A and x ∈ B ∪ C. By the definition
of union (∪), either x ∈ B or x ∈ C, so we consider each possibility as a separate
case. (Remember that we know that x ∈ A for sure in either case.)


• If it happens that the given x is in B, we now know that x ∈ A and x ∈ B, from
which it follows by the definition of intersection (∩) that x ∈ A ∩ B. This is
enough (by Practice Problem 1) to guarantee that x ∈ (A ∩ B) ∪ (A ∩ C).

• If, on the other hand, it happens that the given x is in C, then we know that
x ∈ A and x ∈ C, from which it follows by the definition of intersection (∩)
that x ∈ A ∩ C. This is enough (by Practice Problem 1) to guarantee that x ∈
(A ∩ B) ∪ (A ∩ C).


Since in either case we have the desired result, x ∈ (A ∩ B) ∪ (A ∩ C), we
can conclude that A ∩ (B ∪ C) ⊆ (A ∩ B) ∪ (A ∩ C).


Practice Problem 3 Prove that (A ∩ B) ∪ (A ∩ C) ⊆ A ∩ (B ∪ C).


We are finally ready to conclude that the distributive property is true of the set
intersection and union operations. All the hard work has already been done.


Theorem 4 (Distributive Property) For all sets A, B, and C,
A ∩ (B ∪ C) = (A ∩ B) ∪ (A ∩ C).
PROOF To show that A ∩ (B ∪ C) = (A ∩ B) ∪ (A ∩ C), we must show that
each is a subset of the other. In Proposition 3 we proved that A ∩ (B ∪ C) ⊆
(A ∩ B) ∪ (A ∩ C), and in Practice Problem 3 we proved that (A ∩ B) ∪ (A ∩ C)
⊆ A ∩ (B ∪ C).
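
A different way to double-check Theorem 4 is a membership table, which is not the element-wise technique used above. For an arbitrary element x there are only eight possibilities for whether x is in A, in B, and in C, and in each one we can compare membership in the left-hand set with membership in the right-hand set. The Python sketch below tabulates those eight cases; the variable names are our own.

```python
from itertools import product

rows = []
for in_A, in_B, in_C in product([False, True], repeat=3):
    # Is x in A ∩ (B ∪ C)?
    left = in_A and (in_B or in_C)
    # Is x in (A ∩ B) ∪ (A ∩ C)?
    right = (in_A and in_B) or (in_A and in_C)
    rows.append((in_A, in_B, in_C, left, right))

all_agree = all(left == right for (_, _, _, left, right) in rows)
for row in rows:
    print(row)
print(all_agree)
```

The two cases in the proof of Proposition 3 correspond to the rows in which x is in A and in B, and the rows in which x is in A and in C.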





Our final example of an element-wise set proof involves an “if and only if”
statement in order to show yet another level of complexity in proof structure. In
Section 1.6, we saw that a statement of the form “p if and only if q” is called a
biconditional statement because it is equivalent to the statement “if p, then q, and if
q, then p.” Therefore, we will prove a statement of the form “p if and only if q” by
writing two proofs, one for the statement “if p, then q,” and another for the statement
“if q, then p.”








Proposition 5 A ⊆ B if and only if A ∩ B = A.
PROOF We must actually prove two different propositions:


1. Claim: If A ⊆ B, then A ∩ B = A. Let sets A and B be given satisfying the
hypothesis A ⊆ B. We wish to show that A ∩ B = A, so we will need to show
that both (a) A ∩ B ⊆ A and (b) A ⊆ A ∩ B.

(a) We proved in Proposition 1 that the statement A ∩ B ⊆ A is true of all
sets.

(b) To show that A ⊆ A ∩ B, we start by letting any x ∈ A be given. Since
the sets A and B satisfy the hypothesis A ⊆ B, it follows that x ∈ B, and
consequently (since x is in A and in B) that x ∈ A ∩ B. Hence, A ⊆ A ∩ B.

Since we have shown that A ⊆ A ∩ B and A ∩ B ⊆ A, we conclude (under
the hypothesis that A ⊆ B) that A ∩ B = A.

2. Claim: If A ∩ B = A, then A ⊆ B. Let sets A and B be given such that A ∩ B =
A. Let any x ∈ A be given. Since A ∩ B = A, it follows that x ∈ A ∩ B, and
consequently that x ∈ B. Since we have shown that any x in A must also be in
B, we conclude that A ⊆ B.






















The result in Proposition 5 establishes an important connection between the ⊆
relationship and the ∩ operation. In particular, it gives us a way to express the subset
relationship as an equation involving set intersection.





Proving New Properties from Old 


We will next see how proofs about sets can be based on algebraic properties of set
equations. This means that once we establish some basic properties, we can then prove
new properties using these earlier results. This will allow us to avoid using an
element-wise proof structure that clearly gets more cumbersome with every additional
layer of complexity in the statements being proven.

So once we have a core of basic rules, we can use this type of proof technique to
derive even more results without having to use the “element-wise proof” structure
at all. Theorem 6 below is a particularly useful collection because it consists of set
properties with a flavor of algebraic manipulation rules. Theorem 4 above, for
example, shows up as one of the versions of the distributive property in Theorem 6. You
will be asked to prove some of the other properties in the exercises, while we focus
the remainder of this section on using these properties to create proofs about sets
using algebraic manipulation.








Theorem 6 For sets A, B, and C, the universal set U, and the empty set ∅, the
properties shown in Table 3-6 hold.


The two-column organization of Theorem 6 is worth taking a moment to discuss.
Note that most of the properties listed have two very similar versions, one given in
each column. In fact, the two versions are exactly the same with some standard
changes.





Example 3 For each of properties (a), (b), (c), (g), and (j), verify that inter-
changing the symbols ∪ and ∩ changes the first version of the property into the second
version. What additional changes need to be made in the other properties?


SOLUTION Interchanging ∩ and ∪ and interchanging U and ∅ will turn every
first-column version into the second-column version.


(a) Commutative               | A ∩ B = B ∩ A                   | A ∪ B = B ∪ A
(b) Associative               | (A ∩ B) ∩ C = A ∩ (B ∩ C)       | (A ∪ B) ∪ C = A ∪ (B ∪ C)
(c) Distributive              | A ∩ (B ∪ C) = (A ∩ B) ∪ (A ∩ C) | A ∪ (B ∩ C) = (A ∪ B) ∩ (A ∪ C)
(d) Identity                  | A ∪ ∅ = A                       | A ∩ U = A
(e) Negation                  | A ∪ A' = U                      | A ∩ A' = ∅
(f) Double negative           | (A')' = A                       |
(g) Idempotent                | A ∪ A = A                       | A ∩ A = A
(h) DeMorgan's laws           | (A ∩ B)' = A' ∪ B'              | (A ∪ B)' = A' ∩ B'
(i) Universal bound           | A ∪ U = U                       | A ∩ ∅ = ∅
(j) Absorption                | A ∩ (A ∪ B) = A                 | A ∪ (A ∩ B) = A
(k) Complements of U and ∅    | U' = ∅                          | ∅' = U
(l) Complement and negation   | A − B = A ∩ B'                  |

Table 3-6 Properties of Set Operations
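
Several rows of Table 3-6 involve the complement, which only makes sense relative to a universal set. The Python sketch below picks a small universal set (our choice, purely for illustration), defines the complement relative to it, and tests a few rows of the table on every subset. The helper names complement and all_subsets are our own. As before, an exhaustive check on one small universe is a sanity check rather than a proof.

```python
from itertools import combinations

U = frozenset({1, 2, 3})  # a small universal set, chosen only for illustration
EMPTY = frozenset()


def complement(X):
    """Complement of X relative to the universal set U."""
    return U - X


def all_subsets(S):
    items = sorted(S)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


SUBSETS = all_subsets(U)

checks = {
    "(d) identity": all((A | EMPTY) == A and (A & U) == A for A in SUBSETS),
    "(e) negation": all((A | complement(A)) == U and (A & complement(A)) == EMPTY
                        for A in SUBSETS),
    "(h) DeMorgan": all(complement(A & B) == (complement(A) | complement(B))
                        and complement(A | B) == (complement(A) & complement(B))
                        for A in SUBSETS for B in SUBSETS),
    "(j) absorption": all((A & (A | B)) == A and (A | (A & B)) == A
                          for A in SUBSETS for B in SUBSETS),
    "(l) difference": all((A - B) == (A & complement(B))
                          for A in SUBSETS for B in SUBSETS),
}
for name, holds in checks.items():
    print(name, holds)
```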







The above observation leads us to formalize a principle that will not only account
for the organization of Theorem 6, but more generally will give us a “free” result for
every one we prove.


Definition For any expression involving the set operations ∩, ∪, and ′
together with the sets ∅ and U, the dual expression is obtained by interchanging
every ∩ and ∪ and by interchanging every U and ∅.


Theorem 7 (Duality Principle) For any true equation of two set
expressions, the equation obtained by replacing each expression by its dual
expression is also true.
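
Forming a dual is a purely symbol-by-symbol operation, so it is easy to mechanize. In the Python sketch below, an expression is written as a string that uses the characters ∩ and ∪ for the operations, the letter U for the universal set, and ∅ for the empty set. The function name dual is ours, and the sketch assumes that no other set in the expression is named U.

```python
# Swap ∩ with ∪ and U with ∅; every other character is left alone.
SWAP = str.maketrans({"∩": "∪", "∪": "∩", "U": "∅", "∅": "U"})


def dual(expression):
    """Return the dual of a set expression written as a string."""
    return expression.translate(SWAP)


print(dual("A ∪ U = U"))
print(dual("A ∩ (B ∪ C) = (A ∩ B) ∪ (A ∩ C)"))
```

Applying dual twice returns the original expression, which matches the fact that the two columns of Table 3-6 are duals of each other.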











We will address more on duality in the exercises for this section. We complete
the section by illustrating how Theorem 6 can be used to prove other properties
about sets.

In our first example below, we are to show that two sets are equal, where one
of the sets is given by a fairly complicated expression and the other is much simpler.
For this type of problem, a common technique is to start with the more complicated
expression and simplify it as much as possible, justifying each step with properties
from Theorem 6.


Example 4 Assume the properties of Theorem 6 are all true. Use them to prove
that for all sets A and B, A ∪ (A' ∩ B) = A ∪ B. What is the dual equation that must
also be true by the duality principle?
SOLUTION We start with A ∪ (A' ∩ B), simplifying until we obtain A ∪ B.
Here are the steps:


A ∪ (A' ∩ B) = (A ∪ A') ∩ (A ∪ B)    (c, distributive)
             = U ∩ (A ∪ B)           (e, negation)
             = (A ∪ B) ∩ U           (a, commutative)
             = A ∪ B                 (d, identity)


The dual of this equation is A ∩ (A' ∪ B) = A ∩ B. By the duality principle, we
can conclude that this is also true.


Practice Problem 4 Which properties of Theorem 6 justify each step of the fol-
lowing proof that A ∩ (B' ∩ A)' = A ∩ B? What is the dual equation that must also
be true?

A ∩ (B' ∩ A)' = A ∩ ((B')' ∪ A')
              = A ∩ (B ∪ A')
              = (A ∩ B) ∪ (A ∩ A')
              = (A ∩ B) ∪ ∅
              = A ∩ B
















In the next example of this proof technique, both sides of the proposed equation
are fairly complicated, so we will simplify each side as much as possible, hoping that
they simplify to the same thing. (It is worth noting in this example that we are
implicitly using the associative property of “union” when we write expressions such
as A' ∪ B ∪ C' rather than (A' ∪ B) ∪ C' or A' ∪ (B ∪ C'). We do this simply to avoid
an unnecessary level of parentheses in our expressions.)











Example 5 Assume the properties of Theorem 6 are all true. Use them to prove
that for all sets A, B, and C,

(A ∩ B) ∪ (A' ∪ C)' = (A ∪ B) ∩ (A ∪ B') ∩ (A' ∪ B ∪ C')

SOLUTION First, we take the expression on the left-hand side, (A ∩ B) ∪
(A' ∪ C)', and we simplify it as much as possible:

(A ∩ B) ∪ (A' ∪ C)' = (A ∩ B) ∪ ((A')' ∩ C')   (h, DeMorgan's)
                    = (A ∩ B) ∪ (A ∩ C')       (f, double negative)
                    = A ∩ (B ∪ C')             (c, distributive)

Next, we take the expression on the right-hand side, (A ∪ B) ∩ (A ∪ B') ∩
(A' ∪ B ∪ C'), and again we simplify it as much as possible:

(A ∪ B) ∩ (A ∪ B') ∩ (A' ∪ B ∪ C')
    = (A ∪ (B ∩ B')) ∩ (A' ∪ B ∪ C')   (c, distributive)
    = (A ∪ ∅) ∩ (A' ∪ B ∪ C')          (e, negation)
    = A ∩ (A' ∪ B ∪ C')                (d, identity)
    = (A ∩ A') ∪ (A ∩ (B ∪ C'))        (c, distributive)
    = ∅ ∪ (A ∩ (B ∪ C'))               (e, negation)
    = A ∩ (B ∪ C')                     (d, identity)

Because both sides of the original equation simplify to the same expression,
namely A ∩ (B ∪ C'), we conclude that they are equal, meaning the original
equation is true.
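
The claim that both sides of Example 5 reduce to A ∩ (B ∪ C') can also be spot-checked by enumeration. This is a minimal sketch under stated assumptions: the universe {0, 1, 2} is arbitrary, and the function names `lhs`, `rhs`, and `simplified` are made up for illustration.

```python
from itertools import combinations, product

U = frozenset(range(3))  # assumed universe; any small finite set would do


def subsets(universe):
    """Return every subset of `universe` as a frozenset."""
    items = list(universe)
    return [frozenset(c)
            for r in range(len(items) + 1)
            for c in combinations(items, r)]


def comp(s):
    """Complement of s relative to U."""
    return U - s


def lhs(a, b, c):
    # (A ∩ B) ∪ (A' ∪ C)'
    return (a & b) | comp(comp(a) | c)


def rhs(a, b, c):
    # (A ∪ B) ∩ (A ∪ B') ∩ (A' ∪ B ∪ C')
    return (a | b) & (a | comp(b)) & (comp(a) | b | comp(c))


def simplified(a, b, c):
    # A ∩ (B ∪ C'), the common simplified form
    return a & (b | comp(c))


ALL = subsets(U)
mismatches = [(a, b, c) for a, b, c in product(ALL, repeat=3)
              if not (lhs(a, b, c) == simplified(a, b, c) == rhs(a, b, c))]
print(len(mismatches))  # 0
```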


Practice Problem 5 Using properties of Theorem 6, simplify each side as much
as possible to show that the following equation is true:

(A ∩ B) ∪ (A' ∪ B)' = ((A ∩ A)' ∩ (A' ∪ B))'

What is the dual equation?


Finally, we will examine the proof of an if, then statement relating two partic- 
ular relationships between arbitrary sets A and B. In this case, we follow the same 
reasoning as with all direct proofs of if, then statements. We invite the READER to 
form an example satisfying the hypothesis, and we demonstrate (using properties 
from Theorem 6) that such an example must also satisfy the conclusion. 










Proposition 8 If A ⊆ B, then A ∪ B = B.
PROOF Let sets A and B be given such that A ⊆ B. By Proposition 5, this means
that A ∩ B = A. Since these are the same set, taking the union of each with B will
result in the same set. That is,

A ∪ B = (A ∩ B) ∪ B

But we can use properties from Theorem 6 to simplify this as follows:

(A ∩ B) ∪ B = B ∪ (A ∩ B)   by commutativity
            = B ∪ (B ∩ A)   by commutativity
            = B             by absorption

Since A ∪ B = (A ∩ B) ∪ B and (A ∩ B) ∪ B = B, we conclude that

A ∪ B = B ∎
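
To see the chain of equalities in this proof on a concrete instance, the following sketch traces each step with Python sets. The particular sets A and B are assumptions chosen so that A ⊆ B; the variable names `step0` to `step3` are illustrative only.

```python
A = {1, 2}
B = {1, 2, 3, 4}          # assumed example with A ⊆ B
assert A <= B             # hypothesis of Proposition 8

step0 = A | B             # A ∪ B
step1 = (A & B) | B       # replace A by A ∩ B (equal to A by Proposition 5)
step2 = B | (A & B)       # commutativity of ∪
step3 = B | (B & A)       # commutativity of ∩

# Absorption says B ∪ (B ∩ A) = B, so every step equals B.
print(step0 == step1 == step2 == step3 == B)  # True
```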





Solutions to Practice Problems

1. (a) The text of the letter might go something like this:
You choose the two sets A and B. To show you that
A ⊆ A ∪ B, I have to convince you that every
element of A is also in A ∪ B. But if you choose an
element (call it x) in set A, I'll automatically know
it's in A ∪ B. Why? To be in the union, all you have
to do is be in either one of the two sets, and your x
certainly satisfies that condition.

(b) Let sets A and B be given, and let x ∈ A be given. By
the definition of union, to establish that x ∈ A ∪ B,
we need only show either that x ∈ A or that x ∈ B.
Since we do know that x ∈ A, this shows that x ∈
A ∪ B.

(c) In plain terms, the property A ⊆ (A ∪ B) means that
joining together A with any set B produces a set that
must contain all the elements of A.

2. Suppose that the READER has chosen sets A, B, and C
satisfying the hypothesis of the theorem (i.e., A ⊆ B and
also B ⊆ C). We must convince him that A ⊆ C. This
means we must demonstrate that every element of A is
also an element of C.

So we suppose the READER has chosen a particular
element x of A, and we must convince him that x is also
an element of C. We reason as follows:

Since x is an element of A, and since every element
of A is also an element of B (because A ⊆ B), x must
be an element of B.

Now we know that x is an element of B and that every
element of B is also an element of C (because B ⊆ C),
and therefore x must be an element of C.

We have shown that no matter which element x of A is
chosen, x must also be an element of C. This is
precisely what we mean by the statement A ⊆ C, and so we
have proved that A ⊆ C.








3. Let sets A, B, and C be given, and let x ∈ (A ∩ B) ∪
(A ∩ C) be given. By the definition of ∪, either x ∈
A ∩ B or x ∈ A ∩ C.
• If it happens that x ∈ A ∩ B, we know that x ∈ A and
x ∈ B (from the definition of ∩). Since x ∈ B, the
definition of ∪ tells us that x ∈ B ∪ C. Now because x ∈ A
and x ∈ B ∪ C, we conclude from the definition of ∩
that x ∈ A ∩ (B ∪ C).

• If, on the other hand, it happens that x ∈ A ∩ C, we
know that x ∈ A and x ∈ C (from the definition of ∩).
Since x ∈ C, the definition of ∪ tells us that x ∈ B ∪ C.
Now because x ∈ A and x ∈ B ∪ C, we conclude from
the definition of ∩ that x ∈ A ∩ (B ∪ C).

Thus, in either case we have the desired result x ∈ A ∩
(B ∪ C), and we conclude that (A ∩ B) ∪ (A ∩ C) ⊆
A ∩ (B ∪ C).

4. (h) DeMorgan's, (f) double negative, (c) distributive, (e)
negation, and (d) identity. The dual of the proven
equation is A ∪ (B' ∪ A)' = A ∪ B, which we can conclude is
also true by the duality principle.

5. The left side simplifies to

(A ∩ B) ∪ (A' ∪ B)' = (A ∩ B) ∪ ((A')' ∩ B')
                    = (A ∩ B) ∪ (A ∩ B')
                    = A ∩ (B ∪ B')
                    = A ∩ U
                    = A

The right side simplifies to

((A ∩ A)' ∩ (A' ∪ B))' = (A' ∩ (A' ∪ B))'
                       = (A')'
                       = A

The expressions are the same set. The dual equation,
which is also true by the duality principle, is

(A ∪ B) ∩ (A' ∩ B)' = ((A ∪ A)' ∪ (A' ∩ B))'


Exercises for Section 3.3


1. Decide which of the following statements are true. For
each true statement, explain why in a letter from the
AUTHOR to the READER. For each false statement, give
a specific example to support your answer.

(a) {1, 3, 5, 7, 9} ⊆ {k ∈ N : k is odd}
(b) {k ∈ N : k ends in 0} ⊆ {k ∈ N : k is divisible by 5}
(c) {k ∈ N : k is prime} ⊆ {k ∈ N : k is odd}
(D (xeRix<l) Cle Ré <1} 
2. Prove each of the following statements about specific 
sets: 
(a) Hm:meZ| c Qn:ne zi 
(b ám 1:meZI Cn 1:nez| 
(c) Z ⊆ Q (Use the definition of Q given in Section 3.1
in your proof.)
(D (2n--1:n e ZI n Bm+1:meZ}) C (6e: 
kez} 
(e) (Qn--1:n eZ) O{Sm+4:meZ}) C (1069: 
kez) 
3. Prove each of the following statements about specific 
sets: 
(a) If a is divisible by b, then [a-m:m € Z) © (b-n: 
nez) 
O) (pi - 1:n eZ) nk: ke ZI) € m: mez) 
(©) (Ip: pisa prime number] n (€ — 1: k € N)) = 
BI 
D ((a? + :a be N] nBk:ke NI) 
NI 
4. Fill in the missing steps in each of the following
element-wise proofs:

(a) Proposition For all sets A and B, A ∩ B ⊆ B.
Proof Let A and B be given. Let x ∈ A ∩ B be
given.

So x ∈ B. Therefore, A ∩ B ⊆ B. ∎
(b) Proposition For all sets A and B, B ⊆ A ∪ B.
Proof Let A and B be given. Let x ∈ B be given.

So x ∈ A ∪ B. Therefore, B ⊆ A ∪ B. ∎
(c) Proposition If A ⊆ B, then A ∪ B ⊆ B.




Proof Let A and B be given, and assume that
A ⊆ B. Let x ∈ A ∪ B be given. We consider the
two possible cases, either x ∈ A or x ∈ B.

So in either case x ∈ B. Therefore, A ∪ B ⊆ B. ∎

5. Fill in the missing steps in the following element-wise
proof.
Proposition If A ⊆ B and A ⊆ C, then A ⊆ (B ∩ C).
Proof Let A, B, and C be given, and assume that
A ⊆ B and A ⊆ C. Let x ∈ A be given.

So x ∈ B ∩ C. Therefore, A ⊆ (B ∩ C). ∎
6. Exercise 5 gives us the following strategy for proving
statements of the form "A ⊆ (B ∩ C)": Do two separate
proofs, one to show A ⊆ B and another to show A ⊆ C,
and then cite Exercise 5 to justify the final conclusion.
Use this strategy to prove the following statements:
12) (10n— 1:2 eZ} C (De 1:ke ZI nm4 





mezj 

(b) (6-1: ke ZI C (Qn ine Z) nmt: 
meZ) 

(© ire R:x <4 C (lve Rix <2) nee Rin 
2) 


7. Fill in the missing steps in the following element-wise
proof:
Proposition If A ⊆ C and B ⊆ C, then A ∪ B ⊆ C.
Proof Let A, B, and C be given, and assume that
A ⊆ C and B ⊆ C. Let x ∈ A ∪ B be given. We consider
the two possible cases, either x ∈ A or x ∈ B.
Case 1: Suppose x ∈ A.

Therefore, x ∈ C.
Case 2: Suppose x ∈ B.

Therefore, x ∈ C.

So in either case, x ∈ C. Therefore, A ∪ B ⊆ C. ∎

Exercises 4(b) and 4(c) together establish that if A ⊆ B, then A ∪ B = B.

8. Exercise 7 gives us the following strategy for proving
statements of the form "(A ∪ B) ⊆ C": Do two separate
proofs, one to show A ⊆ C and another to show
B ⊆ C, and then cite Exercise 7 to justify the final
conclusion. Use this strategy to prove the following
statements:


(a) (H1: ke ZIUdm-3:me ZI) S n1: 


nez) 

(b) ((6k+1:kEZ]U 6m- 1 :meZ)) € n1: 
nezi 

(©) (Ire R:x «-3]Ulx eR ix» 3) C lee: 
x9 


(@) (Ice R: x « -l)U( eR: 3) C lx eR: 
35-2r—320] (Hint: It might help to know 
that algebraically i? — 2r — 3 = (x — 1 — 4.) 

9. Fill in the missing steps in each of the following
element-wise proofs:

(a) Proposition A ∪ (B ∩ C) ⊆ (A ∪ B) ∩ (A ∪ C).
Proof Let A, B, and C be given, and let x ∈
A ∪ (B ∩ C) be given. We consider the two
possible cases, either x ∈ A or x ∈ B ∩ C.

So in either case, x ∈ (A ∪ B) ∩ (A ∪ C).
Therefore, A ∪ (B ∩ C) ⊆ (A ∪ B) ∩ (A ∪ C). ∎
(b) Proposition (A ∪ B) ∩ (A ∪ C) ⊆ A ∪ (B ∩ C).
Proof Let A, B, and C be given, and let x ∈
(A ∪ B) ∩ (A ∪ C) be given. Then x ∈ A ∪ B and
also x ∈ A ∪ C. We consider two possibilities: either
x ∈ A or x ∉ A.

So in either case, x ∈ A ∪ (B ∩ C). Therefore,
(A ∪ B) ∩ (A ∪ C) ⊆ A ∪ (B ∩ C). ∎
10. Give element-wise proofs of the following properties
from Theorem 6:
(a) (Commutative property) A ∪ B = B ∪ A
(b) (Distributive property) A ∩ (B ∪ C) = (A ∩ B) ∪
(A ∩ C)
(c) (Absorption property) A ∩ (A ∪ B) = A
(d) (DeMorgan's law) (A ∪ B)' = A' ∩ B'
11. Give element-wise proofs for the following:
(a) If A ∪ B = B, then A ∩ B = A.
(b) If A ∩ B = A, then A ∪ B = B.
(c) If A ∩ B = A and B ∩ C = B, then A ∩ C = A.
(d) If A ∪ B = B and B ∪ C = C, then A ∪ C = C.
12. When proving a statement of the form "S = ∅," one
effective strategy is to use proof by contradiction.
Specifically, begin with the assumption that there is some
element x ∈ S, and argue that some absurdity results. Try
this type of reasoning on the following statements to be
proven:
(a) (2k+1:kEN) n He: keN] 2 
O) (GE 1:ke ZI n (6k 5:kez) - n 














19 (f, y) ERR: 
Rx » 
@ IA € B hen A NB — 
13, Give element-wise proofs for the following: 
(0) ITA B= A then A UB =U. 
(b) ITA C B, then BY C A 
(e) IEA € B. then A U (8 A) = B. 
14, Prove the following by quoting the appropriate parts of 
F Theorem 6: 
) (A UU) (AUB) =A 
(b) A ∩ (A' ∪ B) = A ∩ B
(c) A ∪ (A' ∩ B) = A ∪ B
(d) (A ∪ B) ∩ (B ∪ C) = (A ∩ C) ∪ B
(e) (A ∪ B) ∩ (A' ∩ C)' = A ∪ (B ∩ C')
15. Form the dual of each equation in the previous exercise.
16. It turns out that if we have given an element-wise proof
of parts (a), (c), (d), and (e) of Theorem 6, all the other
parts can be proved by reference to those four parts.
Justify each step of the following proofs by quoting parts
(a), (c), (d), and (e) of the theorem. Once you have
proved a property, you can use it or its dual in later
proofs.
(a) Prove part (g), idempotent: A ∩ A = A
A = A ∩ U
  = A ∩ (A ∪ A')
  = (A ∩ A) ∪ (A ∩ A')
  = (A ∩ A) ∪ ∅
  = A ∩ A





-2:x-3-yniG.y)e 























(b) You can use parts (a), (c), (d), (e), and (g). Prove
part (i), universal bound: A ∪ U = U
A ∪ U = (A ∪ U) ∩ U
      = (A ∪ U) ∩ (A ∪ A')
      = A ∪ (U ∩ A')
      = A ∪ (A' ∩ U)
      = A ∪ A'
      = U








(c) Now you can use parts (a), (c), (d), (e), (g), and (i)
to prove part (j), absorption: A ∩ (A ∪ B) = A
A ∩ (A ∪ B) = (A ∪ ∅) ∩ (A ∪ B)
            = A ∪ (∅ ∩ B)
            = A ∪ (B ∩ ∅)
            = A ∪ ∅
            = A

Exercises 9(a) and 9(b) together establish that A ∪ (B ∩ C) = (A ∪ B) ∩ (A ∪ C).


17. We did not prove the duality principle, but we can get
an idea about why it is true by considering the proofs in
the previous exercise.
(a) Prove that A ∪ A = A by taking the proof of part
(a) above and replacing every line with its dual. How
do the reasons for each step change?
(b) Prove that A ∩ ∅ = ∅ by taking the proof of (b)
above and replacing every line with its dual. How
do the reasons for each step change?
(c) Prove that A ∪ (A ∩ B) = A by taking the proof of
(c) above and replacing every line with its dual. How
do the reasons for each step change?
18. Prove the following by quoting the parts of Theorem 6:
(a) If A ∪ B = B, then A ∩ B = A.
(b) If A ∩ B = A, then A ∪ B = B.
(c) If A ∩ B = A, then A' ∪ B = U.
19. Prove the following by quoting the parts of Theorem 6:
(a) If A' ∪ B = U, then A ∩ B' = ∅.
(b) If A ∪ B = B, then A ∪ (B ∩ A') = B.
(c) If A ∩ B = A, then B ∩ (B ∩ A')' = A.
20. Prove each of the following statements:
(a) P(A ∩ B) ⊆ P(A) ∩ P(B) (Use the strategy
from Exercise 6.)
(b) P(A) ∪ P(B) ⊆ P(A ∪ B) (Use the strategy from
Exercise 8.)
(c) A × ∅ = ∅ (Use the strategy from Exercise 12.)
21. Prove each of the following statements:
(a) If A ⊆ B, then (A × C) ⊆ (B × C).
(b) If A ⊆ B, then P(A) ⊆ P(B).
22. Prove each of the following statements:
(a) (A ∩ B) × C = (A × C) ∩ (B × C)
(b) (A ∪ B) × C = (A × C) ∪ (B × C)
23. Prove each of the following statements:
(a) P(A) ∩ P(B) = P(A ∩ B)
(b) P(A) ∪ P(B) ⊆ P(A ∪ B)
(c) Give an example to show that the statement P(A) ∪
P(B) = P(A ∪ B) is not necessarily true.


3.4 Boolean Algebra 





In the remainder of this chapter, we will consider the important notion of Boolean 
algebra as a thread uniting concepts about “logic systems" (from Section 1.6) with 
concepts about “set systems” (from Section 3.3), and we will see how in this case 
the use of abstraction actually makes it easier to discover and understand new 
properties of these more concrete systems. We conclude the chapter with a tra- 
ditional computer science application of Boolean algebra to logic circuit design 
issues. 

One of the most remarkable properties of the study of mathematics is that we
frequently discover essentially identical patterns in widely diverse subject matter. In
fact, we have already encountered an important example of this phenomenon. Take
a look at this theorem from Section 1.6, which describes properties of the logical
connectives and, or, and not.








Theorem 2 (Section 1.6) again Let p, q, and r be propositions, and let t 
indicate a tautology and c a contradiction. The logical equivalences shown in Table 
3-7 on page 222 hold. 


Compare this with Theorem 6 of the previous section, which describes properties
of the set operations intersection, union, and complement.


Theorem 6 (Section 3.3) again For sets A, B, and C, the universal set U,
and the empty set ∅, the properties shown in Table 3-8 on page 222 hold.





(a) Commutative            p ∧ q ≡ q ∧ p                      p ∨ q ≡ q ∨ p
(b) Associative            (p ∧ q) ∧ r ≡ p ∧ (q ∧ r)          (p ∨ q) ∨ r ≡ p ∨ (q ∨ r)
(c) Distributive           p ∧ (q ∨ r) ≡ (p ∧ q) ∨ (p ∧ r)    p ∨ (q ∧ r) ≡ (p ∨ q) ∧ (p ∨ r)
(d) Identity               p ∧ t ≡ p                          p ∨ c ≡ p
(e) Negation               p ∨ ¬p ≡ t                         p ∧ ¬p ≡ c
(f) Double negative        ¬(¬p) ≡ p
(g) Idempotent             p ∧ p ≡ p                          p ∨ p ≡ p
(h) DeMorgan's laws        ¬(p ∧ q) ≡ ¬p ∨ ¬q                 ¬(p ∨ q) ≡ ¬p ∧ ¬q
(i) Universal bound        p ∨ t ≡ t                          p ∧ c ≡ c
(j) Absorption             p ∧ (p ∨ q) ≡ p                    p ∨ (p ∧ q) ≡ p
(k) Negations of t and c   ¬t ≡ c                             ¬c ≡ t

Table 3-7 Logical Equivalences

(a) Commutative            A ∩ B = B ∩ A                      A ∪ B = B ∪ A
(b) Associative            (A ∩ B) ∩ C = A ∩ (B ∩ C)          (A ∪ B) ∪ C = A ∪ (B ∪ C)
(c) Distributive           A ∩ (B ∪ C) = (A ∩ B) ∪ (A ∩ C)    A ∪ (B ∩ C) = (A ∪ B) ∩ (A ∪ C)
(d) Identity               A ∩ U = A                          A ∪ ∅ = A
(e) Negation               A ∪ A' = U                         A ∩ A' = ∅
(f) Double negative        (A')' = A
(g) Idempotent             A ∩ A = A                          A ∪ A = A
(h) DeMorgan's laws        (A ∩ B)' = A' ∪ B'                 (A ∪ B)' = A' ∩ B'
(i) Universal bound        A ∪ U = U                          A ∩ ∅ = ∅
(j) Absorption             A ∩ (A ∪ B) = A                    A ∪ (A ∩ B) = A
(k) Complements of U and ∅ U' = ∅                             ∅' = U

Table 3-8 Set Equivalences


These theorems are remarkably similar. In fact, simply making the following
mechanical changes:

• From p, q, r to A, B, C      • From ≡ to =
• From ∧ to ∩                  • From t to U
• From ∨ to ∪                  • From c to ∅
• From ¬ to '

translates the first theorem into the second. Similarly, any logical expression can
be converted to a related expression from set theory (and vice versa), as the next
examples illustrate.


Example 1 Convert these logical expressions/equivalences to set theory
notation, using sets A, B, and C from the universal set U:

1. (p ∧ q) ∨ (¬q ∧ r)
2. (p ∨ ¬q) ∧ t ≡ ¬(¬p ∧ q)

SOLUTION

1. (A ∩ B) ∪ (B' ∩ C)
2. (A ∪ B') ∩ U = (A' ∩ B)'










Example 2 Convert these set theory expressions/equalities to logical notation,
using logical variables p, q, and r:

1. (A ∪ B) ∩ (C' ∪ U)
2. (A ∩ B') ∪ ∅ = (A' ∪ B)'

SOLUTION

1. (p ∨ q) ∧ (¬r ∨ t)
2. (p ∧ ¬q) ∨ c ≡ ¬(¬p ∨ q)
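
The mechanical substitution used in Examples 1 and 2 can be sketched as a small string translator. This is a rough illustration only, under several assumptions: expressions use the single-letter names p, q, r, t, c, the symbol ¬ is written directly before its operand with no space, and the function names `operand_end` and `to_set_notation` are hypothetical. The only real work is moving the prefix ¬ to a postfix prime.

```python
# Symbol-for-symbol replacements from the list of mechanical changes.
SYMBOLS = {"p": "A", "q": "B", "r": "C", "∧": "∩", "∨": "∪",
           "≡": "=", "t": "U", "c": "∅"}


def operand_end(s, i):
    """Index just past the operand starting at s[i]: a variable,
    a parenthesized group, or another negation."""
    if s[i] == "¬":
        return operand_end(s, i + 1)
    if s[i] == "(":
        depth = 0
        for j in range(i, len(s)):
            depth += (s[j] == "(") - (s[j] == ")")
            if depth == 0:
                return j + 1
        raise ValueError("unbalanced parentheses")
    return i + 1


def to_set_notation(s):
    """Translate a logical expression into set notation."""
    out, i = [], 0
    while i < len(s):
        if s[i] == "¬":
            end = operand_end(s, i + 1)
            # Translate the negated operand, then append the prime.
            out.append(to_set_notation(s[i + 1:end]) + "'")
            i = end
        else:
            out.append(SYMBOLS.get(s[i], s[i]))
            i += 1
    return "".join(out)


print(to_set_notation("(p ∧ q) ∨ (¬q ∧ r)"))         # (A ∩ B) ∪ (B' ∩ C)
print(to_set_notation("(p ∨ ¬q) ∧ t ≡ ¬(¬p ∧ q)"))   # (A ∪ B') ∩ U = (A' ∩ B)'
```

The two printed results agree with the solutions to Example 1.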


Practice Problem 1

(a) Convert (¬(p ∨ c) ∧ (¬q ∧ t)) ∨ p to set notation, using sets A and B.
(b) Convert (A' ∪ B) ∩ (B ∪ C)' = A' ∩ B' ∩ C' to logical notation, using logical
variables p, q, and r.


Here are the solutions to two earlier examples and exercises that further
illustrate the similarities between properties of logical expressions and properties of
sets.


Example 3 Verify the logical equivalence p ∨ (¬p ∧ q) ≡ p ∨ q by quoting
Theorem 2 of Section 1.6.
SOLUTION

p ∨ (¬p ∧ q) ≡ (p ∨ ¬p) ∧ (p ∨ q)   (c, distributive)
             ≡ t ∧ (p ∨ q)          (e, negation)
             ≡ (p ∨ q) ∧ t          (a, commutative)
             ≡ p ∨ q                (d, identity)


Example 4 Verify the set equality A ∪ (A' ∩ B) = A ∪ B by quoting Theorem
6 of Section 3.3.
SOLUTION

A ∪ (A' ∩ B) = (A ∪ A') ∩ (A ∪ B)   (c, distributive)
             = U ∩ (A ∪ B)          (e, negation)
             = (A ∪ B) ∩ U          (a, commutative)
             = A ∪ B                (d, identity)


In the mid-1800s the English mathematician George Boole investigated systems
with properties such as these, giving rise to the structure that today is called Boolean
algebra. It is customary in describing a Boolean algebra to use lowercase letters, to
use · and + for the operations,* and to use 0 and 1 for the special elements. It is
also customary to consider · to have higher precedence than +, reducing the need
for parentheses. This results in the list of properties satisfied by any Boolean algebra,
shown in Table 3-10. You should compare this list to the two preceding theorems,
observing the relationships summarized in Table 3-9.

                   Logic      Sets       Boolean Algebra
Variables          p, q, r    A, B, C    a, b, c
Operations         ∧          ∩          ·
                   ∨          ∪          +
                   ¬          '          '
Special elements   c          ∅          0
                   t          U          1

Table 3-9 Connections between Logic, Sets, and Boolean Algebra

Any logical expression or expression of set theory can be written using Boolean
algebra notation.


Example 5 Rewrite the logical equivalence (p ∨ ¬q) ∧ t ≡ ¬(¬p ∧ q) and the
set equality (A ∩ B') ∪ ∅ = (A' ∪ B)' using Boolean algebra notation, with
variables a and b.
SOLUTION The logical equivalence translates as (a + b') · 1 = (a'b)' and the
set equality as ab' + 0 = (a' + b)'. Observe that we have omitted some of the ·
operations and removed unnecessary parentheses in these solutions.


Practice Problem 2 Convert (¬(p ∨ c) ∧ (¬q ∧ t)) ∨ p and (A' ∪ B) ∩ (B ∪
C)' = A' ∩ B' ∩ C' to Boolean algebra notation, using variables a, b, and c.


Because the three systems have exactly the same properties, a proof that two
expressions are equal in any one of the systems automatically shows that the
corresponding equalities hold in the other two systems. This is true because a proof in any
one of the three would use exactly the same properties, in exactly the same order,
as in the other two settings. We illustrate this by giving a Boolean algebra version of
the proofs in Examples 3 and 4.


Example 6 Verify the Boolean algebra equality a + a'b = a + b by quoting
the properties of a Boolean algebra.
SOLUTION

a + (a'b) = (a + a')(a + b)   (c, distributive)
          = 1 · (a + b)       (e, negation)
          = (a + b) · 1       (a, commutative)
          = a + b             (d, identity)
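
As a sanity check on Example 6, one can evaluate both sides in the simplest Boolean algebra, the two-element one with values 0 and 1. The sketch below assumes that interpretation (· as bitwise `&`, + as bitwise `|`, and a' as `1 - a`); the helper name `NOT` is illustrative. It only confirms the equality for 0/1 values and is not a substitute for the property-based proof.

```python
from itertools import product


def NOT(a):
    """Complement in the two-element Boolean algebra {0, 1}."""
    return 1 - a


rows = []
for a, b in product((0, 1), repeat=2):
    left = a | (NOT(a) & b)   # a + a'b
    right = a | b             # a + b
    rows.append((a, b, left, right))
    print(a, b, left, right)

print(all(left == right for _, _, left, right in rows))  # True
```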


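* We frequently omit the · operation, writing ab for a · b when no confusion can result.

(a) Commutative            a · b = b · a                      a + b = b + a
(b) Associative            (a · b) · c = a · (b · c)          (a + b) + c = a + (b + c)
(c) Distributive           a · (b + c) = (a · b) + (a · c)    a + (b · c) = (a + b) · (a + c)
(d) Identity               a · 1 = a                          a + 0 = a
(e) Negation               a + a' = 1                         a · a' = 0
(f) Double negative        (a')' = a
(g) Idempotent             a · a = a                          a + a = a
(h) DeMorgan's laws        (a · b)' = a' + b'                 (a + b)' = a' · b'
(i) Universal bound        a + 1 = 1                          a · 0 = 0
(j) Absorption             a · (a + b) = a                    a + (a · b) = a
(k) Complements of 1 and 0 1' = 0                             0' = 1

Table 3-10 Properties of a Boolean Algebra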
Notes:

1. It turns out that to verify that any given system is a Boolean algebra, one needs
only check that properties (a), (c), (d), and (e) are true. The other seven properties
can be proven if we know these four hold.

2. One of the advantages of the Boolean algebra notation is that some of the
properties are analogous to familiar properties of ordinary algebra. For example, the
first of the two distributive properties is the same as what we are familiar with.
This makes some symbolic manipulations easier for us than they would be using
either the logical symbols or the set symbols.


Duality 


Every property of a Boolean algebra, except for property (f), occurs in two forms.
For example, we have these two negation properties:

a + a' = 1
a · a' = 0

We say these properties are "duals of each other." If we start with the first
negation property, a + a' = 1, and change + to · and 1 to 0, we get the second negation
property.





Definition The dual of a Boolean algebra expression is the expression
obtained by interchanging the roles of + and ·, and also interchanging the
roles of 1 and 0.


Example 7 Form the dual of these expressions/equalities:

1. (a + b)(c' + 1)
2. ab' + 0 = (a' + b)'

SOLUTION

1. (a · b) + (c' · 0). Observe that we can drop some parentheses and · symbols,
obtaining ab + c'0.

2. (a + b') · 1 = (a'b)'. Be careful to insert the required parentheses when
converting · to + as you form the dual.
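
Forming a dual is a purely symbolic swap, which the following sketch makes concrete. It assumes the input is written with every · shown explicitly and with full parentheses, so that no parentheses need to be inserted (the pitfall noted in the solution above). The name `dual` and the 0/1 check at the end are illustrative additions, not part of the text's method.

```python
from itertools import product

# Swap + with · and 0 with 1, character by character.
SWAP = str.maketrans("+·01", "·+10")


def dual(expr: str) -> str:
    """Dual of a fully parenthesized expression with explicit · symbols."""
    return expr.translate(SWAP)


print(dual("(a + b) · (c' + 1)"))           # (a · b) + (c' · 0)
print(dual("((a · b') + 0) = (a' + b)'"))   # ((a + b') · 1) = (a' · b)'

# Spot-check equality 2 and its dual on the values 0 and 1.
for a, b in product((0, 1), repeat=2):
    na, nb = 1 - a, 1 - b
    assert ((a & nb) | 0) == 1 - (na | b)   # ab' + 0 = (a' + b)'
    assert ((a | nb) & 1) == 1 - (na & b)   # (a + b') · 1 = (a'b)'
```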


Practice Problem 3 Form the dual of the expression (a + 0)'(b' · 1) + a and of
the equality (a' + b)(b + c)' = a'b'c'.


Because all the properties occur in dual pairs (and (f) is self-dual because it
contains no 1, 0, +, or ·), we obtain the following principle:


Theorem 1 (Duality) For every true equality in a Boolean algebra, the "dual"
of that equality (given by swapping + and · and swapping 0 and 1) is also true.


Why? Because the same proof that proved the original equality could be used
to prove the dual equality. Consider the following example.


Example 8 Verify the Boolean algebra equality a(a' + b) = a · b.
SOLUTION (Version 1)

a(a' + b) = a · a' + a · b   (c, distributive)
          = 0 + a · b        (e, negation)
          = a · b + 0        (a, commutative)
          = a · b            (d, identity)

(Version 2) This is the dual of the equality established in Example 6.

Observe that the Version 1 proof quotes exactly the same properties, in
exactly the same order, and for exactly the same reasons, as the proof in Example 6.
Informally, this is the reason the duality principle applies. It is worth noting that
sometimes it is easier to prove the dual of a theorem than it is to prove the
theorem as stated, simply because some properties of Boolean algebra more closely
mirror properties of "ordinary" algebra. For example, the distributive property
a(b + c) = ab + ac comes more naturally to us than the dual distributive property
a + bc = (a + b)(a + c).


Practice Problem 4 Here is a proof of an equality in Boolean algebra. Write out
the proof of the dual equality, and for each step tell which Boolean algebra property
is being applied.

(a + b)(ab')' = (a + b)(a' + b'')
              = (a + b)(a' + b)
              = (b + a)(b + a')
              = b + aa'
              = b + 0
              = b


Solutions to Practice Problems

1. (a) ((A ∪ ∅)' ∩ (B' ∩ U)) ∪ A
   (b) (¬p ∨ q) ∧ ¬(q ∨ r) ≡ ¬p ∧ ¬q ∧ ¬r

2. ((a + 0)'(b' · 1)) + a and (a' + b)(b + c)' = a'b'c'

3. ((a · 1)' + (b' + 0)) · a and (a'b) + (bc)' = a' + b' + c'

4. ab + (a + b')' = ab + a'b''   DeMorgan's
                  = ab + a'b     double negative
                  = ba + ba'     commutative
                  = b(a + a')    distributive
                  = b · 1        negation
                  = b            identity


Exercises for Section 3.4


1. Rewrite these logical equivalences in Boolean algebra
notation.
(0 (p^) v p= p
w (p^) pv (pag)
(e) ~pa) alpy a) = p
@ -(pvCp^qi
(9) ~pa) v (pva) mt
(f) (p ∨ r) ∧ (q ∨ r) ≡ (p ∧ q) ∨ r
2. Rewrite each of these set equalities in Boolean algebra
notation. (Hint: Remember that A − B is the same
as A ∩ B'.)
(a) A ∩ (B' ∪ A) = A
(b) (A ∪ B) ∩ C' = (A ∩ C') ∪ (B ∩ C')
(c) (A ∪ B) ∩ (A' ∩ C)' = A ∪ (B ∩ C')
(d) (A ∩ B) ∪ A = A
(e) (A − B)' = A' ∪ (A ∩ B)
(f) (A ∩ B) − (B ∩ C) = (A ∩ B) − C
3. Justify each step in the following proofs by quoting one
of the properties of a Boolean algebra:

(a) Claim: For all a, (a + 1)(a + 0) = a.
Proof
(a + 1)(a + 0) = 1 · (a + 0)
               = (a + 0) · 1
               = a + 0
               = a ∎





(b) Claim: For all a and b, a(a' + b) = ab.
Proof
a(a' + b) = a · a' + a · b
          = 0 + a · b
          = a · b + 0
          = a · b ∎
(c) Claim: For all a and b, a + a'b = a + b.
Proof
a + b = (a + b) · 1
      = 1 · (a + b)
      = (a + a') · (a + b)
      = a · (a + b) + a' · (a + b)
      = a + a' · (a + b)
      = a + a' · a + a' · b
      = a + a · a' + a' · b
      = a + 0 + a' · b
      = a + a'b ∎














4. Prove the following identities in a Boolean algebra,
justifying each step by quoting one of the properties of a
Boolean algebra.
(a) (a+ b)b-- c) mace b 
O) ab be = (1) 

(e) (a - bya'c) =a + be 
(a) ab +(a’ c) =a(b+e) 
5. Fill in the blanks in each proof below with the properties
of Boolean algebra that make each equality true:














(a) Claim: If a + b = b, then a + (b · a') = b.
Proof. Let a and b be given such that a + b = b.
Then

b = a + b
  = (a + b) · 1
  = (a + b) · (a + a')
  = a + (b · a')
  = a + b · a' ∎


(b) Claim: If ab = a, then b(ba')' = a.
Proof. Let a and b be given such that ab = a. Then


bwy 








Sinea-b=bh m 


6. For a Boolean algebra, prove each of the following:
(a) If a + b = b, then ab = a.
(b) If ab = a, then a' + b = 1.
(c) If a' + b = 1, then ab' = 0.
(d) If ab' = 0, then a + b = b.

Note that these four statements establish that the
statements (i) a + b = b, (ii) ab = a, (iii) a' + b = 1, and
(iv) ab' = 0 are equivalent. That is, if one of them is true,
then all four of them are true.

7. Rewrite the two propositions you proved in Exercise 5
as theorems of set theory. (Use the notation F − G in
place of F ∩ G' when you write the theorems.) Which
do you suppose is easier, the Boolean algebra proof of
these theorems or an element-wise proof in the style of
the previous section?

8. In a Boolean algebra, we can define a relationship ≤ by
saying a ≤ b means that a · b = a. By quoting
properties of a Boolean algebra, verify the following properties
this relationship has:

(a) a ≤ a

(b) If a ≤ b and b ≤ a, then a = b.

(c) If a ≤ b and b ≤ c, then a ≤ c.

9. Consider the set S of all divisors of 30,
S = {1, 2, 3, 5, 6, 10, 15, 30}. Define operations · and + by
a · b = gcd(a, b) and a + b = lcm(a, b). (By gcd we
mean the greatest common divisor, and by lcm we
mean the least common multiple.)

(a) Fill in the entries in Table 3-11 to show the result of
doing the operations.

(b) There is an element of S that acts as an identity for
multiplication. (If we call this element u, it will be
true that for any element a, a · u = a.) Which
element has this property?

(c) There is an element of S that acts as an identity for
addition. (If we call this element z, it will be true
that for any element a, a + z = a.) Which element
has this property?

(d) For each element a of the set, define a' as the
element that satisfies both a · a' = z and a + a' = u,
where z and u are the elements determined in the
previous steps. Calculate a' for each element of S.

10. Repeat Exercise 9 for the divisors of 70.














































































































 ·  |  1   2   3   5   6  10  15  30        +  |  1   2   3   5   6  10  15  30
----+--------------------------------     ----+--------------------------------
  1 |                                       1 |
  2 |                                       2 |
  3 |                                       3 |
  5 |                                       5 |
  6 |                                       6 |
 10 |                                      10 |
 15 |                                      15 |
 30 |                                      30 |

Table 3-11 Addition and Multiplication Tables




















































































Table 3-12 Table for Exercise 12

Figure 3-14 Lattice of divisors of 30.

11. Refer to Exercise 9. Do you think this set with these
operations (·, +, and ') forms a Boolean algebra? To
decide, you need to decide if properties (a), (c), (d),
and (e) are true. What elements play the role of 1 and
0?
12. Refer to Exercise 9. Define the predicate L by saying
L(a, b) means that a · b = a.
(a) In Table 3-12, place a checkmark to indicate those
pairs where L(a, b) is true.
(b) Describe in words what relationship a and b must
have for L(a, b) to be true.
(c) Complete the visualization in Figure 3-14 showing
the truth value of the L predicate among divisors
of 30. To do this, we place a at a level below b and
draw a line joining them whenever L(a, b) is true.
(d) Repeat part 12(c) for the divisors of 70.
13. Starting with only properties (a), (c), (d), and (e) of a
Boolean algebra, prove the following properties. Once
you have proved a property, it can be used for the later
proofs.
(a) Property (g) (Hint: ...)
(b) Property (i) (Hint: Start with a + 1
and use the fact that 1 = a + a'.)
(c) Property (j) (Hint: Start with a(a + b) = (a +
0)(a + b).)
14. It turns out that for any elements x and y of a Boolean
algebra, if x + y = 1 and xy = 0, then y must be equal
to x'. Thus, to prove DeMorgan's law

(a · b)' = a' + b'

all we have to do is verify that

(a · b) + (a' + b') = 1

and that

(a · b) · (a' + b') = 0

Do this. You may quote any property of Boolean
algebra that you wish except DeMorgan's law, of course,
since that is the one you are proving.





3.5 Excursion: Logic Circuits











One of the fascinating things about mathematics is that applications of an idea are
frequently discovered many years, even centuries, after the mathematical idea is
developed. Boolean algebra provides an excellent example of this phenomenon.
George Boole developed his ideas in the mid-1800s, and in the mid-1900s these same
ideas were used in the construction of the electrical circuits that make up a computer.

We will give an overview of how Boolean algebra relates to computer circuits.
Data are stored and manipulated in a computer as binary (base two) numbers, that
is, as strings of 1's and 0's. For example, on many computers the letter "C" is stored as
the string 01000011, and the number 75 as 0000000001001011. Each of the numbers
in the string is referred to as a bit (binary digit). Individual bits are represented by
voltages, with two different voltage levels representing the 0 value and the 1 value.













These bits are combined using complicated circuits, to allow for operations such as 
integer arithmetic. For example, a device called the arithmetic-logic unit (ALU) can, 
among other things, combine two strings that represent numbers to obtain a third 
string that represents the sum of those numbers. Given the string 0000000001001011 
(representing 75) and the string 0000000000000011 (representing 3), it creates the 
string 0000000001001110 (representing 78). 
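
The arithmetic in this example is easy to reproduce in software. The sketch below does not model the ALU's circuitry; it simply assumes 16-bit unsigned strings and uses Python's built-in base conversion to confirm the sum quoted above. The variable names are illustrative.

```python
a = "0000000001001011"   # the string representing 75
b = "0000000000000011"   # the string representing 3

# Convert each bit string to an integer, add, and format the
# result as a 16-character binary string with leading zeros.
total = int(a, 2) + int(b, 2)
result = format(total, "016b")

print(int(a, 2), int(b, 2), total)  # 75 3 78
print(result)                       # 0000000001001110
```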

How is this accomplished? The answer is that simple circuits called gates are 
used to combine one or more single bits into a single bit answer. These answers are 
then combined (using more gates) to obtain further answers. We will not examine 
in detail the various ways in which these combinations can be used, for example, to 
do addition. However, we will learn about the individual gates, and we will see how 
to use combinations of these gates to obtain any desired output from a given set of 
inputs. In essence, we will see how to use these gates to build truth tables. 


Logic Gates

One of the gates that is used to combine bits is called the and gate. Figure 3-15 shows
the symbolic representation of a two-input and gate.

Figure 3-15 The and gate.

The wires labeled a and b contain an "input" voltage that is either the voltage
that represents the bit value 1, or the voltage that represents the bit value 0. The
resulting "output" voltage, labeled a · b, is given by this "truth table":

a  b | a · b
-----+------
0  0 |   0
0  1 |   0
1  0 |   0
1  1 |   1

Observe that if we think of 1 as meaning true and 0 as meaning false, this is
exactly the truth table for the logical and operation. This is the motivation for calling
this an and gate. It is also possible to build and gates with more than two inputs,
where the rule is "The output is 1 if and only if all the inputs are 1." We also note
that we frequently omit the · operation, writing ab for a · b.

Similarly, one can build or gates with two or more inputs, where the rule is "The
output is 1 if and only if at least one of the inputs is 1." For two inputs, Figure 3-16
shows the symbolic representation. Here is the truth table:

a  b | a + b
-----+------
0  0 |   0
0  1 |   1
1  0 |   1
1  1 |   1

Again with 0 for false and 1 for true, this is the truth table for the logical or
operation; hence, the name or gate.

The final basic building block is the inverter, which takes a single input and forms
its negation or complement. Figure 3-17 shows the symbolic representation. Here is
the truth table:

a | a'
--+---
0 | 1
1 | 0

Figure 3-17 The not gate.


With 0 meaning false and 1 meaning true, this is the same as the logical not 
operation, and an inverter is also called a not gate. 
These simple building blocks can be combined in an unlimited number of ways.
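
The three gates can be modeled as small functions on the bit values 0 and 1, which makes it easy to tabulate any combination of them. This is a software sketch only, with hypothetical function names `and_gate`, `or_gate`, and `not_gate`; real gates operate on voltages, as described above.

```python
from itertools import product


def and_gate(*inputs):
    """Output 1 if and only if all the inputs are 1."""
    return int(all(inputs))


def or_gate(*inputs):
    """Output 1 if and only if at least one of the inputs is 1."""
    return int(any(inputs))


def not_gate(a):
    """Inverter: output the complement of the single input."""
    return 1 - a


# Print the truth tables of the three basic gates side by side.
print("a b | a·b a+b a'")
for a, b in product((0, 1), repeat=2):
    print(a, b, "|", and_gate(a, b), " ", or_gate(a, b), " ", not_gate(a))
```

Because the functions accept any number of inputs, `and_gate(1, 1, 1)` and `or_gate(0, 0, 0)` model the multi-input gates mentioned in the text.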


Example 1 Consider the circuit in Figure 3-18. What output will be obtained
for the different combinations of input? That is, what is the truth table for this circuit? 
SOLUTION Perhaps the easiest way to answer this question is to label each 
of the outputs from the gates, working our way from left to right, as shown in 

Figure 3-19 on page 232. 
We can then build the truth table as follows: 


a bj @ wtb a5 





Observe that the output from this complicated circuit (a two-input or gate,
a two-input and gate, and a not gate) is the same as what we would get with a
circuit consisting of just a two-input and gate. We say that the two circuits are
equivalent because their outputs are the same.
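The comparison in Example 1 can be sketched in a few lines of Python. This is only an illustration: it assumes the circuit computes a(a' + b), as the text identifies it later in this section, and the function names are hypothetical.

```python
from itertools import product

def circuit(a: int, b: int) -> int:
    """Assumed Example 1 circuit: a AND ((NOT a) OR b), on bits 0/1."""
    not_a = 1 - a          # not gate
    or_out = not_a | b     # two-input or gate
    return a & or_out      # two-input and gate

def and_gate(a: int, b: int) -> int:
    """A single two-input and gate."""
    return a & b

def truth_table(f, n_inputs: int):
    """Rows (inputs, output), listed from all 0's to all 1's."""
    return [(bits, f(*bits)) for bits in product((0, 1), repeat=n_inputs)]

def equivalent(f, g, n_inputs: int) -> bool:
    """Two circuits are equivalent if they agree on every input combination."""
    return all(f(*bits) == g(*bits) for bits in product((0, 1), repeat=n_inputs))

for inputs, output in truth_table(circuit, 2):
    print(inputs, output)
print(equivalent(circuit, and_gate, 2))
```

Checking every row is practical here because a circuit with n inputs has only 2^n input combinations.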


Practice Problem 1 Build the truth table for the circuit shown in Figure 3-20 on 
page 232. 


Simplifying Circuits 

An important question to be considered in this section is how we can build a circuit
with a desired output, and how we can accomplish this with as few gates as possible.
Part of the answer to the latter question comes from realizing that the expressions
which represent the output from circuits form a Boolean algebra, where "=" means
equivalent circuits. To verify this, we only need to verify these four properties:

(a) Commutative    ab = ba                   a + b = b + a
(c) Distributive   a(b + c) = (ab) + (ac)    a + (bc) = (a + b)(a + c)
(d) Identity       a · 1 = a                 a + 0 = a
(e) Negation       a + a' = 1                a · a' = 0










Figure 3-18 The circuit for Example 1. 





Chapter 3 / Sets and Boolean Algebra 








Figure 3-19 The solution to Example 1. 


Two circuits are equivalent if they give the same output for all possible combi- 
nations of input; hence, we can establish these equivalences with truth tables. 


Example 2 Show a + (b · c) = (a + b) · (a + c).

SOLUTION Here are the truth tables for the expressions a + (b · c) and
(a + b) · (a + c):

a  b  c | bc  a + bc
0  0  0 | 0     0
0  0  1 | 0     0
0  1  0 | 0     0
0  1  1 | 1     1
1  0  0 | 0     1
1  0  1 | 0     1
1  1  0 | 0     1
1  1  1 | 1     1

a  b  c | a + b  a + c  (a + b)(a + c)
0  0  0 |   0      0          0
0  0  1 |   0      1          0
0  1  0 |   1      0          0
0  1  1 |   1      1          1
1  0  0 |   1      1          1
1  0  1 |   1      1          1
1  1  0 |   1      1          1
1  1  1 |   1      1          1

The fact that the two circuits give the same output for all possible combinations
of input is exactly what we mean when we write a + (b · c) = (a + b) · (a + c).
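The same truth-table check can be run mechanically for all four properties. The sketch below is illustrative only: it models the gates with Python's bit operators, and the dictionary keys are just descriptive labels chosen here.

```python
from itertools import product

def NOT(a: int) -> int:
    """Inverter on a single bit."""
    return 1 - a

# Each law is a predicate on bits a, b, c; unused arguments are ignored.
laws = {
    "commutative (and)": lambda a, b, c: (a & b) == (b & a),
    "commutative (or)": lambda a, b, c: (a | b) == (b | a),
    "distributive (and over or)": lambda a, b, c: (a & (b | c)) == ((a & b) | (a & c)),
    "distributive (or over and)": lambda a, b, c: (a | (b & c)) == ((a | b) & (a | c)),
    "identity (and)": lambda a, b, c: (a & 1) == a,
    "identity (or)": lambda a, b, c: (a | 0) == a,
    "negation (or)": lambda a, b, c: (a | NOT(a)) == 1,
    "negation (and)": lambda a, b, c: (a & NOT(a)) == 0,
}

def holds(law) -> bool:
    """A law holds if it is true in every row of the three-variable truth table."""
    return all(law(a, b, c) for a, b, c in product((0, 1), repeat=3))

results = {name: holds(law) for name, law in laws.items()}
for name, ok in results.items():
    print(f"{name}: {ok}")
```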


It is easy to verify that the other three properties also hold. Thus, these combinations
of simple gates constitute a Boolean algebra. We may therefore use all the
properties of a Boolean algebra to simplify the expression for a given circuit. We
could have anticipated that the a(a' + b) circuit would have the same output as the
a · b circuit, because we had previously done this simplification using the properties
of a Boolean algebra:

a(a' + b) = aa' + ab    part (c)
          = 0 + ab      part (e)
          = ab + 0      part (a)
          = ab          part (d)











Figure 3-20 The circuit for Practice Problem 1.

Figure 3-21 The circuit for Example 3.





Example 3 Write the Boolean algebra expression for the circuit shown in
Figure 3-21, and simplify as much as you can.
SOLUTION The expression is (a + b)(a'c)'. We simplify this as follows:

(a + b)(a'c)' = (a + b)((a')' + c')    part (h)
              = (a + b)(a + c')        part (f)
              = a + bc'                part (c)

Observe that the original circuit contained one two-input or gate, two two-input
and gates, and two not gates. The circuit for the simplified expression has one
two-input or gate, one two-input and gate, and one not gate.


Practice Problem 2 Write the Boolean algebra expression for the circuit shown
in Figure 3-22, and simplify as much as you can.


The next example illustrates a truth table for three variables, and shows a general
way to convert from a truth table to a Boolean algebra expression. The Boolean
algebra expression obtained in this manner is not the simplest, but there are standard
methods for simplifying the expression. We will consider this topic in more detail in
the next section.

It is worth noting that, as we did for logical expressions, we list the truth table in
a systematic manner, starting with a row of 0's and ending with a row of 1's. Just as in
counting, we cause the rightmost position to vary most frequently. It would certainly
be possible to start with all 1's. This would correspond to starting with all true for
the logical expressions. We have chosen to begin with all 0's because that makes the
rows progress just as numbers in the binary number system: 000, 001, 010, 011, 100,
101, 110, 111.
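As a small illustration of this row ordering, the following Python sketch lists the rows for n inputs and confirms that they count upward in binary. The helper name `rows` is hypothetical.

```python
from itertools import product

def rows(n: int):
    """Truth-table rows for n inputs, rightmost position varying fastest."""
    return ["".join(str(bit) for bit in bits) for bits in product((0, 1), repeat=n)]

table_rows = rows(3)
print(table_rows)
# Reading each row as a binary numeral gives 0, 1, 2, ... in order.
print([int(row, 2) for row in table_rows])
```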















































Figure 3-22 The circuit for Practice Problem 2.


Example 4 Write a Boolean algebra expression that gives the following result,
draw the corresponding circuit, and count the gates that are used:

a  b  c | Result
0  0  0 |   0
0  0  1 |   0
0  1  0 |   1
0  1  1 |   0
1  0  0 |   1
1  0  1 |   1
1  1  0 |   1
1  1  1 |   1

SOLUTION We present a technique that yields a so-called sum-of-products
expression. The answer will be the sum of several terms of the general form
xyz, where the first factor is either a or a', the second factor is either b or b', and
the third factor is either c or c'. If we use terms of this form, it is easy to build
an expression that has a 1 exactly where we want it. For example, the term
a'bc' will have a 1 in the row where a is 0, b is 1, and c is 0. That is, a'bc' has a 1
in row 3 and 0 in the other rows. We apply this same idea for each of the other
four rows where we want a 1:

a'bc'  has a 1 in row 3, and 0 in the other rows
ab'c'  has a 1 in row 5, and 0 in the other rows
ab'c   has a 1 in row 6, and 0 in the other rows
abc'   has a 1 in row 7, and 0 in the other rows
abc    has a 1 in row 8, and 0 in the other rows

If we combine these five results with the logical or operation, we will get a
1 in rows 3, 5, 6, 7, 8 (each row where at least one of these results was 1), and 0
everywhere else. This is the desired result. So the answer is

a'bc' + ab'c' + ab'c + abc' + abc

The corresponding circuit is shown in Figure 3-23. It uses five three-input and
gates, one five-input or gate, and six not gates.
Note: In Figure 3-23, we adopt some frequently used shorthands: (1) we indicate
inverters (not gates) by a small circle placed immediately adjacent to the next
and gate or or gate; and (2) we do not include the part of the circuitry that splits
each incoming signal.
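The technique of Example 4 is mechanical enough to sketch in Python. This is an illustrative sketch, not the book's procedure verbatim: the function name is hypothetical, and the result column is taken from the solution's statement that a 1 is wanted in rows 3, 5, 6, 7, and 8.

```python
from itertools import product

def sum_of_products(outputs, names=("a", "b", "c")):
    """Build a sum-of-products expression from a result column.

    outputs lists the desired 0/1 result for each row, with rows in
    binary counting order (000, 001, ..., 111 for three variables).
    """
    terms = []
    for bits, out in zip(product((0, 1), repeat=len(names)), outputs):
        if out == 1:
            # Use the plain variable where the input is 1, the complement where it is 0.
            terms.append("".join(n if bit else n + "'" for n, bit in zip(names, bits)))
    return " + ".join(terms) if terms else "0"

# Result column assumed from Example 4: 1 in rows 3, 5, 6, 7, 8.
example4 = [0, 0, 1, 0, 1, 1, 1, 1]
print(sum_of_products(example4))
```

Each term contributes a 1 in exactly one row, so the number of terms always equals the number of 1's in the result column.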





Practice Problem 3 What is the truth table for a + bc'?


Examine the truth tables for a + bc' you just did in the practice problem and
for a'bc' + ab'c' + ab'c + abc' + abc of Example 4. The truth tables are identical,
so the corresponding circuits are equivalent. We have now seen three expressions
that yield this same output: (a + b)(a'c)', a + bc', and a'bc' + ab'c' + ab'c + abc' +










abc. Simplifying (a + b)(a'c)' to a + bc' was not difficult. Simplifying a'bc' + ab'c' +
ab'c + abc' + abc is more tedious, but it can be done:

a'bc' + ab'c' + ab'c + abc' + abc = a'bc' + ab'(c' + c) + ab(c' + c)
                                  = a'bc' + ab'(c + c') + ab(c + c')
                                  = a'bc' + ab'(1) + ab(1)
                                  = a'bc' + ab' + ab
                                  = a'bc' + a(b' + b)
                                  = a'bc' + a(b + b')
                                  = a'bc' + a(1)
                                  = a'bc' + a
                                  = a + a'bc'
                                  = (a + a')(a + bc')
                                  = (1)(a + bc')
                                  = (a + bc')(1)
                                  = a + bc'

Figure 3-23 A more complicated circuit.
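A brute-force check can confirm that the three expressions discussed here agree on every input. The sketch below is only illustrative: it reads the Example 3 expression as (a + b)(a'c)', and the function names are hypothetical.

```python
from itertools import product

def NOT(x: int) -> int:
    """Inverter on a single bit."""
    return 1 - x

def original(a, b, c):
    """(a + b)(a'c)', the Example 3 circuit as read here."""
    return (a | b) & NOT(NOT(a) & c)

def simplified(a, b, c):
    """a + bc'."""
    return a | (b & NOT(c))

def sum_of_products(a, b, c):
    """a'bc' + ab'c' + ab'c + abc' + abc."""
    return ((NOT(a) & b & NOT(c)) | (a & NOT(b) & NOT(c)) | (a & NOT(b) & c)
            | (a & b & NOT(c)) | (a & b & c))

def column(f):
    """Result column of f, rows in binary counting order."""
    return [f(a, b, c) for a, b, c in product((0, 1), repeat=3)]

print(column(original))
print(column(simplified))
print(column(sum_of_products))
```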





We finish this section with a visual method for simplifying sum-of-products ex- 
pressions. 


Karnaugh Maps 


In the remainder of this section, we introduce a graphical technique for simplifying
sum-of-products expressions containing up to four variables. We will name the
variables x, y, z, and w. The technique is due to Maurice Karnaugh, and can actually
be extended to expressions containing up to six variables. We use a visualization
referred to as a Karnaugh map. The use of Karnaugh maps gives the simplest possible
sum-of-products expression. It is sometimes possible to reduce the number of
gates further if we do not use a sum-of-products expression. For example, xy' + xz'
requires two two-input and gates, one two-input or gate, and two not gates. We can








rewrite this as xy' + xz' = x(y' + z') = x(yz)', which can be built with two two-input
and gates and one not gate. In this section, we restrict our attention to finding minimal
sum-of-products expressions.

Figure 3-24 The Karnaugh map grid for two variables.

Figure 3-26 The Karnaugh map for Example 6.





Two variables 


The technique and its underlying logic are relatively easy to visualize for two
variables, so we begin with some two-variable examples. The basic idea is to use a grid to
lay out all the possible terms (products of the two variables or their complements)
so that terms we can simplify are in adjacent squares. When only two variables are
involved in the expression, we lay out the grid as in Figure 3-24.











Example 5 Perhaps the simplest example is xy + xy', which can be simplified
to x(y + y') = x · 1 = x. In the Karnaugh map we place checkmarks in the boxes that
correspond to the two terms xy and xy', as shown in the grid on the left in Figure 3-25.
The existence of a rectangular shape in the checkmarks is our signal that simplification
is possible. We put an oval around the rectangle, as shown in the grid on the right in
Figure 3-25, and we recognize that the presence of a rectangle of "area 2" indicates
that one variable will not appear in the simplification. Which variable will not appear?
The one that appears in both complemented and noncomplemented form (y in this
example), since when we simplify, the y and y' will sum to 1 and thus no longer appear.


xy + xy' = x(y + y') = x · 1 = x





Practice Problem 4 Use a Karnaugh map to simplify xy' + x'y'.


Example 6 Simplify this more complicated expression: xy + xy' + x'y.
SOLUTION In the Karnaugh map shown in Figure 3-26, we can identify both
a horizontal rectangle and a vertical rectangle, so we circle them both. A single
checkmark is allowed to appear in more than one rectangle. The expression for
the horizontal rectangle is x, for the vertical rectangle is y, and the final answer
is the sum of the expressions for the two rectangles: x + y.








To see that this gives the same answer we could get by doing the algebra, we
first of all realize that property (g) (the idempotent property) allows us to write
xy = xy + xy. If we omit the steps that just apply the commutative property, the
algebraic simplification goes as follows:


































Figure 3-25 The Karnaugh 
map for Example 5. 








xy + xy' + x'y = xy + xy + xy' + x'y
               = xy + xy' + xy + x'y
               = x(y + y') + (x + x')y
               = x · 1 + 1 · y
               = x + y








The horizontal rectangle provides a visualization that xy + xy' will simplify to x, and
the vertical rectangle is a visualization that xy + x'y will simplify to y.















Practice Problem 5 Use a Karnaugh map to simplify xy' + x'y + x'y'.

Before moving on to more complex expressions, note that sometimes no simplification
can be done. For example, the Karnaugh map for the expression xy + x'y' is
shown in Figure 3-27. Since the only rectangles we can circle are 1 × 1 rectangles, we
conclude that the original expression is already simplified as much as possible.

Figure 3-27 The Karnaugh map with no simplification.


Three variables

If a sum-of-products expression involves three variables, we use the Karnaugh map
form shown in Figure 3-28.

Figure 3-28 The Karnaugh map grid for three variables.

Along the top of this picture, labels that are side by side (like yz and yz') differ
in exactly one of the two variables. This is the key in all Karnaugh maps to using
rectangles to simplify an expression.








Example 7 Simplify xyz + xy'z' + x'y'z' + x'y'z.

SOLUTION The Karnaugh map with the terms checked and the rectangles
circled is shown in Figure 3-29. Since we can completely cover the checked
squares with three rectangles, the simplified expression will contain three terms:

• The 1 × 1 rectangle does not remove any variables. Its term is xyz.

• The vertical circle of "area 2" removes one variable (x). We reason that x
appears in the rectangle both as x and x', so it will not appear in the
answer. The term is y'z'.

• The horizontal circle of "area 2" removes one variable (z). We reason that z
appears in the rectangle both as z and z', so it will not appear in the
answer. The term is x'y'.

The final answer is xyz + y'z' + x'y'.

Figure 3-29 The Karnaugh map for Example 7.

Practice Problem 6 Simplify xyz + x'yz + x'yz' + xy'z.


It is no surprise that the three-variable problem is more complicated than the
two-variable problem. The larger Karnaugh map used for the three-variable problem
creates some issues that we have not had to deal with before. For example, it is now
relevant that only a rectangle whose area is a power of 2 corresponds to a collection







Figure 3-30 The Karnaugh maps for Example 8. 



































of terms that simplify. In particular, a simplifying rectangle has sides of length 1, 2 or 
4. No simplifying rectangle has a side of length 3. 





Example 8 Find the simplest sum-of-products expression for the Karnaugh
maps shown in Figure 3-30.
SOLUTION In each, we circle a rectangle of area 4. Such a rectangle will
cause two variables not to appear in the answer, and once again the general
rule is that any variable appearing in the rectangle both complemented and
uncomplemented will not be part of the term for that rectangle.
Using this rule of thumb, we see that the first Karnaugh map simplifies to
x. Here is the algebra that the Karnaugh map captures:

xyz + xyz' + xy'z' + xy'z = xy(z + z') + xy'(z' + z)
                          = xy · 1 + xy' · 1
                          = xy + xy'
                          = x(y + y')
                          = x · 1
                          = x

Likewise, in the rectangle of the second Karnaugh map, both x and x'
boxes are checked, and y and y' boxes are checked, so x and y are not in the
term for that rectangle. Thus, the solution for that map is z.
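The rule of thumb for reading a term off a circled rectangle can be sketched as follows. This is an illustrative sketch under stated assumptions: each checked box is encoded as a tuple of bits (1 for a plain variable, 0 for its complement), the function name is hypothetical, and the code assumes the boxes passed in already form a valid rectangle.

```python
def term_for_rectangle(cells, names):
    """Product term for a rectangle of checked boxes.

    cells: list of bit tuples, one per box (1 = plain, 0 = complemented).
    A variable seen in both forms drops out of the term.
    """
    factors = []
    for i, name in enumerate(names):
        values = {cell[i] for cell in cells}
        if values == {1}:
            factors.append(name)
        elif values == {0}:
            factors.append(name + "'")
        # values == {0, 1}: the variable appears both ways, so it is omitted
    return "".join(factors) if factors else "1"

# The rectangle covering the whole x row of a three-variable map.
row_x = [(1, 1, 1), (1, 1, 0), (1, 0, 0), (1, 0, 1)]
print(term_for_rectangle(row_x, ("x", "y", "z")))

# The two boxes xy and xy' of a two-variable map.
print(term_for_rectangle([(1, 1), (1, 0)], ("x", "y")))
```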


Practice Problem 7 Find the simplest sum-of-products expression for the Karnaugh
map shown in Figure 3-31. (Hint: Use a 1 × 4 rectangle and a 2 × 2 rectangle.)

Figure 3-31 Karnaugh map for Practice Problem 7.








A second complication with the three-variable problem is that checkmarks
which do not appear to be side by side nevertheless can be simplified. For example,
consider the Karnaugh map shown in Figure 3-32. Algebraically, the expression
does simplify.

Figure 3-32 A complication with Karnaugh maps.

For the Karnaugh map visualization to give us this simplification, we merely have to
visualize the map as if it were the label on a tin can, so that the left and right columns
would be adjacent to each other. Then we can circle the area 2 rectangle as in Figure
3-33, and our usual rule of thumb gives the proper answer.

Figure 3-33 A rectangle can "wrap-around".

















Figure 3-34 A solution to Example 9.

Figure 3-35 A better solution to Example 9.

Figure 3-36 Karnaugh map for Practice Problem 8.

Figure 3-37 Karnaugh map for xy + xy'z + x'z.

Figure 3-38 Karnaugh map grid for four variables.



Example 9 Simplify xyz + x'yz + xy'z' + xy'z + x'y'z.
SOLUTION The Karnaugh map is shown in Figure 3-34.
This choice of rectangles gives yz + xy'z' + y'z. However, this is not the
simplest sum-of-products expression. We should follow this guideline when we
choose our rectangles:





Choose rectangles so that the number of rectangles is as small 
as possible and each individual rectangle is as large as possible 
(but remembering that sides of length 3 are not allowed). 


Following this guideline, we obtain the better choice of rectangles for Example 9
shown in Figure 3-35. The area 4 rectangle in the first and last columns
has both x and x' checked, and also both y and y' checked, so the term is z.
The area 2 rectangle has both z and z' checked, and the term is xy'. The final
answer is z + xy'.





Practice Problem 8 Choose rectangles for the Karnaugh map shown in Figure
3-36.


Before we proceed to four variables, we make one last note. Suppose the expression
to be simplified is xy + xy'z + x'z. The question is, what box(es) do we check to
correspond to the xy in the problem? What about the x'z? To answer this, we reason
using the reverse of the process of going from Karnaugh map to term:


For a term in which one or more variables do not 
appear, think of those variables as appearing both 
complemented and uncomplemented, and check all 
the corresponding boxes. 


Applying this principle to the expression xy + xy'z + x'z, for term xy we check
both xyz and xyz', and similarly for x'z. Figure 3-37 shows the Karnaugh map for this
expression.
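The reverse principle, expanding a term with missing variables into the boxes to check, can be sketched in Python. The encoding here is an assumption made for illustration: a term is a dictionary from variable name to 1 (plain) or 0 (complemented), and the function name is hypothetical.

```python
from itertools import product

def cells_for_term(term, names=("x", "y", "z")):
    """Boxes to check for a product term.

    term maps a variable name to 1 (plain) or 0 (complemented).
    Variables missing from term are treated as appearing both ways.
    """
    free = [n for n in names if n not in term]
    cells = []
    for values in product((1, 0), repeat=len(free)):
        assignment = dict(term)
        assignment.update(zip(free, values))
        cells.append("".join(n if assignment[n] else n + "'" for n in names))
    return cells

print(cells_for_term({"x": 1, "y": 1}))          # the term xy
print(cells_for_term({"x": 0, "z": 1}))          # the term x'z
print(cells_for_term({"x": 1, "y": 0, "z": 1}))  # the term xy'z
```

A term missing k variables expands to 2^k boxes, which matches the rectangle areas of 1, 2, 4, and so on.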


Four variables 


When the expression involves four variables, we use a Karnaugh map form as shown 
in Figure 3-38. Observe that again each label differs from the one on either side only 
in one of the two variables. As for the three-variable map, the last column differs 
from the first column in only one variable, and we again think of the first and last 
columns as being adjacent. In addition, observe that the top and bottom row labels 
likewise differ in only one variable. We may therefore consider the top and bottom 
rows as being adjacent. 
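The adjacency property of the labels can be checked with a short Python sketch. The label order used below is an assumption: it extends the three-variable pattern (yz, yz', y'z', y'z) to the four-variable grid, since the figure itself is not reproduced here. All function names are hypothetical.

```python
# Assumed label order for the four-variable grid.
COLUMNS = ["zw", "zw'", "z'w'", "z'w"]
ROWS = ["xy", "xy'", "x'y'", "x'y"]

def literals(label):
    """Split a label such as z'w into {'z': 0, 'w': 1}."""
    result = {}
    i = 0
    while i < len(label):
        name = label[i]
        complemented = i + 1 < len(label) and label[i + 1] == "'"
        result[name] = 0 if complemented else 1
        i += 2 if complemented else 1
    return result

def differences(label1, label2):
    """Number of variables in which two labels differ."""
    a, b = literals(label1), literals(label2)
    return sum(a[v] != b[v] for v in a)

def neighbors_differ_by_one(labels):
    """True if each label differs from the next in exactly one variable,
    including the wrap-around from the last label back to the first."""
    n = len(labels)
    return all(differences(labels[i], labels[(i + 1) % n]) == 1 for i in range(n))

print(neighbors_differ_by_one(COLUMNS))
print(neighbors_differ_by_one(ROWS))
# Plain binary counting order does not have this property.
print(neighbors_differ_by_one(["z'w'", "z'w", "zw'", "zw"]))
```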











Example 10 Simplify xyz'w' + xy'z'w' + x'yz'w' + x'y'z'w' + xy'zw + xy'zw'.
SOLUTION Figure 3-39 shows the Karnaugh map on page 240.


Figure 3-40 Karnaugh map for Practice Problem 9.

Figure 3-42 Karnaugh map for Example 12.


























We circle an area 4 rectangle and an area 2 rectangle. The corresponding
terms are found as follows:

In the area 4 rectangle:       In the area 2 rectangle:
x appears as x and x'          x appears only as x
y appears as y and y'          y appears only as y'
z appears only as z'           z appears only as z
w appears only as w'           w appears as w and w'
The term is z'w'               The term is xy'z

The final answer is z'w' + xy'z.


Practice Problem 9 Write the expression that corresponds to the Karnaugh map
in Figure 3-40.


Example 11 Simplify xyzw' + xyz'w' + x'y'zw + x'y'zw' + x'y'z'w' + x'y'z'w +
x'yzw + x'yzw' + x'yz'w' + x'yz'w.
SOLUTION Figure 3-41 shows the Karnaugh map.

Observe that we do not just use an area 2 rectangle for the top two checkmarks.
Because the top and bottom rows are adjacent, we can combine them
with the two checkmarks on the bottom row to obtain a rectangle of area 4.
Again, we go through the reasoning in detail:

In the area 8 rectangle:       In the area 4 rectangle:
x appears only as x'           x appears as x and x'
y appears as y and y'          y appears only as y
z appears as z and z'          z appears as z and z'
w appears as w and w'          w appears only as w'
The term is x'                 The term is yw'

The final answer is x' + yw'.


Example 12 Circle the rectangles for the Karnaugh map shown in Figure
3-42.
SOLUTION The grid on the left in Figure 3-43 shows the most obvious solution,
but it is not the best solution. We have forgotten here that the top and
bottom rows are adjacent, and also the left and right columns are adjacent.
Thus, the four corners form a 2 × 2 rectangle of area 4, and we have the better
solution on the right in Figure 3-43.











Figure 3-43 Solution for Example 12.








Practice Problem 10 Give the expression that corresponds to the better selection 
of rectangles in Example 12. 


Example 13 Give the Karnaugh map for the expression y'w' + yz + x'yw'.

SOLUTION As we did in the three-variable problem, we handle the y'w'
term by realizing that since x is missing, we will have both x and x' appearing
in the Karnaugh map, and that since z is missing, we will have both z and z'
appearing. Just this term will cause the checkmarks shown in the leftmost grid
in Figure 3-44. For the yz term we reason that since x is missing, we will have
both x and x' appearing, and since w is missing, we will have both w and w'
appearing. This gives the checkmarks shown in the center grid of Figure 3-44.
Finally, for the x'yw' term we reason that we will have both z and z' appearing,
as shown in the rightmost grid of Figure 3-44. The complete final answer is given
in Figure 3-45.

Figure 3-44 The three terms in Example 13.

Figure 3-45 Complete Karnaugh map for Example 13.


Practice Problem 11 Give the corresponding simplified expression for the Karnaugh
map in Example 13.


Solutions to Practice Problems 


1 The expression is a’ + ab, Here is the truth table: 2 (at bob) = (a + bea b) = a + (ho)! = a+ 
chi =a+c0=a+0=4 


ab Gab 













e be a+b 





4 The Karnaugh map, with a vertical rectangle of area 2
circled, is shown in Figure 3-46. This time only the y'
will appear in the final answer (the rectangle is in the y'
column), and the x and x' will add up to 1. The answer
is y'.

5 Figure 3-47 shows the Karnaugh map. The vertical rectangle
represents y' and the horizontal rectangle x'. The
answer is x' + y'.








Figure 3-46 Solution to Practice Problem 4.

Figure 3-47 Solution to Practice Problem 5.

6 Figure 3-48 shows the Karnaugh map with the terms
checked and the rectangles circled. The checked squares
can be entirely covered by two rectangles. (It would also be
possible to put in a vertical rectangle around the
xyz and the x'yz, but we get a simpler answer if we
use as few rectangles as possible.) The top rectangle has
both y and y', so the y terms drop out and the term is
xz. For the bottom rectangle, z appears both as z and as
z', so there is no z in the term; the term is x'y. The final
answer is xz + x'y.

Figure 3-48 Solution to Practice Problem 6.

7 x' + y (The x' is the 1 × 4 rectangle along the bottom.
The y is the 2 × 2 rectangle in the first two columns.)

8 There are two equally good solutions shown in Figure
3-49.

Figure 3-49 Solution to Practice Problem 8.

9 xy + yw

10 yw + xyz

11 The expression y'w' + yz + x'w' comes from the rectangles
shown in Figure 3-50.

Figure 3-50 Solution to Practice Problem 11.


Exercises for Section 3.5

1. To show that the output from circuits made up of and,
or, and not gates forms a Boolean algebra, where "="
means equivalent circuits, we need to verify the commutative,
distributive, identity, and negation properties
(properties (a), (c), (d), and (e)). We used a truth table
to verify the equivalence a + (b · c) = (a + b) · (a + c),
which is one of the two distributive laws. Do the same
for the following:
(a) a · (b + c) = (a · b) + (a · c) (the other distributive
law)
(b) a + 0 = a (one of the identity laws)
(c) a · a' = 0 (one of the negation laws)


















































2. By quoting the properties of a Boolean algebra, justify a b 6 | Result 
cach step in the simplification fos 
o o o 0 
o 0 4 1 
o 1 0 D 
a'b + able + able + abe! abe a be oe a8 À 
zog D 
that we did in this section. ip a 1 
3. Give the truth table and draw the circuit corresponding peti : 
££. to the following Boolean algebra expressions. For the 
circuits you may use the shorthand notation developed 
in Example 4 z 
a +bla+ b) Me e a 
viae 7. Write a sum-of-produets Boolean algebra expression 
(©) (abt 0 +e) for each of the following truth tables: 
(d) (aba + cy 
(e) abe abe + ab ir 3. SET se adi 
(D abc + abe’ + able’ + able ———— ik: 
4. For each expression of Exercise 3, simplify the expres 0 0 0 1 
sion as much as you can by using the laws of Boolean 0 0 i 0 
lpehra 0 1 0 D 
5, Write a sum-of-products Boolean algebra expression w 4 H rH ; 
for each of the following truth table ] $1 1 
Logo 0 
roa J 1 
a b e Result. 
a b e Result 
mmm n 
MA E: 0 0 0 o 1 
0 d b o ELE 4 i 
wo 1 d » $ 4 oo 1 
1 » H 1 w 0 1 1 1 
1 0 1 9 1 0 0 1 
yeu tn D E E. $ 
1 1 1 1 1 1 0 1 
o d 1 
a b c || Result 
a b c jj Result E 7 
0 0 0 1 0 0 4 o 
0 9 d 1 0 1 0 1 
0 1 0 1 fe) 0 1 1 0 
mo 1 1 1 1 0 0 o 
bendi o o oo d 1 
1 0 1 0 1 1 0 0 
bas. dine o bor 1 
iva 1 o 











8. For each of the following pairs of circuits, determine
whether or not they are equivalent. If they are not equivalent,
give an input that demonstrates they are not.

(a) ab+ ab anda 

b) a'b--ab/ and ab 

(©) albe +a'beanda'e 

d) abc + abe -a'l/c "a b/c and abe + be’ eai 
(©) ab ac--a'c and ac + be kae 

9. Find the expression for each of these Karnaugh maps.
(Hint: In each case the answer consists of a single product.)






















































































€) yv y yr yz 
x | 
x Y |v 
€ x v yr yz 
x [v [v 
* ] 
GE 3e gei yu yu 
Y F4 
[v^ Y 
@ wow zw zw 
» 
y viv 
xy 
xy 
© wowo Fw Fw 
y [viv 
» 
xy 
xy [v |v 
o w mí zw rw 
vy[v F4 
d v 
x 
xv Lv r4 
ey [v F4 




















10. Find a minimal expression for each of these Karnaugh maps.


@ 


[] 


© 


xy 
xy 















































wowo zw iw 
v 
dell Eira 
viv 
r4 
zw £M Fw Iw 
A 
cd FG 
r4 
viv v 
aww Fw zw 
Fi F4 
MÀ 
l'4 Y 
ELEL 























11. Find a minimal expression for these expressions: 
(a) xy tayet ely? cx yat 
(b) ayz ry? xy ze ye yz 
(c) ayzxyz bxycexyreryrayc 
40) xyzexy coy yea yz a yz x 
12. For each column (a), (b), and (c) in the truth table below,
give a Karnaugh map for the corresponding sum-of-products
expression, and simplify the expression. To
save space, the 16 possible values of the four variables x,
y, z, and w are presented in two tables, each with eight
rows.

13. Find a minimal expression for each of the following:
(a) xyexydxy 
(b) cexyzexyz 
( yz 
4) y zc + xzw xy 











N o6 64 6 ..9. (9 x- y—cr- a Qr 9 (9 
0 0 0 of o o o 1 0 0 of o 1 1 
te. 0) Ses o 0 1 0 0 1 1 1 o 
dpa — 1e 0 1 1 Eh rode dog X 1 1 
baud: 71, ud o 0 hr G6 Se 1 o o 
LI id 1 1 11 1 0 Of o 1 o 
pea g dt 1 o ri 3 6 31 1 o 1 
oe S de 1 1 feq wg A. 1 o 
chew d 1 1 ] r4 Few 0 1 1 


Truth Table for Exercise 12. 


Chapter 3 Summary 





3.1 Set Definitions and Operations


Terms and concepts


• You should recognize the notation for frequently used
sets of numbers, including:
- N (natural numbers), Z (integers), Q (rational numbers),
and R (real numbers).
- R+, R−, Q+, Q−, Z+, Z−

• You should know what it means to be an element of a set
(written x ∈ A), and what it means to be a subset of a set
(written A ⊆ B).

• You should recognize the empty set, whether denoted by
{ } or ∅, and you should recognize the use of U to denote
the universal set.

• You should understand the notation used for various
set operations: A ∩ B for intersection, A ∪ B for union,
A − B for difference, A' for complement. You should
know that disjoint sets are sets whose intersection is the
empty set.

• You should be familiar with important properties of set
operations: commutative, associative, distributive.

• You should be able to use and interpret the notation
n(A) used to indicate the number of elements in the finite
set A.


Describing and working with sets 


• You should be able to identify elements of the standard
sets N, Z, and so on.

• You should understand that two sets are equal when each
is a subset of the other.

• You should be aware that sets may contain elements that
are not numbers. For example, we may work with sets of
strings, sets of students, or sets of ordered pairs.

• You should be able to work with sets described using
set-builder notation, as in these examples:
- {x ∈ N : x = 2k for some k ∈ N}, using property
descriptions
- {2k : k ∈ N}, using form descriptions

• You should be able to determine whether or not one set
is a subset of another set. The given sets may be described
in a variety of ways, including set-builder notation.





Set operations and set size


• You should be able to build new sets from existing sets
using various combinations of the set operations intersection,
union, difference, and complement.

• You should be able to use Venn diagrams to illustrate
and investigate properties of the set operations.

• You should be able to determine the size of a set described
in various ways, including those described using
set-builder notation.

• You should be able to apply the Inclusion-Exclusion
Principle, for two and three sets, to determine the size
of the union of sets.





3.2 More Operations on Sets 


Terms and concepts


• You should recall the notion of ordered pair from high
school algebra. For example, (3, 5) is read as "the ordered
pair 3, 5."

• You should recognize the notation A × B ("A cross B")
for the Cartesian product of A and B, and the special
notation A² used to indicate A × A.

• You should be able to work with the generalization of
these concepts:
- the n-tuple (x1, x2, ..., xn)
- the Cartesian product of n sets, written as S1 × S2 × ... × Sn
- the use of the shorthand S^n

• You should recognize the use of P(A) to denote the
"power set of the set A."

• You should know what is meant by a partition of a set,
and understand the use of the phrase "a part of S" in this
context.


Describing and working with Cartesian products 


• Given two or more sets, you should be able to calculate,
and answer questions about, their Cartesian product.

• You should be able to give the Cartesian graph of small
sets of ordered pairs, or of sets of ordered pairs described
using algebraic equations.

• You should be able to relate Cartesian products to earlier
concepts (for example, the truth tables of Chapter 1).


Describing and working with sets of sets


• You should be able to answer questions about sets whose
elements are also sets.

• You should be able to calculate the complete power set
for small sets, and to answer questions about the power
set for any given set.

• For a given set, you should be able to find partitions of
that set satisfying given conditions.

• Given a set of subsets, you should be able to determine if
it represents a partition; and if not, identify which property
or properties of a partition are not satisfied.





Set operations and set size 


• You should be able to combine the operations of this
section (Cartesian product, power set) with those of the
previous section (intersection, etc.).





size 


• You should be able to calculate the size of the Cartesian
product of sets, and of the power set of a set.

• You should be able to combine these two calculations
with set size calculations from the previous section of
the text.

• You should understand, for small sets, the relationship
between the size of the power set and the size of the
Cartesian product {0, 1}^n.





3.3 Proving Set Properties 
Terms and concepts


• You should recognize that an element-wise proof of a set
property involves choosing an arbitrary element of one
set and showing that it must be an element of some other
set.

• For an expression involving the set operations ∩, ∪, ∅,
U, and ', you should know what is meant by the dual
expression, and you should be able to write that dual
expression.

• You should know and be able to apply the duality principle
(if a property is true for sets, so is its dual property).

• You should recognize set properties that are similar
to properties of numbers: commutative, associative,
distributive, identity, double negative.

• You should also realize that sets have other properties
numbers do not satisfy: negation, idempotent, DeMorgan's
laws, universal bound, absorption, complements of
U and ∅, complement and negation.


Element-wise proofs 


• You should be able to give element-wise proofs to show
that one set is a subset of another:

   - For particular sets, for example, {10k : k ∈ N} ⊆ {k ∈ Z : k is even}.

   - For set properties, for example, A ∩ B ⊆ A.

   - For set properties that are true with additional
     hypotheses, for example, "If (A ∪ B) ⊆ B, then A ⊆ (A ∩ B)."

• You should know (and be able to apply) that we prove
two sets are equal by proving that each is a subset of the
other.

• You should know how to prove that a set is empty.





Proofs using previously established properties


• You should be able to use the properties in Theorem 6
to prove additional properties of sets.

• For each property you prove, you should be able to state
the dual property that is also true.


3.4 Boolean Algebra 


Terms and concepts 


• You should recognize the connections among sets, logical
connectives, and Boolean algebra, and be able to translate
expressions and properties written in one of these
forms to either of the other forms.

• You should recognize properties of a Boolean algebra,
and their relationship to similar properties for sets
and for logical connectives: commutative, associative,
distributive, identity, negation, double negative, idempotent,
DeMorgan's laws, universal bound, absorption, complements
of 1 and 0, complement and negation.

• For an expression involving the Boolean algebra operations
·, +, 0, 1, and ′, you should know what is meant by
the dual expression, and you should be able to write that
dual expression.

• You should know and be able to apply the duality principle
(if a given property is true, so is its dual property).














Proofs


• You should be able to use the properties of a Boolean
algebra to prove additional properties.

• For each property you prove, you should be able to state
the dual property that is also true.







3.5 Excursion: Logic Circuits 


Terms and concepts


• You should understand, and be able to draw, the standard
logic gates that make up circuits: the and gate, the
or gate, and the not gate (or inverter).

• You should understand the terminology "sum-of-products"
as it applies to Boolean expressions for circuits.

• You should know that Karnaugh maps are graphical tools
to assist in simplifying sum-of-products expressions.





Circuits, Boolean algebra expressions, and truth tables


• Given a circuit, you should be able to:

   - Give the corresponding Boolean algebra expression
     that describes the output for that circuit.

   - Give a truth table for the circuit.

• Given a truth table that represents the desired output for
a circuit, you should be able to write a "sum-of-products"
expression that yields that output.

• Given a Boolean algebra expression, you should be able
to draw the corresponding circuit.

• For a given circuit, you should be able to use Boolean
algebra properties to simplify the expression that
corresponds to the circuit.














Karnaugh maps 


• For 2, 3, or 4 variables, you should be able to create an
empty Karnaugh map diagram.

• For any sum-of-products expression, you should be able
to check the appropriate boxes in the Karnaugh map.

• For any Karnaugh map with boxes checked, you should
be able to locate a minimal number of maximum-size
rectangles that cover the checked boxes, and use this
information to write a minimal sum-of-products expression
for the map.

• You should be able to combine these steps, using Karnaugh
maps to simplify arbitrary sum-of-products expressions.




















Functions and Relations 


Everyone has experience with mathematical functions from their high school 
days, but that experience is often limited to studying functions that model 
physical motion or time-dependent data (like parabolas, lines, or trigonomet- 
ric functions). Discrete math makes use of some of these functions as well, 
but we also have a second goal regarding functions. We would like to learn 
something about functions as abstract mathematical objects. In this pursuit, 
we will encounter a broader class of mathematical objects, called binary rela- 
tions. Binary relations will not only provide us with a fresh perspective on the 
nature of functions, they will also be important for applications in this chapter 
as well as later in the book. 

A distinguishing feature of discrete mathematics is that it deals primarily 
with integers. In the previous chapter, we saw this feature in that most of our 
examples and applications focused on either finite sets (i.e., sets whose size is
a natural number) or the set of natural numbers itself. In this same vein, our 
functions and relations will primarily involve finite sets or the set of natural 
numbers. In the ultimate twist of irony, we will see that functions and relations 
can actually be thought of as sets, so much of the material from the previous 
chapter is relevant here once again. 

The general study of functions and relations typically includes the notions 
of “inverse” and “composition,” so we will stress these operations initially. In
order to make connections with the previous and subsequent chapters, we 
will then study the relationship between certain properties of a function and 
the sizes of the sets on which that function is defined, and then investigate the 
notion of equivalence relations. A key concept throughout these early sections 








is the "arrow diagram" of a function. Aside from its visual appeal, this structure
is important because it will show up later as the notion of a directed graph 
in Chapter 7. The next section will introduce (or reinforce) specific numerical 
functions that arise frequently in discrete mathematics, including the floor, 
the ceiling, and the logarithm. The chapter ends with two excursions that can 
be tackled independently of one another. Both excursions deal with recursive 
thinking, some proofs by induction, and the numerical functions that have 
been used in the earlier sections. 


4.1 Definitions, Diagrams, and Inverses 





We begin our discussion with functions, where some familiar examples will help us 
flesh out the notation and terminology we will be using. 


Notation and Terminology of Functions 





Definition The notation f : A → B is used for a function, simply called
f, with a set of inputs A (called the domain), and a set B (called the codomain) 
that includes all the outputs. The function f associates with each input in A 
one and only one output in B. 











We use the notation f : A → B and say, "f is a function from A to B." If the rule
associates to the element a of the domain the element b of the codomain, we write
f(a) = b, and say, "f maps a to b," "the value of f at a is b," or "f of a equals b."








Example 1 Suppose f : N → N is defined by the rule f(x) = 2x + 1. We
can think of this as meaning, "Given an input x ∈ N, f maps x to the output value
2x + 1 ∈ N." Is every element of the codomain an output of one and only one input
into the function?

SOLUTION No, there are codomain elements like 0 ∈ N that are not the
value of f at any input value a ∈ N; that is, there is no a ∈ N for which 2a + 1 =
0. This does not affect the fact that f is a function. □





Example 2 Suppose f : Z → Z is defined by the rule f(x) = x². We think of
this as meaning, "Given an input x ∈ Z, the value of f at x is x² ∈ Z." Is every element
of the codomain an output of one and only one input into the function?





SOLUTION No, there are codomain elements like 1 ∈ Z for which there are
two input values a ∈ Z with a² = 1. In particular, both a = 1 and a = −1 satisfy
this requirement. This does not affect the fact that f is a function. □
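
The question asked in Examples 1 and 2 can also be explored numerically by counting, for each codomain element in a small window, how many inputs map to it. The following Python sketch is only an illustration under stated assumptions: the helper name `preimage_counts` and the finite windows are choices made here (N and Z are infinite, so only part of each can be checked), and N is taken to include 0, as the solution to Example 1 does.

```python
from collections import Counter

def preimage_counts(rule, inputs, outputs):
    """Count how many of the given inputs map to each listed output."""
    hits = Counter(rule(x) for x in inputs)
    return {y: hits[y] for y in outputs}

# Example 1: f(x) = 2x + 1 on a finite window of N (assumed to include 0).
odd = preimage_counts(lambda x: 2 * x + 1, range(0, 10), range(0, 6))

# Example 2: f(x) = x^2 on a finite window of Z.
square = preimage_counts(lambda x: x * x, range(-5, 6), range(0, 6))

print(odd)     # outputs 0, 2, 4 have no input; 1, 3, 5 have exactly one
print(square)  # outputs 1 and 4 have two inputs; 2, 3, 5 have none
```

A count of 0 or 2 for some output does not stop the rule from being a function; the definition only restricts what happens at each input.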











Table 4-1 Truth table for Example 5

p   q   p and q
T   T   T
T   F   F
F   T   F
F   F   F








People are most familiar with functions that serve to describe some computation 
to be performed on a number to get a new number. However, we use other types of 
functions all the time without really being aware of it. 


Example 3


1. When you look up a phone number in a directory, you are using a function whose 
inputs are names and whose outputs are phone numbers. 

2. When you type a document in word processing software, your computer uses a 
function whose inputs are keystrokes and whose outputs are symbols displayed on 
your computer screen. 

3. When we add numbers, we are really using a function whose inputs are pairs of 
numbers and whose outputs are the numbers resulting from the sum of the pair. 


We have also encountered functions in this book starting from the very first
pages.


Example 4 The actions on the packet of four cards in the magic trick in
Example 1 of Section 1.1 are each a function that takes a packet of cards as input and
returns a rearranged packet of cards as output. In that example, we refer to the cards
using the letters C, S, H, D. For this example, we will not allow the cards to be flipped.
Hence, HCDS refers to the following packet of cards, from bottom to top: heart, club,
diamond, spade. Explore the cutting action as a function.

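SOLUTION Let κ denote the action of cutting a single card from top to
bottom. Then we can list the output for each of the 24 different packets of
cards that we take as inputs:

κ(HCDS) = SHCD,  κ(SHCD) = DSHC,  κ(DSHC) = CDSH,  κ(CDSH) = HCDS
κ(HCSD) = DHCS,  κ(DHCS) = SDHC,  κ(SDHC) = CSDH,  κ(CSDH) = HCSD
κ(HSCD) = DHSC,  κ(DHSC) = CDHS,  κ(CDHS) = SCDH,  κ(SCDH) = HSCD
κ(HSDC) = CHSD,  κ(CHSD) = DCHS,  κ(DCHS) = SDCH,  κ(SDCH) = HSDC
κ(HDCS) = SHDC,  κ(SHDC) = CSHD,  κ(CSHD) = DCSH,  κ(DCSH) = HDCS
κ(HDSC) = CHDS,  κ(CHDS) = SCHD,  κ(SCHD) = DSCH,  κ(DSCH) = HDSC
□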
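
As a small illustration, the cutting action can be written as a Python function on strings. This is a sketch, not part of the original trick: the name `cut` and the string encoding (bottom card first, top card last) are assumptions chosen to match the packet notation of Example 4.

```python
from itertools import permutations

def cut(packet):
    """Move the top card (last letter) to the bottom (first letter)."""
    return packet[-1] + packet[:-1]

# All 24 orderings of the four cards serve as the domain.
packets = ["".join(p) for p in permutations("HCDS")]
table = {p: cut(p) for p in packets}

print(table["HCDS"])                 # SHCD
print(cut(cut(cut(cut("HCDS")))))    # HCDS
```

Four successive cuts return any four-card packet to its starting order, which is why the 24 packets fall into six cycles of four.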





Example 5 The truth tables from Section 1.3 are functions that take as input
the truth values of the variables and return the truth value of a compound expression
as the single output. What are the domain and codomain of the function given by the
truth table shown in Table 4-1?

SOLUTION The inputs are actually pairs of truth values. The set of inputs
can be represented as the Cartesian product

{T, F} × {T, F} = {(T, T), (T, F), (F, T), (F, F)}

and the outputs are simply truth values from the set {T, F}. The given table
explains the rule for this function perfectly well, but if we would like to state
it in words, we could say, "To the input value (x, y), associate as output the
truth value of the statement 'p and q,' where p has truth value x and q has
truth value y." □
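
The same domain and codomain can be spelled out in code. The sketch below assumes Python booleans stand in for T and F and that the table's rule is the conjunction "p and q" described in the solution; the names `domain`, `codomain`, and `conj` are illustrative only.

```python
from itertools import product

# The domain is the Cartesian product {T, F} x {T, F}.
domain = list(product([True, False], repeat=2))
codomain = {True, False}

def conj(pair):
    """Rule: send (x, y) to the truth value of 'p and q'."""
    x, y = pair
    return x and y

table = {pair: conj(pair) for pair in domain}
for (x, y), value in table.items():
    print(x, y, value)
```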


In summary, to completely describe a function, we must do four things: 


1. Give the function a name. f, g, and h are popular names for functions, but it's 
always okay to be creative and descriptive. 

2. Describe the domain. 

3. Describe the codomain. 

4. Describe the rule.


There are many ways to describe the rule for a given function. In some of 
the previous examples, we gave formulas describing an arithmetic operation to be 
performed to find the output to associate with each input. In others, we gave a 
table that showed every possible input with its associated output. To illustrate the
connection between these different descriptions, we will look closely at a single 
function whose rule is described in several different ways. 





Example 6 Give several alternative ways to describe the rule for the following
function:

• Name: f
• Domain: {1, 2, 3, 4, 5}
• Codomain: N
• Rule: To each number in the domain, associate the square of the number.


SOLUTION The original description of the rule is in the form of a simple
English sentence. The following are equivalent descriptions of this same rule:

1. We can use the sentence above to describe the rule.

2. We can use an algebraic formula to describe the rule: f(x) = x².

3. We can describe the rule by listing all the combinations of inputs and outputs,
as ordered pairs: f consists of the ordered pairs of numbers (1, 1), (2, 4),
(3, 9), (4, 16), (5, 25).

4. Table 4-2 lists the inputs and outputs in table form. □
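
The descriptions listed in this solution can be mirrored in a few lines of Python. This is a minimal sketch; the variable names are illustrative assumptions, with a set of tuples standing in for the ordered pairs and a dictionary standing in for the table.

```python
domain = [1, 2, 3, 4, 5]

def f(x):
    """Algebraic formula: f(x) = x^2."""
    return x ** 2

pairs = {(x, f(x)) for x in domain}   # the rule as a set of ordered pairs
table = {x: f(x) for x in domain}     # the rule as an input/output table

print(sorted(pairs))
print(table)
```

All three forms carry the same information: the items of `table` are exactly the elements of `pairs`.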


The table representation is especially nice when the domain of the function is
small. We can make this representation even more visual by using a picture to show
the mapping of inputs to outputs. In Figure 4-1, we use an arrow from each input value
to its associated output value. We will refer to this kind of picture as an arrow diagram.

Table 4-2 Table for Example 6

x      1   2   3   4    5
f(x)   1   4   9   16   25

Figure 4-1 Arrow diagram for Example 6.

Notice that in Example 6, the codomain is not a finite set, so we can only show
part of that set in our diagram. When both the domain and codomain are small
enough, we will show the entire diagram.


Example 7 Show the arrow diagrams for the following functions:

1. Let f be the function with domain {a, b, c} and codomain {1, 2, 3} defined by the
set of ordered pairs {(a, 2), (b, 3), (c, 1)}.

2. Let S = {a, b, c}, and consider the function n : P(S) → {0, 1, 2, 3}, where n(A) is
the number of elements in the set A.

SOLUTION The diagrams are shown in Figure 4-2. In the arrow diagram,
each function has exactly one arrow beginning at each point in the domain. □
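
For readers who like to experiment, the function n from part 2 can be tabulated directly. The sketch below assumes a hypothetical helper `power_set` built on `itertools.combinations`, with frozensets standing in for the subsets of S.

```python
from itertools import combinations

def power_set(s):
    """Return all subsets of s as frozensets (hypothetical helper)."""
    items = sorted(s)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

S = {"a", "b", "c"}
n = {A: len(A) for A in power_set(S)}   # the rule n(A) = number of elements of A

for A, size in n.items():
    print(sorted(A), "->", size)
```

Each of the eight subsets appears exactly once as a key, which matches the requirement that every domain element has exactly one arrow leaving it.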


Practice Problem 1 For the set X = {a, b, c}, define the function C : P(X) →
P(X) with the rule C(A) = X − A. (Recall that X − A is the set of elements from
X that are not also in A.) Complete the arrow diagram for this function shown in
Figure 4-3.


Binary Relations 


So far we have defined functions as rules acting on the elements of specified sets.
We have seen that there are several ways of describing the rule for a function, but
one of them warrants a closer look. When we describe a rule by listing input-output
pairs, we are actually adopting a mathematically sound point of view from which a
function is just another kind of set. We will see that even when a rule fails to describe a
function, it can still have important mathematical properties and applications worthy
of our attention.

Figure 4-2 Arrow diagrams for Example 7 (arrow diagram for f; arrow diagram for n).

Figure 4-3 Diagram for Practice Problem 1.

In general, a set of ordered pairs is a very simple structure for storing information 
in a way that captures a basic relationship between data. This type of structure is 
called a binary relation, and it will provide a generalization of our notion of function.
Formally, a binary relation is nothing more than a subset of the Cartesian product 
A x B for specified sets A and B. 











Definition A binary relation R consists of three components: a domain A,
a codomain B, and a subset of A × B called the "rule" for the relation. We
will often omit the adjective "binary" since we will not address other kinds of
relations in this book.














Example 8 The following are examples of binary relations.

1. Relation R₁
• Domain: The set S of all students at your college this semester.
• Codomain: The set C of all classes offered at your college this semester.
• Rule: (x, y) is in R₁ if student x is enrolled in class y this semester.

2. Relation R₂
• Domain: The set A = {1, 2, 3, 4, 5, 6}.
• Codomain: The same set A.
• Rule: (x, y) is in R₂ if x − y is an even integer.

3. Relation R₃
• Domain: The set N of natural numbers.
• Codomain: The set Z of integers.
• Rule: R₃ = {(x, y) ∈ N × Z : x = y²}, or equivalently, R₃ = {(y², y) : y ∈ Z}.

4. Relation R₄
• Domain: The set Z of integers.
• Codomain: The set N of natural numbers.
• Rule: R₄ = {(x, y) ∈ Z × N : y = x²}, or equivalently, R₄ = {(x, x²) : x ∈ Z}.





To shorten these descriptions, we will sometimes refer to R as a relation between
A and B when we mean that the domain of R is A and the codomain of R is B.
Moreover, if the domain and codomain are the same set A, we will simply say that R
is a relation on A. In Example 8, for instance, we can say that R₃ is a relation between
N and Z and R₂ is a relation on {1, 2, 3, 4, 5, 6}.

Since a rule is typically described as a set of ordered pairs, it is easy to produce
arrow diagrams (or at least partial ones) for any binary relation in the same way
we produced them for functions. When we want to further emphasize the arrow
diagram or the grammatical structure of a binary relation, there is an alternative
notation called infix notation with which we write "a R b" (and read "a is R-related
to b" or just "a is related to b" when the relation is made clear from context) instead
of "(a, b) ∈ R." The next example illustrates when this notation makes sense.


Example 9 Draw an arrow diagram for the relation L defined by

• Domain: A = {1, 2, 3, 4, 5}
• Codomain: B = {2, 3, 5, 7}
• Rule: L = {(1, 2), (1, 3), (1, 5), (1, 7), (2, 3), (2, 5), (2, 7), (3, 5),
(3, 7), (4, 5), (4, 7), (5, 7)}. We can use set-builder notation to write

L = {(x, y) ∈ A × B : x < y}

or we can use infix notation to write

1 L 2, 1 L 3, 1 L 5, 1 L 7, 2 L 3, 2 L 5, 2 L 7, 3 L 5, 3 L 7, 4 L 5, 4 L 7, 5 L 7

The latter list of infix pairs clearly reflects the list of statements

1 < 2, 1 < 3, 1 < 5, 1 < 7, 2 < 3, 2 < 5, 2 < 7, 3 < 5, 3 < 7, 4 < 5, 4 < 7, 5 < 7

that defines the relation L.

Figure 4-4 Diagram for relation L.

SOLUTION The arrow diagram, shown in Figure 4-4, contains an arrow from
a to b exactly when (a, b) ∈ L, or equivalently, when a L b is true. □
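
The set-builder description of L translates almost word for word into a Python set comprehension. The sketch below is illustrative only; the helper `related` is a hypothetical name for the infix reading a L b.

```python
A = {1, 2, 3, 4, 5}
B = {2, 3, 5, 7}

# Set-builder form: all pairs of A x B with first entry less than second.
L = {(x, y) for x in A for y in B if x < y}

def related(a, b):
    """Infix reading: a L b holds exactly when (a, b) is in L."""
    return (a, b) in L

print(sorted(L))
print(related(4, 5), related(5, 5))
```

The comprehension produces the same 12 pairs that the rule lists explicitly.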
When the domain and codomain of a relation are the same set, we can simplify
the arrow diagram by using just one set of nodes to represent both domain and
codomain. We will call this the one-set arrow diagram and use the phrase two-set
arrow diagram to refer to diagrams like those in the previous examples.


Example 10 Draw the one-set arrow diagram for relation R₂ from Example 8.

SOLUTION To keep the arrow diagram relatively uncluttered, we will often
use "double arrows" to indicate edges in both directions between a pair of
nodes. The result is shown in Figure 4-5.

Figure 4-5 Diagram for Example 10 (relation R₂).


Practice Problem 2 Draw the one-set arrow diagram for the relation R described
below:

• Domain: A = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10}.
• Codomain: The same set A.
• Rule: R = {(1, 2), (2, 4), (3, 6), (4, 8), (5, 10)}. Alternate ways to describe the rule
for R include

x R y if and only if y = 2x

R = {(x, 2x) : x ∈ {1, 2, 3, 4, 5}}

Draw part of the arrow diagrams for relations R₃ and R₄ from Example 8 as well.


When is a Relation a Function? 


We saw at the beginning of this section that one way of describing the rule for a
function is to specify a set of ordered pairs. Although not just any set of ordered 
pairs meets the defining criteria for a function, we can rewrite our definition to 
reflect that a function is simply a special kind of binary relation. 





Definition A function F from A to B is a binary relation with domain A
and codomain B with the property that for every x ∈ A, there is exactly one
element y ∈ B for which (x, y) ∈ F.


Example 11 Which of the relations in Example 8 are actually functions?

SOLUTION We go through the list of relations to explain why each one is or
is not a function.

1. Relation R₁ has just one problem: most students take more than one class
each semester.

2. Relation R₂ includes pairs (1, 3) and (1, 5), violating the definition of a
function.

3. Relation R₃ includes pairs (1, 1) and (1, −1), violating the definition of a
function. There are also domain elements (like 2) that are not the first
coordinate of any ordered pair in R₃.

4. Relation R₄ is a function. □


We see that in practice, to be a function, a relation must satisfy two conditions:
(1) Every element of the domain must have something associated to it, and (2) no
element of the domain can have two things associated to it.
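
The two conditions can be tested mechanically when the domain is finite. The following sketch assumes a relation is given as a domain, a codomain, and a set of ordered pairs; the function name `is_function` and the relation `doubling` are hypothetical, introduced only for illustration.

```python
def is_function(domain, codomain, rule):
    """Check that every domain element has exactly one partner in rule."""
    if not all(x in domain and y in codomain for x, y in rule):
        return False                  # rule is not a subset of domain x codomain
    for x in domain:
        partners = {y for (a, y) in rule if a == x}
        if len(partners) != 1:        # no partner, or more than one
            return False
    return True

A = {1, 2, 3, 4, 5, 6}
R2 = {(x, y) for x in A for y in A if (x - y) % 2 == 0}   # x - y is even
doubling = {(x, 2 * x) for x in A}

print(is_function(A, A, R2))                        # False
print(is_function(A, set(range(1, 13)), doubling))  # True
```

The relation with even differences fails condition (2) because, for instance, 1 is paired with 1, 3, and 5.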





Practice Problem 3 Which of the following relations are actually functions? For 
each relation that is not a function, give a specific way in which it violates the definition 
of a function. 


(a) Let A be the set of letters in the alphabet and B be the set of people in your math 
class, and let R be the relation from A to B with the rule “a R P if and only if the 
first name of person P begins with letter a."

(b) Let R be the relation on R given by the rule "x R y if and only if x- y — 1." 








(c) Let A = {…, 16, 25}, and define the relation R from A to N with the rule
(x, n) ∈ R if n² = x.

(d) Let R be the relation whose domain is {1, 2, 3, 4, 5}, whose codomain is N, and
whose rule is given by R = {(1, 12), (2, 4), (3, 4), (2, 9), (5, 25)}.





The arrow diagram of a relation gives us a more visual way to determine whether
the relation is a function.


Example 12 Show the arrow diagram for each of the following relations.
Which are functions? How can you tell from the arrow diagram whether a relation is
a function?

1. Let f be the relation with domain {a, b, c} and codomain {1, 2, 3} defined by the set
of ordered pairs {(a, 2), (b, 3), (c, 1)}.

2. Let V be the relation from {1, 2, 3, 4, 5} to {a, e, i, o, u} defined so that (x, a) ∈ V if
a is a vowel in the English word for the number x.

SOLUTION In its arrow diagram, a function will have exactly one arrow
beginning at each point in the domain. In the diagram for f shown in Figure
4-6, each input has exactly one arrow beginning at that input, so f is a function.
In the diagram for V, three different inputs (1, 4, and 5) have more than one
arrow, which means they are mapped to more than one output value, so V is
not a function. □
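
The relation V can also be built from the English words themselves. The sketch below is an illustration under the assumption that the number words are spelled one, two, three, four, five; the names `words` and `arrows_out` are hypothetical.

```python
words = {1: "one", 2: "two", 3: "three", 4: "four", 5: "five"}
vowels = set("aeiou")

# (x, v) is in V when v is a vowel appearing in the word for x.
V = {(x, ch) for x, word in words.items() for ch in word if ch in vowels}

# For each input, list the vowels its arrows point to.
arrows_out = {x: sorted(v for (a, v) in V if a == x) for x in words}
not_single = [x for x, outs in arrows_out.items() if len(outs) != 1]

print(arrows_out)
print(not_single)   # [1, 4, 5]
```

The inputs reported in `not_single` are exactly the ones with more than one arrow in the diagram for V.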





Practice Problem 4 Using sets A = {1, 2, 3, 4} and B = {1, 2, 3}, draw arrow
diagrams for each of the following:

(a) A function from A to B.

(b) A relation between A and B that is not a function because some domain element
is not associated with any codomain element.

(c) A relation between A and B that is not a function because some domain elements
are associated with more than one codomain element.


Inverse Relations 


Figure 4-6 Diagram for Example 12.

In practice, a rule for a function is typically interpreted as an "action" on domain
elements that results in codomain elements, while a rule for a relation is viewed
simply as a relationship between domain elements and codomain elements that does
not "favor" one type over the other. For example, the relation R = {(x, 2x) : x ∈ Z}
on Z can be thought of as the action of doubling each domain element to find the
associated codomain element. On the other hand, the following relations R and S
describe very similar relationships between men and women:


• Relation R
Domain: The set M of all males on the planet.
Codomain: The set F of females on the planet.
Rule: (x, y) is in R if y is the sister of x.

• Relation S
Domain: The set F of females on the planet.
Codomain: The set M of all males on the planet.
Rule: (x, y) is in S if x is the sister of y.


The only difference between R and S is the interchanged roles of domain and
codomain sets and the order of the elements in the ordered pairs. Specifically, given
two people a ∈ M and b ∈ F,

(a, b) ∈ R if and only if (b, a) ∈ S.

If we call the pair (b, a) the reverse of the pair (a, b), then we can describe S as the
set of all reversed pairs from relation R. In this case, we will say that the relation R
is the inverse of relation S.





Definition Given a relation R with domain A and codomain B, the relation
R⁻¹ (read "R inverse") with domain B and codomain A is called the inverse of
R and is defined so that

(x, y) ∈ R if and only if (y, x) ∈ R⁻¹.

Note that this definition also tells us that the inverse of R⁻¹ is R, so we could
simply say that these two relations are inverses of each other.


Example 13 Recall the following from Example 8:

• Relation R₃
Domain: The set N of natural numbers.
Codomain: The set Z of integers.
Rule: R₃ = {(x, y) ∈ N × Z : x = y²}, or equivalently, R₃ = {(y², y) : y ∈ Z}.

• Relation R₄
Domain: The set Z of integers.
Codomain: The set N of natural numbers.
Rule: R₄ = {(x, y) ∈ Z × N : y = x²}, or equivalently, R₄ = {(x, x²) : x ∈ Z}.

Show that R₃ and R₄ are inverses of one another.





SOLUTION The domain of R₃ is the codomain of R₄ and vice versa, so we
only have to check the relationship between the rules for the two relations.
Consider a pair (x, y) ∈ R₃. This means that x = y², which in turn means that
the reversed pair (y, x) is in R₄. Similarly, every pair in R₄ has its reverse in R₃.
Therefore, R₃ and R₄ are inverses of one another. □
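
On a finite window of integers, the inverse relationship between the two rules can be checked by reversing pairs. This sketch is not a proof: `window` is an assumed finite stand-in for Z, and `inverse` is a hypothetical helper name.

```python
def inverse(rule):
    """Reverse every ordered pair of a relation's rule."""
    return {(y, x) for (x, y) in rule}

window = range(-4, 5)                 # finite stand-in for Z

R3 = {(y * y, y) for y in window}     # pairs (x, y) with x = y^2
R4 = {(x, x * x) for x in window}     # pairs (x, y) with y = x^2

print(inverse(R3) == R4)
print(inverse(inverse(R3)) == R3)
```

The second check reflects the remark in the definition that the inverse of an inverse is the original relation.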





The arrow diagrams of relations that are inverses of one another have a very
simple relationship with each other. Since the arrow diagram of a relation R consists
of an arrow from a to b whenever the pair (a, b) ∈ R, it follows that to find the
arrow diagram for the inverse of a relation, one must simply reverse the arrows in
the diagram for the original relation.


Example 14 Draw the arrow diagram for the inverse of the relation E shown
in Figure 4-7.

SOLUTION The relation E can be best described as follows:

• Domain: The set A = {1, 2, 3}.
• Codomain: P(A), the power set of A.
• Rule: x E y if and only if x ∈ y.

This means that the inverse E⁻¹ can be described as follows:

• Domain: P(A), the power set of A.
• Codomain: The set A = {1, 2, 3}.
• Rule: The reverse of the rule for E. Namely, "(x, y) ∈ E⁻¹ if and only if y ∈ x."

The arrow diagram is shown in Figure 4-8. □


Practice Problem 5 Draw the arrow diagrams for the inverses of the relations R 
from Practice Problem 2 and L from Example 9. 


Figure 4-7 Arrow diagram for relation E.

Figure 4-8 Arrow diagram for relation E⁻¹.

In the special case where the relation has the same domain as codomain, the
relationship between the arrow diagrams of the relation and its inverse appears even
stronger.


Example 15 For the relation R described by the arrow diagram in Figure 4-9
on the left, draw the arrow diagram for R⁻¹.

• Domain: The set S = {A, B, C, D, E, F} of men's basketball teams in a certain
college athletic conference.
• Codomain: The same set S.
• Rule: (x, y) ∈ R (or equivalently, "x R y") means "x beat y this year," where R is
given by the arrow diagram below on the left.

SOLUTION The arrow diagram for R⁻¹ is shown in Figure 4-9 on the right. □


Inverse Functions 


We have already noted that when considering a relation that is actually a function, 
we can think of it as performing an operation or action on domain elements to 
create an element from the codomain. Under this interpretation, we can think of an 
inverse function as "undoing the action" of the original function. It is very natural 
when you think about it: Once you have a function to perform some operation or 
action, it is inevitable that you will need to create another function to "undo" the 
operation or action of the first function. This process has been a theme throughout
your mathematical life.














• Once you learn to add, it is inevitable that you will have to subtract. For example,
"adding 3" and "subtracting 3" are inverses of each other.

• Once you learn to multiply, it is inevitable that you will have to divide. For example,
"doubling" and "halving" are inverses of each other.

• Once you learn to set up problems involving x² in algebra, you soon thereafter
need to compute square roots to solve these problems. "Finding the square" and
"taking the square root" are inverse functions.

Figure 4-9 Arrow diagrams for Example 15.

Table 4-3 Table for Example 16


So we have seen inverse functions at many different times in our lives, perhaps 
without ever noticing the recurring theme. Let's see what inverses mean within the 
context of our function notation so that we can work toward a formal definition of 
inverse functions. 


Example 16 Let f : Z → Z be a function with the rule f(x) = x + 3, and let
g : Z → Z be a function with the rule g(y) = y − 3. Compare the actions of f and g
for several input values.

SOLUTION We present these results in Table 4-3. We can see that g(b) = a
exactly when f(a) = b for every a and b mentioned in the tables, but this is far
from a formal proof. □


This is hardly surprising, perhaps to the point that it seems like a waste of time. 
Indeed, it is much more efficient to show symbolically that any choice of values from 
Z must be inserted into the table in this way. 


Proposition 1 Let f : Z → Z be a function with the rule f(x) = x + 3, and let
g : Z → Z be a function with the rule g(y) = y − 3. Then for all a ∈ Z and for all
b ∈ Z, f(a) = b if and only if g(b) = a.





Note: The definition of inverse functions involves the biconditional "if and only if,"
so we need to actually write two proofs: one to prove (i) "if f(a) = b, then g(b) = a,"
and a second to prove (ii) "if g(b) = a, then f(a) = b." Exercise 19 explores what
can go wrong if we don't include both proofs.


PROOF

Claim (i) For all a, b ∈ Z, if f(a) = b, then g(b) = a.
Proof. Let a ∈ Z and b ∈ Z be given such that f(a) = b. That is, a + 3 = b.
From this it follows that a = b − 3, and hence a = g(b).

Claim (ii) For all a, b ∈ Z, if g(b) = a, then f(a) = b.
Proof. Let a ∈ Z and b ∈ Z be given such that g(b) = a. That is, b − 3 = a.
From this it follows that b = a + 3, and hence b = f(a). □
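
A finite spot-check of Proposition 1 is easy to run, though it is no substitute for the proof. The sketch assumes a sample `window` of integers chosen here for illustration.

```python
def f(x):
    return x + 3

def g(y):
    return y - 3

window = range(-10, 11)   # finite sample of Z

# f(a) = b should hold exactly when g(b) = a, for every pair in the sample.
agree = all((f(a) == b) == (g(b) == a) for a in window for b in window)
print(agree)   # True
```

Comparing the two truth values with `==` tests the biconditional, so a failure in either direction would make `agree` false.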














Practice Problem 6 Let f : R → R be a function with the rule f(x) = 2x, and
let g : R → R be a function with the rule g(y) = y/2. Complete Table 4-4 to show the
actions of f and g for several input values, then write a formal proof that f(a) = b if
and only if g(b) = a for all a, b ∈ R.

Table 4-4 Table for Practice Problem 6





From these examples and observations, we can make the following formal 
definition: 





Definition Function f : A → B and function g : B → A are inverses of
each other if f(a) = b if and only if g(b) = a for all a ∈ A and b ∈ B. In this
case, we refer to f as the inverse of g and to g as the inverse of f, and
we will often use the notation f⁻¹ to mean the function that is the inverse of
function f.

















The formal definition gives us a strategy for finding the inverse for a given 
function if we are given the function rule in terms of an algebraic relationship. 


Example 17 Find a function g that is the inverse of the function f : Q → Q
with rule f(x) = 3x —2. 
SOLUTION According to the definition, we must have g : Q → Q such that
f(a) = b if and only if g(b) = a,
so suppose we have a, b € Q satisfying f(a) = b. This means that ła — 2 =b, 
from which it follows (solving for a) that a = $b +5. Hence, we can let g 
have the rule g(y) = $y +5. and it can be shown that f and g are inverses 
of each other. We could say this more concisely by simply writing, “ f~ (y) = 
i56 n 











It is even easier to see the relationship between inverse functions in the context
of arrow diagrams.


Example 18 Which of the functions f₁, f₂, or f₃ is the inverse of the function
g shown in Figure 4-10?

SOLUTION In terms of arrow diagrams, in order for f and g to be inverses
of each other, we must have an arrow in f pointing from x to y whenever there
is an arrow in g pointing from y to x, and vice versa. In other words, if f and
g are inverses, then the arrow diagram for f can be obtained from the arrow
diagram for g by simply reversing the arrows, and vice versa. From this point
of view, it is easy to see that function f₂ is the inverse of g. That is, f₂ = g⁻¹. □

Figure 4-10 Arrow diagrams for Example 18.

Figure 4-11 Arrow diagram for Example 19.

Figure 4-12 Solution to Example 19 (reversing the arrows).





Practice Problem 7 Draw arrow diagrams for the inverse of functions f1 and f3
in Example 18.


We can see from our examples that when a function's rule is described as an
algebraic formula, then the inverse function's rule might be derived using algebra,
and when a function is presented by an arrow diagram, then the inverse function
can be found by reversing the arrows. Unfortunately, this is an oversimplification.
Remember that a function is a special kind of relation, and there is no reason why
reversing the rule for a function will necessarily produce another function, as the
following example illustrates.


Example 19 Explain why there is no function whose inverse is the function
g : {a, b, c, d} → {1, 2, 3, 4} represented by the diagram in Figure 4-11.


SOLUTION It is easy enough to reverse the arrows as in Figure 4-12, but will
the result be the arrow diagram of a function?

The answer is "no" for two reasons. First, since no arrow in g points to
4, when the arrows are reversed, there is no arrow starting at 4. Second, since
two arrows in g point to 3, when we reverse the arrows, there are two arrows
starting at 3. Remember that the one requirement for a function is that for every
element of the domain there is exactly one element of the codomain associated
with it. Therefore, there is no function with domain {1, 2, 3, 4} and codomain
{a, b, c, d} that has the (reversed) arrow diagram above.





Summary 


A binary relation is simply any set of ordered pairs, each with its first element from
the domain A and its second element from the codomain B. In the language of sets,
this means that a binary relation is just a subset of A × B. In this context, a function
is a binary relation with an extra restriction: For each domain element x, a
function has exactly one pair with x as its first coordinate. The inverse of a binary
relation can always be obtained by reversing the pairs in the relation, but
a side effect of this is that the inverse of a relation that is a function can
be a relation that is not a function. We will study this specific issue later in this
chapter.
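
The summary can be made concrete with sets of ordered pairs. This is a minimal sketch: the helper names `inverse` and `is_function` are ours, and the particular arrows chosen for g are hypothetical, picked only to match the description in Example 19 (no arrow points to 4, and two arrows point to 3).

```python
def inverse(relation):
    """Reverse every ordered pair in the relation."""
    return {(y, x) for (x, y) in relation}

def is_function(relation, domain):
    """True when each domain element is a first coordinate exactly once."""
    firsts = [x for (x, _) in relation]
    return all(firsts.count(x) == 1 for x in domain)

# Hypothetical arrows for g: nothing points to 4, two arrows point to 3.
g = {("a", 1), ("b", 3), ("c", 3), ("d", 2)}

print(is_function(g, {"a", "b", "c", "d"}))     # True
print(is_function(inverse(g), {1, 2, 3, 4}))    # False
```

The second check fails for both reasons given in Example 19: 4 is never a first coordinate of the reversed relation, and 3 is a first coordinate twice.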














Solutions to Practice Problems


1 The arrow diagram is given in Figure 4-13.

2 The three diagrams are shown in Figure 4-14. Only the third rule is a function for sure.

3 (a) It is possible, but very unlikely, that this is a function. Many letters (like probably x or q) will not be
the first initial of anyone in your class, and there are most likely letters that may be associated with more
than one person with that first initial.

(b) This is "almost" a function. The problem is that this rule does not associate anything with the number 0.
The same rule for a relation on the set R - {0} is a function.

(c) The rule is equivalently described by
    R = {(1,1), (4,2), (9,3), (16,4), (25,5)},
which satisfies the criteria to be a function from A to N.

(d) There are two problems: the domain element 2 is associated with two codomain values (4 and 9), and
the domain element 4 has no associated codomain value.

4 There are many possible solutions. In Figure 4-15, we give one possible solution for each part.

5 The two arrow diagrams are shown in Figure 4-16.

6 Table 4-5 shows the completed table, so we only need to write the proof.
Claim For all a, b ∈ R, f(a) = b if and only if g(b) = a.
Proof Let a, b ∈ R be given such that f(a) = b. This means 2a = b, from which it follows that a = b/2, and
hence a = g(b), as desired.
Now let a, b ∈ R be given such that g(b) = a. This means b/2 = a, from which it follows that b = 2a, and
hence b = f(a), as desired.

7 The diagrams in Figure 4-17 show the inverse of f1 and the inverse of f3.


Figure 4-14 Solution to Practice Problem 2.








Figure 4-16 Solution to Practice Problem 5.


Figure 4-17 Solution to Practice Problem 7.


Exercises for Section 4.1


1. For each of the following functions, draw a portion of 
the arrow diagram showing at least five elements of the 
domain and codomain: 

(a) f : N → N with f(n) = 2n

(b) g:N— Z with f(m) = 13 3m 

(9) [:N— Qwith f(x) =2- 2: 
(D g: 8 Quithg(z) = 2+ sir 

2. Sometimes in the description of a function, a rule is
given but the domain is not clearly specified. That leaves
it up to the reader to guess what was intended. For each
of the following rules, provide a sensible domain and











Problem 7. 





W gt) 311 VE 
1e) f(a) = 2T 


@ (= 


(©) gi) = a 

3. Let f : P({1,2,3}) → P({1,2,3}) be the function with
the rule f(A) =A — DI. Complete the two-set arrow 
diagram shown in Figure 4-18, 

4. In the previous exercise, draw a single-set arrow dia- 
gram for the function f. 

5. Let g : P({1,2,3,4}) → P({1,2,3,4}) be the function
with the rule g(A) = A - {4}. Complete the single-set
arrow diagram in Figure 4-19.











Table 4-5 Solution for Practice Problem 6





Figure 4-18 Diagram for Exercise 3. (Arrow diagram for f.)


Figure 4-19 Diagram for Exercise 5. (Arrow diagram for g.)


6. Draw the complete arrow diagram for the following function n:

• Domain: The set P({a, b, c, d}).
• Codomain: The set {0, 1, 2, 3, 4}.
• Rule: To each input A (a subset of {a, b, c, d}), associate the size of the set, and denote this number by
n(A). For example, n({b, d}) = 2.

7. Draw a partial arrow diagram for the following function σ. (This is the Greek letter "sigma.")

• Domain: The set Z+ of positive integers.
• Codomain: The set Z of all integers.
• Rule: To each input n in Z+, associate the sum of the positive numbers that evenly divide into
n, and denote this number by σ(n). For example, here are some of the infinitely many ordered
pairs that describe the function: (6, 12), (8, 15), (20, 42).













8. Draw the two-set arrow diagram for each relation R
described below:

(a) Domain: The set A = {0, 1, 2, 3, 4, 5, 6}. Codomain:
The same set A. Rule: (x, y) ∈ R means that the
number x + y leaves a remainder of 0 or 4 when
divided by 7.

(b) Domain: The set 11.23. 
Codomain: The set A= (11.12.15, 
Rule: (x, y) € R means that y < x° < 2y. 

(c) Domain: The power set, P({1, 2, 3}). Codomain:
The set B = {0, 1, 2, 3, 4, 5, 6, 7}. Rule: (S, n) ∈ R
means that n is the sum of the elements in S.

9. Draw the two-set arrow diagram for each relation R
described below:

(a) Domain: The set {1, 2, 3, ..., 10}. Codomain: The
set {a, e, i, o, u}. Rule: (n, v) ∈ R means that vowel
v appears in the Spanish word for the number n.

(b) Domain: The set A = {0, 1, 2, 3}. Codomain: The
set B of all possible strings one can make using
three characters, each of which is a 0 or a 1. (B =
{101, 001, 100, 111, ...} is called the set of binary
strings of length 3.) Rule: (n, s) ∈ R if n is the
number of 1's in s.

(c) Domain: The power set, P({1, 2, 4, 8}). Codomain:
The set B = {0, 1, 2, ..., 14, 15}. Rule: (S, n) ∈ R
means that n is the sum of the elements in S.

10. Decide whether each of the following relations is a function.
If it is not a function, explain why not.

(a) Domain and codomain are {1, 2, 3, 4, 5}. The
rule is given by this set of ordered pairs:
{(1,5), (2,3), (3,3), (4,2), (5,1)}.

(b) Domain and codomain are [1.2.3.4,5]. The 
rule is given by this set of ordered pairs: 
1(1.5)..3). 8.3) (1.2). (4. D). 

(c) Domain: Q. Codomain: Z. Rule: (r, z) ∈ R means
that the rational number r can be written as a fraction
with numerator z.








.9.10]. 
19.20}. 











11. Decide whether each of the following relations is a function.
If it is not a function, explain why not.

(a) Domain: The set of all finite strings consisting of
a's and b's, with at least one of each. Codomain: Z.
Rule: (s, z) ∈ R means that z is the number of a's
minus the number of b's in the string s. For example,
(bbaabab, -1) ∈ R.

(b) Domain: N. Codomain: The set of all binary strings.
Rule: (n, s) ∈ R means that string s begins with n
consecutive 0's.

(c) Domain: Z. Codomain: The set M of all people who
were alive at midnight, December 31, 1999. Rule:
(z, p) ∈ R means that person p is z years old at midnight,
December 31, 1999.

(d) Domain: The set M of all people who were alive at
midnight, December 31, 1999. Codomain: The same
set M. Rule: (x, y) ∈ R means that persons x and y
are siblings.

12. Which of the arrow diagrams in Figure 4-20 represents
a function whose domain and codomain are the set
{1, 2, 3, 4, 5, 6}?

13. Which of the arrow diagrams in Figure 4-21, when the arrows
are reversed, represents a function whose domain
and codomain are the set {1, 2, 3, 4, 5, 6}?

14. Let S = {1, 2, 3}, and define a function c : P(S) → P(S)
by c(A) = S - A. (That is, c maps a set A to the complement
of A relative to the universe S.)

(a) Fill in Table 4-6 indicating the function values.

(b) Draw the arrow diagram for c.

(c) If c is invertible, give a formula for the inverse function.

15. For the function f : Q → Q with the rule f(x) = 2x + 6,
which of the following is the rule for the function
g : Q → Q that is the inverse of f?

(9 e) =2y-6 























Figure 4-20 Diagrams for Exercise 12.











Figure 4-21 Diagrams for Exercise 13.
© g0)=hy- a biconditional statement. This exercise explores why 
@20)=! Kn both proofs are required. Let f : Q^ — R be given by 
16. Fill in the missing steps to complete the following proof Lt beg ul rl and let g: R — Q* be given by 
F that the function f : Q — Q with rule f(x) - 3x9 a 2 " 
anite faction iQ EQ (a) Prove that for all a € Q^" and b€ R, if f(a) = b. 
inverses of each other. then g(b) =a. 
Claim (i) For all a. b € Q. if f(a) = 6, then g(b) =a. b) Give a counterexample to the statement “For all 
Proof. Let a € and be be a € Q? and be R. if (b) =a, then f(a) =b." 
given such that f(a) That is, b. (©) Explain in plain English why f and g are not in- 
From this it follows that. „ and hence a = g(b). verses of each other. 
Claim (ii) For all a, b € Q, if g(b) = a. then f(a) = 20. Prove that the function f : (1, 5c) — (1, oc) defined by 
KE In ie gave the rule f(x) = 22 is its own inverse. 
gren such that wt is, —__=@ 91 Draw ram for the inverse of 
" Rice E een cm iz ss highly pur be an 
For each of the following functions wit in a Su 
codomain Q, find the rule for the inverse function f? 7 preneur forthe inverse cach relation 
ee ens 23. For cach relation R described below, draw the one-set 
Oer- arrow diagrams for R and R^ 
A9. fee (a) Domain and codomain: The set A= 
© fe) i-i 10.1.2.3,4,5.6]. Rule: (x, y) € R means that the 
(4 f(x)=xt+5 number x + y leaves a remainder of 0 or 4 on divi- 
18. For each of the following functions with domain and sion by 7. 
codomain R, find the rule for the inverse function s. (b) Domain and  codomaim The set A= 
Prove your answer is correct in each case. 10,1, 2,. 10]. Rule: (x, y) € R means that the 





(a) f:R— R with f(x) = 2s — 4 forall x € R. 
(b) f:R— R with f(x) = 3x + S forall x €R. 
(e) f:R— R with f(x) = 3x — $ forall x eR. 





(d) f:R— R with f(x) = YE- 1 forall x €R. 
19. We have stated that when proving a pair of functions 
are inverses of one another, we must prove two parts of 





number x? — vis divisible by 11. 
(c) Domain and codomaim The st A= 
10.1.2... 10}. Rule: (x. y) € Rmeans that? < y. 
24, For each relation R described below, draw the one-set 
arrow diagrams for R and R 





Table 4-6 Table for Exercise 14







a) Domain and codomain: The set {1,2,3,...,12). (2,3,4,5). Codomain 

















Rule: (x, y) € R means that the English word for 
x has one fewer letters than the English word for y. Re = ((2,2).(3,5),(5,6)) 
b) Domain and codomain: P([1,2,3]), the power 
set of (1,2,3). Rule: (x,y) €R means that (d) Domain = (1,2, 3,4]. Codomain = (5,6, 7). 
¥Sy. 
€) Domain and codomain: The set (1.2.3)... 12). Ra = 1.5) (2.5). (3:6). (4.7)) 
Rule: (x, y) € R means that x has more positive fac- 
tors than y does 5 26. For each of the relations in Exercise 25, use the arrow 
jive arrow diagrams for the following relations, each diagrams to decide if the relation is a function. 
iven as a set of ordered pairs: 27. In your own words, explain how you can tell from a list 
») Domain 8,9]. Codomain 





of ordered pairs whether the relation with that list as its 
(8.1.9.2) rule is a function 

SIR 28, Explain how you can tell from a list of ordered pairs 

icol whether the inverse of the relation with that list as its 
Ry = (0,6) 2.7) G,8) rule is a function. 


R, 





14,5). (5, 
b) Domain = (1,2, 3}. Codomain 








4.2 The Composition Operation 





The idea of composition of functions and relations is a very basic one. Mathematics is
largely about how complex concepts, structures, and properties can be built logically
out of simpler ones. Since functions and relations are fundamental structures in
mathematics, it stands to reason that combinations of two or more of these structures
could be important.





Composition of Functions 


Before being swept away by a formal definition, let us consider an example of how
composition naturally arises in the English language. One rule that relates pairs of
people is the "husband of" relation. Another rule of this type is the "mother of"
relation. These relations can be combined to give two distinct meanings:

• The relation "mother of the husband of" associates a woman with her mother-in-law.

• The relation "husband of the mother of" associates any person with his or her
father or stepfather.


The preposition "of" naturally ties together English clauses in the same way that
composition ties together mathematical functions, as we see in our formal definition.








Definition If f : A → B and g : B → C, then we can build a new function
called (g ∘ f) that has domain A and codomain C, and that follows the rule
(g ∘ f)(x) = g(f(x)). We call (g ∘ f), read "g of f," the composition of g
with f.
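
As a rough sketch of the definition in code, composition is a rule that takes two functions and returns a third. The helper name `compose` and the two sample rules ("add one" and "double") are our own illustrations, not taken from the text.

```python
def compose(g, f):
    """Return the function (g o f), whose rule is x -> g(f(x))."""
    return lambda x: g(f(x))

f = lambda x: x + 1   # hypothetical rule: "add one"
g = lambda y: 2 * y   # hypothetical rule: "double"

g_of_f = compose(g, f)   # "the double of one more than"
f_of_g = compose(f, g)   # "one more than the double of"

print(g_of_f(3))   # 8, since g(f(3)) = g(4)
print(f_of_g(3))   # 7, since f(g(3)) = f(6)
```

The two outputs differ, which is the same phenomenon as "mother of the husband of" versus "husband of the mother of": the order of composition matters.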














Figure 4-22 How to read (g ∘ f). (Description order, left to right: "the double of the square root of." Evaluation order, right to left: f first, g second.)


Example 1 Given the function f : R≥0 → R defined by the rule f(x) = √x
and the function g : R → R defined by the rule g(y) = 2·y, describe the domain,
codomain, and rule for the function (g ∘ f).
SOLUTION Since the codomain of f matches the domain of g, the composition
(g ∘ f) : R≥0 → R makes sense. The rule for (g ∘ f) can be stated in a
couple of different ways:

1. Since the rule for f can be described as "computing the square root of" and
the rule for g can be described as "doubling," the rule for (g ∘ f) can be
described as "computing the double of the square root of."

2. Algebraically,

    (g ∘ f)(x) = g(f(x)) = g(√x) = 2·√x,

so we see that (g ∘ f) maps an input x to the output 2·√x.


This example demonstrates a quirk about composing functions that we illustrate
in Figure 4-22. When describing the rule for the function (g ∘ f) in English, it is
natural to describe the action of g on the result of the action of f. In other words,
we would typically mention the functions in the same order (left to right) they are
written in the notation (g ∘ f). However, to actually evaluate a particular value,
say, (g ∘ f)(9), we must first compute f(9) = 3, and then double that result to get
6. This means that to evaluate (g ∘ f)(x), we must use the functions in the reverse
order (right to left) from how they are written. This evaluation order is consistent
with the usual rule in algebra that one works from the innermost parentheses to the
outermost parentheses.
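
The evaluation order can be observed directly. In this small sketch, the square-root and doubling rules are those of Example 1, while the `calls` list is our own bookkeeping device for recording which rule runs first.

```python
import math

calls = []  # records the order in which the two rules are actually applied

def f(x):
    calls.append("f")
    return math.sqrt(x)

def g(y):
    calls.append("g")
    return 2 * y

result = g(f(9))   # written left to right as g, then f ...
print(result)      # 6.0
print(calls)       # ['f', 'g'] ... but evaluated right to left
```

Although g is written first, the inner call f(9) must finish before g has anything to double.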

There is an easy parallel of this situation in our English-language example. When
we describe "the mother of the husband of" relationship, we are not thinking about
any particular person, but as soon as we try to apply this relationship to a real person,
we must think from right to left. For example, to find the mother of the husband of
Melissa, we must first discover that Melissa's husband is Matt, and then find out that
Matt's mother is Michelle.

Here is an example and practice problem that will seem a bit like a puzzle. Playing
"mathematics detective" will give you some practice with the order of evaluation in
the composition of functions.























Example 2 Given the function f : N → Q+ with the rule f(n) = n/(n + 3) and the
function g : Q+ → Q with the rule g(r) = 1/(r + 1), answer the following questions about
the composite function (g ∘ f):

1. What is (g ∘ f)(4)?
2. If f(b) = 7/8, what is b?
3. If f(c) = 9/10, what is (g ∘ f)(c)?
4. If g(f(d)) = 7/13, what is d?


SOLUTION

1. (g ∘ f)(4) = g(f(4)) = g(4/7) and

    g(4/7) = 1/(4/7 + 1) = 7/11.

2. If f(b) = 7/8, solving the equation b/(b + 3) = 7/8 gives us b = 21.
3. If f(c) = 9/10, then (g ∘ f)(c) = g(f(c)) = g(9/10) and

    g(9/10) = 1/(9/10 + 1) = 10/19.

Notice that we never need to know the value of c to find this.
4. If g(f(d)) = 7/13, then this means that 1/(f(d) + 1) = 7/13, which can be solved to
tell us that f(d) = 6/7. This, in turn, leads to the equation d/(d + 3) = 6/7, which
has as its solution d = 18.
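
The "detective work" in Example 2 can be double-checked with exact rational arithmetic. This sketch assumes the rules f(n) = n/(n + 3) and g(r) = 1/(r + 1), which agree with every value worked out in the solution; the brute-force searches over 1 to 99 are our own shortcut in place of solving the equations by hand.

```python
from fractions import Fraction

def f(n):
    # Assumed rule f(n) = n / (n + 3), kept exact with Fraction.
    return Fraction(n, n + 3)

def g(r):
    # Assumed rule g(r) = 1 / (r + 1).
    return 1 / (r + 1)

print(g(f(4)))                 # 7/11
print(g(Fraction(9, 10)))      # 10/19, found without ever knowing c

# Working backward: search a small range for the inputs b and d.
b = next(n for n in range(1, 100) if f(n) == Fraction(7, 8))
d = next(n for n in range(1, 100) if g(f(n)) == Fraction(7, 13))
print(b, d)                    # 21 18
```

The backward searches mirror the order of evaluation: to recover d we must first undo g and only then undo f.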


Practice Problem 1 Given the function g : N → N with the rule g(z) = 2z + 1
and the function f : Z → Z with the rule f(m) = 5m - 7, fill in the missing values in
Table 4-7.


So we see that when our functions have a context, either from the real world or 
from specified mathematical operations, the composition of functions is a perfectly 
natural thing. When the functions are merely abstract objects, we rely on tools like 
our arrow diagrams to help us form the proper mental picture of composition. 


Example 3 Consider the functions g : {a, b, c, d} → {1, 2} and
h : {1, 2} → {X, Y, Z}. Draw the arrow diagram of the composition (h ∘ g) :
{a, b, c, d} → {X, Y, Z}.
SOLUTION The diagram for (h ∘ g) : {a, b, c, d} → {X, Y, Z} is found by
thinking about the "evaluation order" of the function. That is, we take each
input in the oval for {a, b, c, d}, trace its g arrow to the "intermediate" set
{1, 2}, and then continue along the h arrow from that element of {1, 2} into the
final output set shown by the oval containing {X, Y, Z}. (It often helps to see
the two diagrams merged together as in Figure 4-24 to show them sharing the
intermediate set {1, 2}.) For example, we can trace one such input like this:

[Diagram: a single input followed along its g arrow and then along the h arrow.]

Figure 4-25 shows the complete arrow diagram for (h ∘ g). We do not show
the intermediate set {1, 2} in this diagram since that set is neither the domain
nor the codomain of the new function (h ∘ g).


Table 4-7 Table for Practice Problem 1

Figure 4-24 Another look at the diagrams for Example 3.

Figure 4-25 Solution for Example 3.


Example 4 Consider the functions f and g with the set {1, 2, 3, 4} as domain
and codomain, and whose rules are given by the arrow diagrams in Figure 4-26. (The
"loop" at 2 represents an arrow that starts at 2 and also ends at 2; this means that
f(2) = 2.) Make a table to illustrate the rule for the function g ∘ f.
SOLUTION To calculate (g ∘ f)(1), we start at 1 in the f diagram, and follow
the arrow to f(1) = 3. Then in the g diagram we start at 3 and follow the arrow
to g(3). Thus, (g ∘ f)(1) = g(f(1)) = g(3). We can find all the outputs
of g ∘ f in this way.

Figure 4-26 Diagrams for Example 4.
© (g o (0) = af) =t 
9 (go NO =s( FO) 
* (c DG) =8(FG)) - (0) =4 
© (g o A=) = 8) =2 










Practice Problem 2 For the functions f and g in Example 4, give arrow diagrams
for g ∘ f, f ∘ g, and f ∘ f.


Inverse Functions Revisited 


The composition operation on functions gives us another way to express the inverse 
relationship between functions. To understand this new perspective on inverse func- 
tions, we need to first understand the concept of an identity function. 





Definition For a given set A, the identity function* on A is the function
ι_A : A → A with the rule ι_A(x) = x for all x ∈ A. If the set A is clear from
the context of a problem, we will often omit A and just use ι. Note that ι_A =
{(x, x) : x ∈ A} for when we wish to consider ι_A as a binary relation.








In plain terms, ι is a function that performs no action at all, or equivalently, it
is a relation that only associates each element with itself. Our first notion of inverse
functions was built on the idea of one function undoing the action of the other.
Another way to say this is that the successive actions of the two functions, in the end,
do nothing at all. Let's revisit the pair of functions from Example 17 from Section
4.1 to see this in action.


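Example 5 Let f : Q → Q be the function with the rule f(x) = (2/5)x - 2, and
let g : Q → Q be the function with the rule g(y) = (5/2)y + 5. Verify that g ∘ f and
f ∘ g are both the identity function on Q.


SOLUTION Let x ∈ Q be given. Then

    (g ∘ f)(x) = g(f(x)) = g((2/5)x - 2)
               = (5/2)((2/5)x - 2) + 5
               = x - 5 + 5 = x

and

    (f ∘ g)(x) = f(g(x)) = f((5/2)x + 5)
               = (2/5)((5/2)x + 5) - 2
               = x + 2 - 2 = x.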
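
The two computations in Example 5 can be spot-checked on sample inputs. The sketch below assumes the rules f(x) = (2/5)x - 2 and g(y) = (5/2)y + 5 and uses exact fractions; the list `samples` is an arbitrary choice of rationals, so this illustrates the claim rather than proving it.

```python
from fractions import Fraction

f = lambda x: Fraction(2, 5) * x - 2   # assumed rule for f
g = lambda y: Fraction(5, 2) * y + 5   # assumed rule for g

# An arbitrary finite sample of rational inputs.
samples = [Fraction(n, d) for n in range(-5, 6) for d in range(1, 4)]

print(all(g(f(x)) == x for x in samples))   # True: g o f acts as the identity
print(all(f(g(y)) == y for y in samples))   # True: f o g acts as the identity
```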


It might seem silly to check both f ∘ g and g ∘ f, but it is possible for one of
them to be equal to the identity function but not the other.


* The traditional symbol for the identity function is ι, the lowercase Greek letter "iota."













Example 6 Let A = {1, 2, 3, 4} and consider the functions f : A → (A × A)
with the rule f(a) = (a, a) and g : (A × A) → A with the rule g(x, y) = x. Show that
g ∘ f = ι_A. Explain why f ∘ g ≠ ι_{A×A}.
SOLUTION
Claim g ∘ f = ι_A.
PROOF Note that (g ∘ f) : A → A, so (g ∘ f) and ι_A have the same domain
and codomain. We must only check then that their rules agree. Let a ∈ A be
given. We can evaluate

    (g ∘ f)(a) = g(f(a)) = g(a, a) = a

to see that g ∘ f agrees with the rule for ι_A for every element of their domains.
Hence, these two functions are the same.

To see why f ∘ g ≠ ι_{A×A}, we can simply consider the example (1, 2) ∈
A × A, and evaluate

    (f ∘ g)(1, 2) = f(g(1, 2)) = f(1) = (1, 1).

Hence, (f ∘ g)(1, 2) ≠ (1, 2), while ι_{A×A}(1, 2) = (1, 2), so the rules for these
functions do not agree.
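
Because A is finite, both halves of Example 6 can be checked exhaustively. This is a small sketch with our own variable names; pairs are represented as Python tuples, so g takes a single tuple argument.

```python
from itertools import product

A = [1, 2, 3, 4]
f = lambda a: (a, a)        # f : A -> A x A
g = lambda pair: pair[0]    # g : A x A -> A, with g(x, y) = x

# g o f returns every element of A unchanged.
print(all(g(f(a)) == a for a in A))     # True

# f o g changes every pair whose two coordinates differ.
moved = [p for p in product(A, A) if f(g(p)) != p]
print((1, 2) in moved, len(moved))      # True 12
```

Of the 16 pairs in A × A, only the 4 pairs of the form (a, a) survive f ∘ g unchanged, so a single counterexample such as (1, 2) is far from the only one.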

















We are ready for the formal statement of this equivalent way to express the 
inverse relationship between functions. The proof of the theorem is given in the 
form of “fill-in-the-blank” exercises at the end of this section. 





Theorem 1 Functions f : A → B and g : B → A are inverses of each other if
and only if f ∘ g = ι_B and g ∘ f = ι_A.





PROOF See Exercises 11 and 12. " 


Under this interpretation, we finally see the sense in using the notation f^{-1} to
denote the inverse of function f. At least we can understand it by analogy to the more
familiar use of the -1 exponent in arithmetic. Think about the parallels between the
following statements:

• 2^{-1} is the inverse of 2 under the multiplication operation ×. This means that

    2 × 2^{-1} = 2^{-1} × 2 = 1.

• f^{-1} is the inverse of f under the composition operation ∘. This means that

    f ∘ f^{-1} = f^{-1} ∘ f = ι.





Practice Problem 3. Use Theorem 1 to show that f : Q -» Q with the rule f(x) = 
:Q — Q with the rule g(y) = 2:2!* are inverses of each other. 





Composition of Binary Relations 


Binary relations can be combined to create new relations in much the same way that 
we combined functions earlier in this section. Sometimes the actual meaning of the 
new relation is somewhat elusive, so our examples and exercises will emphasize the 
interpretation of composition of relations. 





Figure 4-27 Diagrams for Example 7. (Panels: relation R and relation S.)


Example 7 Let R be the binary relation with domain A = {1, 2, 3, 4, 5} and
codomain B = {a, e, i, o, u} defined by the rule

    R = {(x, y) : y is a vowel in the English word for x}.

Let S be the binary relation with domain B = {a, e, i, o, u} and codomain
C = {math, English, history} defined by the rule

    S = {(x, y) : x is a vowel in the word y}.

Use an analogy to composition of functions to draw an arrow diagram for the
composition S ∘ R.
SOLUTION Figure 4-27 shows the arrow diagrams for the relations R and S.
By following the arrows from the set A to the set B and then on to the set
C, we can form the diagram for the new relation (S ∘ R) with domain A and
codomain C as shown in Figure 4-28.





Figure 4-28 Solution to Example 7. (Relation S ∘ R.)


Interpreting the composition of relations can sometimes be tricky and sometimes
confusing. In the previous example, the rule can be described as "(x, y) ∈ (S ∘ R) if
the English word for the number x has a common vowel with the word y." This leads
to our formal definition of the composition of relations.








Definition Given the relation R1 with domain A and codomain B and
the relation R2 with domain B and codomain C, we define the new relation
(R2 ∘ R1) with domain A, codomain C, and the rule "(a, c) ∈ (R2 ∘ R1) if there
is an element b ∈ B such that (a, b) ∈ R1 and (b, c) ∈ R2."

This new relation is called the composition of R2 with R1, and we read the
notation (R2 ∘ R1) as "R2 of R1."
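
The formal definition translates almost word for word into a set comprehension. In this sketch the helper name `compose_relations` and the dictionary `number_words` are ours; the relations R and S are built from the rules stated in Example 7, and the resulting pairs are computed from those rules rather than read off the book's figure.

```python
def compose_relations(r2, r1):
    """(a, c) is in r2 o r1 when some b has (a, b) in r1 and (b, c) in r2."""
    return {(a, c) for (a, b) in r1 for (b2, c) in r2 if b == b2}

VOWELS = set("aeiou")
number_words = {1: "one", 2: "two", 3: "three", 4: "four", 5: "five"}
subjects = ["math", "English", "history"]

# R relates a number to each vowel in its English word.
R = {(n, v) for n, word in number_words.items() for v in VOWELS if v in word}
# S relates a vowel to each word in C that contains it (ignoring case).
S = {(v, s) for s in subjects for v in VOWELS if v in s.lower()}

S_of_R = compose_relations(S, R)
print(sorted(S_of_R))
```

Under these rules no number from 1 to 5 is related to "math," since none of the five number words contains the letter a.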











When the relations express "real relationships," their composition has a natural
interpretation.


Example 8 Let S be the relation "is a sibling of" and C be the relation "is a
child of," where both relations have the set of all people as domain and codomain.
That is, we will have (x, y) ∈ S if x is a sibling of y, and we will have (a, b) ∈ C if a is
a child of b. What is the natural interpretation of the relations (S ∘ C) and (C ∘ S)?

SOLUTION The best approach is to use the formal definition of these compositions:

• According to the formal definition, (a, c) ∈ (S ∘ C) if there is a person b
such that (a, b) ∈ C and (b, c) ∈ S. In words, (a, c) ∈ (S ∘ C) if a is a child
of a person b who is a sibling of c; that is, a is a child of a sibling of c.

• According to the formal definition, (x, z) ∈ (C ∘ S) if there is a person y
such that (x, y) ∈ S and (y, z) ∈ C. In words, (x, z) ∈ (C ∘ S) if x is a sibling
of a person y who is a child of z; that is, x is a sibling of a child of z.


The two relations (S ∘ C) and (C ∘ S) are definitely different. The relation
(S ∘ C) expresses (part of) the niece/nephew relationship, while (C ∘ S) is not
very different from the relation C itself.








Practice Problem 4 For the relations described below, draw an arrow diagram
for (DistributedBy ∘ Stocks), and give a natural interpretation of this relation:


• Domain of Stocks: The set A of grocery stores in town, A = {Big Food, Food King,
Leviathan, Big Market, Super Food, Grocelor}.

• Codomain of Stocks: The set B of brands of cereal,
B = {Krunchies, Prime-O's, Great Mornin', Quotidians}.
• Rule for Stocks: (a, b) ∈ Stocks if store a stocks cereal brand b, as given by the
arrow diagram on the left in Figure 4-29.
• Domain of DistributedBy: The set B of brands of cereal,
B = {Krunchies, Prime-O's, Great Mornin', Quotidians}.
© Codomain of DistributedBy: The set C of distribution companies, C 
Trucks, National Distributing, Interstate Amalgamated, Allied Foods}. 


• Rule for DistributedBy: (b, c) ∈ DistributedBy if cereal brand b is distributed by company c, as
given by the arrow diagram on the right in Figure 4-29.


Toms 














Figure 4-29 Diagrams for Practice Problem 4. (Left: arrow diagram for Stocks, from A to B. Right: arrow diagram for DistributedBy, from B to C.)


Table 4-8 Solution for Practice Problem 1


The previous practice problem is an example of an "inventory" problem that
is common in computer science courses on databases. In that context, the composition
of relations is similar to the join operation for relational databases. In fact, the
University of Virginia Computer Science Department uses a movie database* for an
application of "composition of relations" made famous by a parlor game based on
the 1993 movie Six Degrees of Separation.


Example 9 Let A be the set of all actors who have appeared in
a commercially released motion picture, and define the relation R on A by the rule
(x, y) ∈ R if actors x and y appeared in the same film. Answer the following questions
about this relation (KB stands for Kevin Bacon):


1. What does it mean in plain English for (x, y) ∈ R ∘ R?
2. Is R ⊆ (R ∘ R)? Prove your answer is correct.
3. Using your own knowledge of movies or UVA's "Oracle of Bacon," find an actor
X such that (X, KB) ∈ R ∘ R but (X, KB) ∉ R.


The "Kevin Bacon Game" is examined in more detail in Exercise 22.
SOLUTION


1. (x, y) ∈ R ∘ R means that there is an actor z who appeared in a movie with
actor x and appeared in a movie with actor y.

2. Claim: R ⊆ (R ∘ R).

Proof Let (a, b) ∈ R be given. This means that actors a and b appeared
together in a film. It is of course also true that actor b appeared in some film
with him or herself, so trivially (b, b) ∈ R. So there is an actor (namely b)
who has appeared in a film with actor a and in a film with actor b; hence,
(a, b) ∈ (R ∘ R).

3. According to the UVA "Oracle of Bacon," Orson Welles meets the given
criteria. Namely, Orson Welles was in A Safe Place (1971) with Jack Nicholson,
who was later in A Few Good Men (1992) with Kevin Bacon, but
Orson Welles and Kevin Bacon never appeared in a film together. Hence,
(Orson Welles, Kevin Bacon) is in (R ∘ R) but not in R itself.
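
The solution to Example 9 can be replayed on a toy data set. This is only an illustrative sketch: the dictionary `casts` lists just the three actors named in the solution, so the cast lists are deliberately truncated, and the helper `compose_relations` is our own name.

```python
from itertools import product

def compose_relations(r2, r1):
    """(a, c) is in r2 o r1 when some b has (a, b) in r1 and (b, c) in r2."""
    return {(a, c) for (a, b) in r1 for (b2, c) in r2 if b == b2}

# Deliberately truncated casts: only the actors named in the solution.
casts = {
    "A Safe Place (1971)": {"Orson Welles", "Jack Nicholson"},
    "A Few Good Men (1992)": {"Jack Nicholson", "Kevin Bacon"},
}

# (x, y) is in R when x and y share a film; every actor shares a film with himself.
R = {(x, y) for cast in casts.values() for (x, y) in product(cast, cast)}
R_of_R = compose_relations(R, R)

pair = ("Orson Welles", "Kevin Bacon")
print(R <= R_of_R)                    # True, as the claim in part 2 states
print(pair in R_of_R, pair in R)      # True False
```

The subset test succeeds for the same reason given in the proof: each actor is related to himself or herself, so every pair in R can be extended by a trivial second step.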
Solutions to Practice Problems

1 The complete table is given in Table 4-8.

2 The diagrams are given in Figure 4-30.

3 Proof Let x ∈ Q be given. Then
esposan - (7:2) 





* Accessible through http://www.cs.virginia.edu/oracle as of June 2005.








Figure 4-30 Solutions to Practice Problem 2. (Panels: g ∘ f, f ∘ g, f ∘ f.)





Now let y ∈ Q be given. Then


eo) = r (82:9) 
= 


GU ego 








Exercises for Section 4.2 


1. All functions in this problem have the set of real num- 
bers R as their domain and codomain. 





()1f f(x) =2e+1 and g(y)- what is 
(f eg 

(b) IF f(x) = 3r — Zand (f © g)(y) = 12y +7, whatis 
so»? 

(©) If g() 2 2y — 1 and (f sg 1, what is 





fe? 





(y 10) - 10 








Therefore, f and g are inverses of each other.

4 Figure 4-31 shows the arrow diagram for
the relation (DistributedBy ∘ Stocks). (a, c) ∈
(DistributedBy ∘ Stocks) means that there is a brand
of cereal b ∈ B such that store a stocks brand b, which is
distributed by company c. So the natural interpretation
is "stocks something distributed by." This relation
tells us which distributors each store does business with
for its cereal sales.








2. All functions in this problem have the set of real num- 
bers R as their domain and codomain. 








If f()—2r-1 and g()- y —1. what is 
(g c fX? 

(b) If f(x) — 3x — Zand (g o f)(z) = 92 — 9z, what is 
80)? 

(©) If g(y)=2y—1 and (g c f)(z) «47 —1, what is 
fay? 





Figure 4-31 Solution to Practice Problem 4. 








3 Chapter 4 / Functions and Relations 





x [wo f[u [2 [s 





fe) | ufe | 10 




















se) | 4 fs ]4 fo 




















Table 4-9 Table for Exercise 3 


3. Here are two functions f : {1, 2, 3} → {10, 11, 12, 13}
and g : {10, 11, 12, 13} → {4, 5, 6} whose rules are given
in Table 4-9.

(a) What is f(1)? g(11)?

(b) Give arrow diagrams for f and g.

(c) Which of these compositions can be defined: g ∘ f,
f ∘ g, or f ∘ f?

(d) For any of the compositions above that are defined,
give the domain and codomain, and draw the arrow
diagram.

4. Let c : P({x, y, z}) → P({x, y, z}) be the function with
rule c(A) = {x, y, z} − A, and let n : P({x, y, z}) →
{0, 1, 2, 3} be the function such that n(A) is the number
of elements in the set A. Which composition is defined,
c ∘ n or n ∘ c? For the one that is defined, describe the
domain and the codomain, and give an arrow diagram
for the function.

5. Let S = {0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10}. Define f : S → S
by f(n) = the number of letters in the English-language
spelling of the number. For example, f(7) = 5, since the
word "seven" has five letters.

(a) Give an arrow diagram for f. Is f invertible?

(b) Give an arrow diagram for f ∘ f.

(c) Give an arrow diagram for f ∘ f ∘ f.

6. For each of the functions whose arrow diagrams are
shown, provide the missing function's diagram. If it is
impossible to do so, or if there is more than one diagram
that will work, say so.

(a) In Figure 4-32, what is g ∘ f?

(b) In Figure 4-33, what is g?

(c) In Figure 4-34, what is f?

























7. For each of the functions whose arrow diagrams are
shown, provide the missing function's diagram. If it is
impossible to do so, or if there is more than one diagram
that will work, say so.
(a) In Figure 4-35, what is f?
(b) In Figure 4-36, what is g?
(c) In Figure 4-37, what is f?
(d) In Figure 4-38, what is g?

8. For each of the functions given below, describe the missing
function by giving its domain, codomain, and rule
as a set of ordered pairs. If it is impossible to do so, or if
there is more than one function that will work, say so.
(a) If f has domain {1, 2, 3, 4}, codomain {a, b, c, d},
and rule {(1, b), (2, a), (3, d), (4, c)} and g has domain
{a, b, c, d}, codomain {x, y, z}, and rule
{(a, x), (b, y), (c, z), (d, x)}, what is g ∘ f?

(b) If f has domain {1, 2, 3, 4}, codomain {a, b, c, d},
and rule {(1, b), (2, a), (3, d), (4, a)} and g ∘ f has domain
{1, 2, 3, 4}, codomain {A, B, C, D, E}, and rule
{(1, A), (2, C), (3, B), (4, D)}, what is g?

(c) If g has domain {a, b, c, d}, codomain {W, X, Y, Z},
and rule {(a, X), (b, Y), (c, Z), (d, X)} and g ∘ f has domain
{1, 2, 3, 4}, codomain {W, X, Y, Z}, and rule
{(1, W), (2, Z), (3, X), (4, Y)}, what is f?

9. For each of the functions given below, describe the miss- 
ing function by giving its domain, codomain, and rule 
as a set of ordered pairs. If it is impossible to do so, or if 
there is more than one function that will work, say so. 








ger 


Figure 4-32. Diagrams for Problem 6(a). 











42 The Composition Operation 279. 





Figure 4-34 Diagrams for Problem 6(c). 





) 











( 


t 9 
Figure 4-36 Diagrams for Problem 7(b). 








280 Chapter 4 / Functions and Relations 





D 


v | 


X^ 





Figure 4-37. Diagrams for Problem 7(c).. 


(a) If f has domain (1,2,3,4), codomain (a, P, c. d], 
and rule. 
{(1,a), (2,5), (3, d), (4) and g has do- 
main (a,b,c,d), codomain (x,y,z), and rule 
Ka, 2), (b, y), (€, z). (d, x)], what is g 3 f? 

(b) If f has domain (1,2,3,4), codomain (a, b, c, d), 
and rule 
1C, b), (2.a), (3. d). (4, a)) and g o f has domain 
11,2,3,4], codomain (A. B,C, D, E], and rule 
((1, D), QC). (3, E), (4, C)], what is g? 

6) V g has domain (a, b, c, d], codomain (W, X, Y, Z), 
and rule 

(b. W), (e, X), (d, Y)) and g o f has do- 

11, 2,3,4], codomain (W, X, Y. Z), and rule 

(1. W). (2. Z), (3. X). (4. Y)), what is f? 

10. Let B be the set of binary strings* of length 5. Define
f : B → {0, 1, 2, 3, 4, 5}, where f(s) is the number of
1's in the string s. Define g : {0, 1, 2, 3, 4, 5} → B, where
g(n) is the binary string consisting of n 1's followed by
5 − n 0's.
(a) Find f(11011), f(01101), and f(11000). Is f invertible?
If not, why not? (Be specific.)
(b) Find g(0), g(2), and g(4). Is g invertible? If not, why
not? (Be specific.)

















(©) Find (f 2). (f 900). (g 
aio. 

d) Are f and g inverses of each other? 

11. Fill in the blanks in the first part of the proof of 
Theorem 1, 
Clm. A /:A — B and g: B — A are inverses of 
each other, then g o f = i4 and f og = ip 
Proof, Let f: — B and g: B — A be functions. 
that are inverses of each other. From our definition in 
the previous section, this means that 


Jyat0t0), (g © 








Foralla € A and b € B, f(a) = b if and only if 











at = (aay 
We must show both g o f = i4 and fog = un 

Let a € A be given, and set b = f(a). In this case, 
(ofa) eg) g(h); The fact that b 





fa) tells us, by equation (4.1), that g(h) 
Hence, (g © f(a) = lor alla € A, pro 
that (g © f) has the same rule as 14. 

Now let € B be given, and seta = (b). In this case, 
men f(a). The fact that a = g(b) 
tells us, by equation (4.1), that f )zb. 
Hence, (f s g)(b) = forall € B, proving 
that has the same rule as iy. L 























gor 


Figure 4-38 Diagrams for Problem 7(d). 





* A binary string is made up of the characters 0 and 1. The length of the binary string is the number of characters used. For example, the
length of the binary string 00110 is 5.








12. Fill in the blanks in the second part of the proof of 
Theorem 1 
Chim 1/14 — Bandg: B — Asuisfyg o f = i4 
and f og = in, then f and g are inverses of cach other. 
Proof Let f:A — B and g: B — A be functions 
satisfying g c / = i, and f og = in. To show that f 
and g are inverses of each other, we must show that 





Foralla € A and be B, f(a) =b 
i and only if g(b) 








We will prove this biconditional statement using two 
separate proofs, 

Leta € A and b € B be given such that b = f(a). In 
this case, (5) = gC ) = (g © fa). The fact 
that tells us that (g © /)(a) = a. Hence, 














ab) . as desired. 

Nowleta € A and b € B be given such that a = g(b). 
In this case, fi f )- 
(0). The fact that Tells us that 





f(a) 






13. For each relation R on the given set A, compare the arrow
diagram for R with the arrow diagram for R ∘ R.
(a) A = {1, 2, 3, 4, 5, 6} and (x, y) ∈ R means x < y.
(b) A = {1, 2, 3, ..., 11, 12} and (x, y) ∈ R means that
x is a factor of y.
(c) A = {1, 2, 3, ..., 9, 10} and (x, y) ∈ R means that
x + y is divisible by 4.
14. For each relation R described in Exercise 13, describe
the relation R ∘ R in English.





s 
Figure 4-39 Diagrams for Problem 15, 





42 The Composition Operation 281 





N 
wta, 2) i2) 
(1.3) «(3l 
ota) (1.4) 
s(a) i23) 
(2, 4) (2,4) 
(3.4) (8,4) 

p € 

Son 


15. Let R and S be the relations whose arrow diagrams are
shown in Figure 4-39. Draw the arrow diagram for the
relation S ∘ R.

16. Let R and S be the relations shown in Table 4-10. In
each case, an associated codomain element is listed directly
below each domain element. Create a similar table
showing the relation R ∘ S.

interpretation in English of relations R, S, 








17. For the relation R in Example 15 of Section 4.1 (repeated
in Figure 4-40), what is the meaning of R ∘ R?
Draw the arrow diagram for this new relation.
• Domain: The set S = {A, B, C, D, E, F} of men's
basketball teams in a certain college athletic conference.
• Codomain: The same set S.
• Rule: (x, y) ∈ R (or equivalently, “x R y”) means “x
beat y this year,” where R is given by the arrow diagram
below.

18. Let A be the set consisting of the following 12 people:
Andy, Angela, Brian, Chris, Clint, Jennifer, Jessica,
Julie, Katie, Kristina, Luke, and Paula. Given the relations
described below, give English descriptions of the
relations SiblingOf ∘ MarriedTo and MarriedTo ∘ SiblingOf,
and draw their arrow diagrams.
Relation SiblingOf is defined on A such that
(x, y) ∈ SiblingOf means that x and y are siblings, where
• Andy and Katie are siblings.
• Chris, Julie, and Brian are siblings.
• Kristina and Paula are siblings.












































Domain of R | 10010 | 10100 | 0010 | oono | oroo | 10001 | o100: [ oorr | 0001: | 11000 

Codomainof R | 021 | or | ir | 2m | 102 | ox | 120 | 210 | 300 | o3 
+ E + 

Domain ofS | (1.2) | (531 | 4 | ws | $3] ee | es) | eaf es] us 

Codomain ofS || 11000 | 10100 | 10010 | 10001 | or100 | o1010 | oroor | oomo | omor | oooi 


























Table 4-10 Rules for Relations R (above) and S (below) 





282 Chapter 4 / Functions and Relations 





Figure 4-40 Diagram for Problem 17. 


• Luke and Jennifer are siblings.
• Jessica and Angela are siblings.
Relation MarriedTo is defined on A such that (a, b) ∈
MarriedTo means that a is married to b, where
• Andy is married to Jennifer.
• Katie is married to Luke.
• Chris is married to Paula.
• Brian is married to Jessica.
• Kristina is married to Clint, an only child.
• Julie and Angela are single.
19. Let A be the set of all students at your school this
semester, B the set of all sections of all courses offered
at your school this semester, and C the set of all instructors
at your school. If (x, y) ∈ R means that student x
is enrolled in course y, and (a, b) ∈ S means course a is
being taught this semester by instructor b, what is the
meaning of the relation S ∘ R?
20. Let A be the set of all students at your school this
semester, B the set of all towns on Earth, and C the
set of all instructors at your school. If (x, y) ∈ R means
that student x was born in town y, and (a, b) ∈ S ∘ R
means student a was born in the same town as instructor
b, what is the meaning of the relation S?
21. Let A be the set of all production facilities, B the set
of all machine parts, and C the set of all parts distributors.
If (x, y) ∈ S means that part x is available from
distributor y, and (a, b) ∈ S ∘ R means facility a can do
business with distributor b, what is the meaning of the
relation R with domain A and codomain B?
22. (This exercise continues our look at the “Kevin Bacon
Game” in Example 9.) Let A be the set of all people who
have ever appeared in a commercially released movie,
and let the relation R on A be defined by the fact that
(x, y) ∈ R means actors x and y appeared in a film together.
(a) Use the UVA "Oracle of Bacon" Web site to find
a specific pair of actors a and b such that (a, b) ∈
R ∘ (R ∘ R) but (a, b) ∉ (R ∘ R).













(b) Suppose
R ∘ R ∘ R ∘ R = R ∘ R ∘ R.
What would this mean in plain English?
(c) The premise of the “Kevin Bacon Game” is that every
movie actor is related to Kevin Bacon through
a "chain" of no more than six movies. State this
precisely in terms of the relation R, the set A, and
Kevin Bacon (KB).
23. Consider the relation R on the set A = {0, 1, 2, ..., 6}
with the rule that x R y if x − y is divisible by 3. Draw
the one-set arrow diagram for R and R ∘ R, and describe
the latter relation with an English sentence.
24. For each relation whose complete arrow diagrams are
shown in Figure 4-41, draw the arrow diagram for the
new relations R ∘ R and R ∘ (R ∘ R).
25. For each relation R in Figure 4-41, give an arrow diagram
for the relation R ∘ R⁻¹.
26. Consider once again the relations in Example 8 of Section 4.1.
• Relation R₁
Domain: The set S of all students at your college
this semester.
Codomain: The set C of all classes offered at your
college this semester.
Rule: (x, y) is in R₁ if student x is enrolled in class
y this semester.
• Relation R₂
Domain: The set A = {1, 2, 3, 4, 5, 6}.
Codomain: The same set A.
Rule: (x, y) is in R₂ if x − y is an even integer.
• Relation R₃
Domain: The set N of natural numbers.
Codomain: The set Z of integers.














Rul IJEN x Zix— yl or equiva- 
ently, Rs = (0>, y): y € 2). 
© Relation Ry 
Domain: The set Z of integers. 
Codomain: The set N of natural numbers. 
Rule: Ry = (x.y) €N x Z: y — x^], or equiva- 


lently, Re = (s, 2) ix € N). 

(a) Whatis the meaning of the relation Ry © R7? What 
is the meaning of the relation R;* o R1? 

(b) Whatis the meaning of the relation R; 

is the meaning of the relation R;! © 

(©) What is the meaning of the relation Rs 
is the meaning of the relation Ry! © Ry? 

(@) Whatisthe meaning ofthe relation Ra o Ry"? What 
is the meaning of the relation Ry! c Ry? 


1% What 






R5"? What 











43 Properties of Functions and Set Cardinalty 283 


6 5 





(a) Relation A. (b) Relation A 
Figure 4-41 Diagrams for Problem 24. 


27. For the relation R in Exercise 17, what is the meaning
of R ∘ R ∘ R?
28. Investigate the precise relationship between the relations
R ∘ R⁻¹ and R⁻¹ ∘ R for an arbitrary relation R
with domain A and codomain B.
29. Investigate the precise relationship between the relation
R ∘ R⁻¹ and the identity relations i_A and i_B for an
arbitrary relation R with domain A and codomain B.








4.3 Properties of Functions and Set Cardinality 





So far in this chapter, we have discussed the “invertibility” of functions on more than
one occasion. In this section, we will see how this property of a function f : A → B
tells us about the relative size of the domain and codomain, A and B. In order to take
full advantage of this relationship, we will need to write formal proofs about functions.
As usual, before we can write proofs, we must decide on the formal definitions that
both the READER and AUTHOR will agree on.





Invertibility and Other Properties 


We have seen informally that invertibility is the ability to reverse the rule for a function
f : A → B and get a new function g : B → A. In this way, we can think of g
as “undoing” the effects of f. In Section 4.1, we called g the inverse of f, but to
emphasize the connection to the original function f, we will now use the notation
f⁻¹, just as we did with relations.








Definition The function f : A → B is invertible if there is a function f⁻¹ :
B → A such that f(x) = y if and only if f⁻¹(y) = x. The notation f⁻¹ is read
as “f inverse,” and the symmetry of the definition means that (f⁻¹)⁻¹ = f.

















Recall that Theorem 1 of Section 4.2 tells us that the above characterization of
inverse functions is equivalent to saying that f⁻¹(f(x)) = x and f(f⁻¹(y)) = y for
all x ∈ A and all y ∈ B. This characterization is often easier to use in formal proofs,
especially when we are thinking of inverse functions in terms of "opposite" actions.














Function f        Function f⁻¹
Figure 4-42 Diagrams for Example 1.


The following example will allow us to review the basic idea of inverse functions 
while using our new inverse notation. 


Example 1


1. Recall that R^{≥0} = {x ∈ R : x ≥ 0} is the set of nonnegative real numbers. Verify that
the function f : R^{≥0} → R^{≥0} defined by f(x) = x² has as its inverse the function
f⁻¹ : R^{≥0} → R^{≥0} defined by f⁻¹(x) = √x.
2. If f : {a, b, c} → {1, 2, 3} is the function with the rule f(a) = 2, f(b) = 3, f(c) = 1,
then f⁻¹ : {1, 2, 3} → {a, b, c} is the function with the rule f⁻¹(1) = c, f⁻¹(2) = a,
f⁻¹(3) = b. Draw the arrow diagrams for f and f⁻¹, and discuss how they are
related.










SOLUTION 


1. Write g for the square-root function, g(y) = √y. We can verify algebraically
that f(x) = y means x² = y, which, since x and y are both nonnegative,
implies that x = √y, which means x = g(y). The
reverse steps are similarly true, so f(x) = y if and only if x = g(y). That is,
g is the inverse of f.

Using the alternate characterization allows us to approach the issue algebraically.
Specifically, for every x ∈ R^{≥0},

g(f(x)) = g(x²) = √(x²) = x (since x ≥ 0)

and for every y ∈ R^{≥0},

f(g(y)) = f(√y) = (√y)² = y.

2. In this case, to obtain an arrow diagram for f⁻¹, we simply reverse each of
the arrows in the diagram of f, as shown in Figure 4-42.
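
For a finite function such as the one in part 2, reversing the arrows can be carried out mechanically. The following Python sketch stores the arrow diagram as a dictionary (a representation chosen here for illustration, not one used in the text) and then checks the alternate characterization of inverse functions on every element.

```python
# Arrow diagram of f from part 2 of Example 1, stored as input -> output.
f = {"a": 2, "b": 3, "c": 1}

# Reversing every arrow swaps each (input, output) pair.
f_inv = {output: inp for inp, output in f.items()}

# Check the alternate characterization of inverse functions on every element.
undoes_f = all(f_inv[f[x]] == x for x in f)          # f_inv(f(x)) = x
undoes_f_inv = all(f[f_inv[y]] == y for y in f_inv)  # f(f_inv(y)) = y

print(f_inv)
print(undoes_f, undoes_f_inv)
```

The reversed dictionary sends 1 to c, 2 to a, and 3 to b, which agrees with the rule for f⁻¹ stated in the example.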
In addition to making it easy to visualize the inverse of a function, the arrow
diagram makes it easy to see why some functions are not invertible. We first saw this
at the end of Section 4.1 when we explained the two reasons why the function g in
Example 19 (and repeated in Figure 4-43) is not invertible.

Figure 4-43 A noninvertible function (the function g).

The function in this example has two problems that keep it from being invertible.
Mathematicians have defined terms that help them describe these situations.

• The first problem with g is that nothing in the domain maps to 4. This causes the
reversed arrow diagram (shown in Figure 4-44) to have no arrow coming from 4,
which is against the rules for a function. We describe the original function as not
being onto when this happens.








Figure 4-44 The first problem with g (left: the function g; right: reversing the arrows).

Figure 4-45 The second problem with g (left: the function g; right: reversing the arrows).


• The second problem with g is that two elements of the domain map to the same
element 3 in the codomain. This causes the reversed arrow diagram (shown in
Figure 4-45) to have two arrows coming from 3, which is against the rules for
a function. We describe the original function as not being one-to-one when this
happens.


Example 2 Which of the following functions are invertible? For any that are
not invertible, explain why it is not one-to-one or not onto.


1. f : Z → Z given by the rule f(x) = 2x + 3 for all x ∈ Z.
2. g : Z → N defined by the rule


2z ifz<0 
s= I ifz>0 
3. h : N → N defined so that for all n ∈ N, h(n) is the sum of the digits in the (base
ten) numeral n.


SOLUTION Only the second function is invertible. 


1. This function is not onto because, for example, there is no value n ∈ Z such
that 2n + 3 = 0. Hence, 0 is not an output of the function f. You will prove
this function is one-to-one in Exercise 11.

2. This function is invertible. The partial arrow diagram in Figure 4-46 sug- 
gests that the positive integers are mapping to the odd natural numbers 
and the remaining integers are mapping to the even natural numbers. 
This observation is the basis for the proof, which is left for the student in 
Exercise 13. 








Chapter 4 / Functions and Relations 





Figure 4-46 An invertible function. 


3. This function is not one-to-one because, for example, h(13) = h(22). That
is, there are two inputs (13 and 22) that have the same output (4). You will
prove this function is onto in Exercise 12.
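
The two counterexamples in parts 1 and 3 are easy to spot-check numerically. The sketch below is not a proof: the helper name digit_sum and the search window of integers from −1000 to 1000 are arbitrary choices made for this illustration.

```python
def digit_sum(n):
    """Sum of the base-ten digits of a natural number n."""
    return sum(int(d) for d in str(n))

# Part 3: h is not one-to-one, because two different inputs share the output 4.
collision = (digit_sum(13), digit_sum(22))

# Part 1: f(x) = 2x + 3 never outputs 0 on this finite window of integers.
# The underlying reason is that 2x + 3 is always odd.
window = range(-1000, 1001)
zero_preimages = [x for x in window if 2 * x + 3 == 0]
all_outputs_odd = all((2 * x + 3) % 2 == 1 for x in window)

print(collision, zero_preimages, all_outputs_odd)
```

A finite search can only suggest that 0 is never an output; the parity observation in the comment is what turns the suggestion into an argument.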


Like any good mathematical properties, the terms onto and one-to-one can be 
defined more formally, which we do below. The form of these definitions is impor- 
tant for the subsequent discussion of mathematical proofs about these properties of 
functions. 





Definition 


1. A function f is onto if everything in the codomain really is an output of f.
That is, for every element y in the codomain, there must be (at least one) x
in the domain where f(x) = y.

2. A function f is one-to-one if nothing in the codomain is an output via two
different inputs. That is, for every choice of different elements x₁ and x₂ in
the domain, f(x₁) and f(x₂) must be different.

3. A function f is a one-to-one correspondence if it is both one-to-one and
onto. This is equivalent to saying that f is invertible.














These definitions can also be stated specifically in terms of arrow diagrams:


• A function is onto if every point in the codomain has an arrow ending at that point.
• A function is one-to-one if no point in the codomain has two or more arrows
ending at that point.
• A function is a one-to-one correspondence (or invertible) if every point in the
codomain has exactly one arrow ending at that point.
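
The arrow-diagram versions of the three definitions translate directly into counting incoming arrows. The Python sketch below does this for finite functions stored as dictionaries. The function g used here is a hypothetical stand-in with the two defects described for Figure 4-43 (nothing maps to 4, and two elements map to 3); the actual diagram is not reproduced in the text, so its domain labels are assumptions.

```python
from collections import Counter

def incoming_arrows(f, codomain):
    """Count, for each codomain point, how many arrows of f end there."""
    counts = Counter(f.values())
    return {y: counts.get(y, 0) for y in codomain}

def is_onto(f, codomain):
    return all(c >= 1 for c in incoming_arrows(f, codomain).values())

def is_one_to_one(f, codomain):
    return all(c <= 1 for c in incoming_arrows(f, codomain).values())

def is_invertible(f, codomain):
    return all(c == 1 for c in incoming_arrows(f, codomain).values())

# Hypothetical g: nothing maps to 4, and both c and d map to 3.
g = {"a": 1, "b": 2, "c": 3, "d": 3}
g_codomain = [1, 2, 3, 4]

# The function f from part 2 of Example 1, which is invertible.
f = {"a": 2, "b": 3, "c": 1}
f_codomain = [1, 2, 3]

print(incoming_arrows(g, g_codomain))
print(is_onto(g, g_codomain), is_one_to_one(g, g_codomain))
print(is_invertible(f, f_codomain))
```

Note that the codomain has to be passed in explicitly: whether a function is onto depends on the declared codomain, not only on the rule.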


Practice Problem 1 Which of the following functions are invertible? For each 
noninvertible function, explain why it is not one-to-one or not onto. 


(a) c: Z — Z given by c(x) =x forall x € Z. 

(b) 5:N — N defined so that s(x) is the closest whole number to 4/5 for all x € N. 

(c) h : {0, 1, 2, 3, 4} → {1, 2, 4, 6, 8} given so that h(n) is the ones' digit of 2ⁿ for all
n ∈ {0, 1, 2, 3, 4}.

(d) g : {0, 1, 2, 3, 4, 5, 6, 7, 8, 9} → {0, 1, 2, 3, 4, 5, 6, 7, 8, 9} given so that g(n) is the
ones' digit of 2ⁿ for all n ∈ {0, 1, 2, 3, 4, 5, 6, 7, 8, 9}.











43 Properties of Functions and Set Cardinality 287 


Formal Proofs About Functions 


Having formal definitions for one-to-one and onto allows us to discuss formal proofs 
of these properties. Although these properties are fairly concrete in the context of 
arrow diagrams, they can quickly seem very abstract when we begin to write proofs. 
This is very similar to the difference between visualizing a property of sets with Venn 
diagrams and writing a formal proof of that property. In this book, we will primarily 
use these proofs as further practice of our mathematical writing skills. 


Example 3 Prove that the function f : N → N with the rule f(x) = 5x + 7 is
one-to-one.


SOLUTION To show that f : N → N is one-to-one, we must show that “for
every choice of different elements x₁ and x₂ in N, f(x₁) and f(x₂) must be
different.” If we state this as an “if, then” statement, it reads, "For all x₁, x₂ ∈ N,
if x₁ ≠ x₂, then f(x₁) ≠ f(x₂)." When we want to prove that a given function
is one-to-one, we will always prove the contrapositive of this statement.
Claim Define the function f : N → N with the rule f(x) = 5x + 7. For all
x₁, x₂ ∈ N, if f(x₁) = f(x₂), then x₁ = x₂.

Proof Let x₁, x₂ ∈ N be given such that f(x₁) = f(x₂). By the definition of
f, this means that

5x₁ + 7 = 5x₂ + 7.

By simple algebra (subtract 7 from both sides, then divide by 5 on both sides),
we conclude that x₁ = x₂, completing the proof. ∎


Once we made the decision to prove this property using the contrapositive of its
formal definition, it became so straightforward to prove a function is one-to-one that it
almost doesn't seem like we are doing anything. To convince you that this is, in fact,
a proof, try the following problem.


Example 4 Find the mistake in the following proof that the function g : R →
R with g(x) = x² is one-to-one.


PROOF Let x₁, x₂ ∈ R be given such that g(x₁) = g(x₂). By the definition of g,
this means that

x₁² = x₂².

By simple algebra (take the square root of both sides), we conclude that x₁ = x₂,
completing the proof.





SOLUTION We know the proof must be wrong because the given function is
not one-to-one, as the partial arrow diagram in Figure 4-47 shows. The easy way
to find the mistake in the proof is to play the role of the READER, and when you
are asked to give elements x₁ and x₂ with g(x₁) = g(x₂), you pick two domain
values whose arrows point to the same codomain value.

• The READER chooses x₁ = −1 and x₂ = 1 as a response to the first line of
the proof.

• The AUTHOR asserts that x₁² = x₂², which we must agree with because of the
way the x₁ and x₂ were chosen.

• The AUTHOR then claims that taking the square root of both sides of the
equation will make it say, “x₁ = x₂,” which is not true for our choice of x₁
and x₂. Since this is the first false statement that the proof claims is true, we
have found the mistake.

Figure 4-47 Diagram for Example 4.

The problem is that knowing x₁² = x₂² does not allow you to conclude that x₁ = x₂
since you do not know whether the chosen values are positive or negative.
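
Playing the READER's role amounts to hunting for two different inputs with the same output. The Python sketch below does that by brute force; the helper find_collision and the finite integer ranges are assumptions made for the illustration (the functions in Examples 3 and 4 have infinite domains, so a search that finds nothing is evidence, not proof).

```python
def find_collision(func, domain):
    """Return a pair (x1, x2) of different inputs with equal outputs, or None."""
    seen = {}  # maps each output seen so far to the input that produced it
    for x in domain:
        y = func(x)
        if y in seen:
            return (seen[y], x)
        seen[y] = x
    return None

# Example 4: g(x) = x^2 on a few integers, including negatives.
square_collision = find_collision(lambda x: x * x, range(-3, 4))

# Example 3: f(x) = 5x + 7 on the first thousand natural numbers.
linear_collision = find_collision(lambda x: 5 * x + 7, range(0, 1000))

print(square_collision)  # (-1, 1)
print(linear_collision)  # None
```

The search recovers the READER's choice x₁ = −1, x₂ = 1 for the squaring function and finds no collision for 5x + 7, which is consistent with the proof in Example 3.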








Practice Problem 2 Prove that the function g: = N — Q with the rule g(n) = Ay 
is one-to-one, 

The advantage of formal proofs is that they allow you to prove general properties
rather than just properties of specific objects. We saw this in Section 2.2 when
we proved statements like “If n is odd, then 3n is odd,” only later to prove the
more general rule "The product of two odd integers is odd." It is typical that the
more general the property, the more useful it is, but the more abstract the proof.
Let's see an example of moving from specific to general with the same proof structure.
In the following proposition, the functions are specifically f: N — N with 
f(x) = Sx +7 from Example 3 and g : N — Q with g(n) = 72; from Practice Prob- 
lem 2. 








Proposition 1 Since we know that f and g are one-to-one, then we know that
the function h : N → Q with the rule h(x) = g(f(x)) is also one-to-one. (Recall that
the function h defined this way is the composition of g with f, which we usually write
as h = g ∘ f and say, “h is g of f.”)





PROOF We prove that h is one-to-one using the contrapositive of the formal
definition of one-to-one. Let x₁, x₂ ∈ N be given such that h(x₁) = h(x₂). By the
definition of h, this means that

g(f(x₁)) = g(f(x₂)).

However, when we showed that g is one-to-one in Practice Problem 2, we
showed that the only way for g(f(x₁)) = g(f(x₂)) to be true is if f(x₁) = f(x₂) is
true. That is, 5x₁ + 7 = 5x₂ + 7. Similarly, when we showed that f is one-to-one
in Example 3, we showed that the only way for 5x₁ + 7 = 5x₂ + 7 to be true is if
x₁ = x₂ is true. Since we can conclude that x₁ = x₂, the proof is complete. ∎





The form of this proof can be generalized to give us a much more useful, universal
rule about the composition of functions.


Proposition 2 If f : A → B is one-to-one and g : B → C is one-to-one, then
the new function (g ∘ f) : A → C is also one-to-one.





PROOF To simplify things we will use the function name h for (g ∘ f). That is,
h : A → C is defined by the rule h(x) = g(f(x)). We prove that h is one-to-one
using the contrapositive of the formal definition of one-to-one.

Let x₁, x₂ ∈ A be given such that h(x₁) = h(x₂). By the definition of h this
means that

g(f(x₁)) = g(f(x₂)).

However, since g is one-to-one, the only way for g(f(x₁)) = g(f(x₂)) to be true is
if f(x₁) = f(x₂) is true. Similarly, since f is one-to-one, the only way for f(x₁) =
f(x₂) to be true is if x₁ = x₂ is true. Since we can conclude that x₁ = x₂, the proof
is complete. ∎
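
Proposition 2 can also be sanity-checked by exhaustive search on small finite sets. The Python sketch below enumerates every function f : A → B and g : B → C for three small sets chosen arbitrarily for this illustration and looks for a pair of one-to-one functions whose composition fails to be one-to-one. Such a search does not replace the proof, which covers all sets at once.

```python
from itertools import product

def all_functions(domain, codomain):
    """Yield every function from domain to codomain as a dictionary."""
    for outputs in product(codomain, repeat=len(domain)):
        yield dict(zip(domain, outputs))

def is_one_to_one(f):
    """No output is reached from two different inputs."""
    return len(set(f.values())) == len(f)

def compose(g, f):
    """Return g ∘ f, the function x -> g(f(x))."""
    return {x: g[f[x]] for x in f}

# Small sets chosen only for this check.
A, B, C = [1, 2], ["a", "b", "c"], ["X", "Y", "Z"]

counterexamples = [
    (f, g)
    for f in all_functions(A, B)
    for g in all_functions(B, C)
    if is_one_to_one(f) and is_one_to_one(g) and not is_one_to_one(compose(g, f))
]
print(counterexamples)  # []
```

The empty list means that, among all 9 × 27 pairs of functions on these sets, no pair contradicts the proposition.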








Regardless of whether a function is given in terms of an algebraic formula, an
arrow diagram, or an abstract description, we always use the same form of argument
when we want to prove that the function is one-to-one. Remember that this form is
based on the contrapositive of the definition of one-to-one.


Claim. f : A → B is one-to-one.
Proof Let a₁ ∈ A and a₂ ∈ A be given such that f(a₁) = f(a₂).

Use known information about f to conclude that a₁ = a₂.





For the property of being onto, it is much easier to connect the formal definition
with its informal meaning. Testing whether a function is onto can even be viewed as
a game played by our old friends, AUTHOR and READER.


Example 5 Given the function f : N → Q⁺ with the rule f(n) = (n² + 1)/(n + 1), play
several turns of the following game:


• READER chooses an element r from the codomain (Q⁺).
• AUTHOR responds with an element n from the domain (N).
• If f(n) = r, then r really is an output from f, so AUTHOR gets a point.
• Otherwise, READER gets a point.


SOLUTION We show the moves in Table 4-11. It appears that READER has
found a move (choosing 4/3) for which AUTHOR has no response. Hence, it
looks like READER will be able to score as many points as she likes, and so she
will always win this game.





290 Chapter 4 / Functions and Relations 
























READER Chooses (r) 
Yes (point for A) 
sn No (point for R) 
sn Yes (point for A) 
aa No (point for R) 
D No (point for R) 
48 No (point for R) 











Table 4-11 Solution for Example 5 


In terms of an arrow diagram, the value 4/3 chosen by READER seems to be
an element of the codomain to which no arrow points. This would imply that this
function is not onto, but how do we rigorously prove this?

Since the rule for function f is given by an algebraic equation, it makes sense
that the explanation should involve algebra. Consider AUTHOR's task as an algebra
problem: “Find a natural number n for which (n² + 1)/(n + 1) = 4/3.” The tools for solving this
equation come from a high school algebra course:


The given equation (n² + 1)/(n + 1) = 4/3 can be multiplied by 3(n + 1) on both sides to yield
the equation 3(n² + 1) = 4(n + 1), which can be simplified to 3n² − 4n − 1 = 0.
Applying the quadratic formula to this equation gives us the solutions
n = (4 ± √28)/6 = (2 ± √7)/3.

Since these values for n are not in N, we conclude that it is not possible to solve
this equation with a natural number for n. So in this case, even to show the function
is not onto requires a formal justification.
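
The algebra above can be double-checked numerically. The Python sketch below takes the rule f(n) = (n² + 1)/(n + 1), which is the rule implied by the equation 3(n² + 1) = 4(n + 1), evaluates the quadratic formula for 3n² − 4n − 1 = 0, and also plays AUTHOR's side of the game by brute force. The search bound of 1000 is an arbitrary choice for the illustration.

```python
import math
from fractions import Fraction

def f(n):
    """f(n) = (n^2 + 1)/(n + 1), computed exactly as a fraction."""
    return Fraction(n * n + 1, n + 1)

# Quadratic formula for 3n^2 - 4n - 1 = 0.
a, b, c = 3, -4, -1
disc = b * b - 4 * a * c
roots = [(-b + sign * math.sqrt(disc)) / (2 * a) for sign in (1, -1)]

# The discriminant is not a perfect square, so neither root is even rational.
is_perfect_square = math.isqrt(disc) ** 2 == disc

# AUTHOR's search: no natural number below the bound is sent to 4/3.
hits = [n for n in range(0, 1000) if f(n) == Fraction(4, 3)]

print(disc, roots, is_perfect_square, hits)
```

The two roots are roughly 1.549 and −0.215, neither of which is a natural number, and the brute-force search finds no n with f(n) = 4/3.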
On the other hand, the game itself suggests the following proof structure for
arguing that a function is onto. We present this proof structure first as an informal
letter from AUTHOR to READER.


Proposition 3 The function f : R* — (1, 00) with the rule f(x) = 58 is onto. 
(Recall that the interval notation (1, ∞) denotes the set {z ∈ R : 1 < z}.)


PROOF 
Dear READER, 


Suppose you choose an element r from the codomain (1, ∞). I will show you
an element z of the domain R⁺ such that f(z) = r.





L Letz= zh. 
2. You can check 








* This notation was introduced in Exercise 21 of Section 3.1. 











43 Properties of Functions and Set Cardinalty 291 


3. Since r ∈ (1, ∞), we know that r > 1, from which it follows that r − 1 > 0, and
so z ∈ R⁺. Hence, the z we have chosen is in the domain regardless of which
codomain value for r you use.


Hence, my z ∈ R⁺ will make the equation f(z) = r true, so your choice of r
really is an output of f.


Sincerely yours,
AUTHOR ∎


Before you try your hand at this sort of proof, let's look at one more sample
proof in which the algebra is less messy but the language is much more formal.


Proposition 4 The function g : Q → Q with the rule g(x) = 5x − 1 is onto.


PROOF Let y ∈ Q (the codomain of g) be given. Take x to be (y + 1)/5. Since y ∈ Q,
it follows that x ∈ Q, the domain of g. Now we can see that

g(x) = 5 · ((y + 1)/5) − 1 = (y + 1) − 1 = y.

Hence, any chosen y in the codomain is really an output of g, so g is onto. ∎


Notice that much of the “algebra fiddling” is left out of this proof since it is just
as logically sound (but much shorter) to demonstrate that a solution works as it is to
algebraically derive the solution.
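
The "demonstrate that a solution works" step of this proof is exactly what a short computation can do. The Python sketch below uses exact rational arithmetic to confirm that x = (y + 1)/5 is sent to y by g for a handful of sample values; the helper name preimage and the particular samples are choices made for this illustration.

```python
from fractions import Fraction

def g(x):
    """The rule from Proposition 4."""
    return 5 * x - 1

def preimage(y):
    """AUTHOR's response to the READER's choice y."""
    return (y + 1) / 5

# A few rational codomain values chosen arbitrarily.
samples = [Fraction(0), Fraction(7), Fraction(-3, 4), Fraction(22, 7)]
checks = [g(preimage(y)) == y for y in samples]

print([preimage(y) for y in samples])
print(checks)
```

Because Fraction arithmetic is exact, each comparison is a genuine equality test rather than a floating-point approximation, and each response stays inside Q.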





Practice Problem 3 Prove that the function f : Q → Q with the rule f(x) =
3x + 7 is onto. Prove that the function g : Q → Q with the rule g(x) = x² + 5x − 10
is not onto.





Just as with the property of one-to-one, the real test for this proof technique is
how it holds up when working with abstract functions. The proof of the following
proposition is addressed in Exercises 8 and 9 at the end of the section.


Proposition 5 If f : A → B and g : B → C are both onto, then the new function
(g ∘ f) : A → C is also onto.


Just as we are able to summarize our approach for proving functions are
one-to-one, we can also describe the form of argument when we want to prove that a
function is onto.


Claim. f : A → B is onto.
Proof Let b ∈ B be given.

Use known information about f to produce a domain element a ∈ A for
which f(a) = b.





We end our discussion of formal proofs with a reminder that a function that is 
both one-to-one and onto is invertible, Therefore, to formally prove that an abstract 
function is invertible, we can prove that the function is one-to-one and onto in two 
separate arguments. We currently have all the components on hand to establish 
perhaps the most important abstract property of invertible functions, 


Theorem 6 If f : A → B is invertible and g : B → C is invertible, then the new
function (g ∘ f) : A → C is invertible.





PROOF Let invertible functions f and g as described be given. Since the functions
are invertible, it follows that they are each one-to-one, and this implies,
according to Proposition 2, that the composition (g ∘ f) is one-to-one.

On the other hand, since the functions are invertible, it follows that they
are each onto, and this implies, according to Proposition 5, that the composition
(g ∘ f) is onto.

Since (g ∘ f) is one-to-one and onto, this means that (g ∘ f) is invertible. ∎


Set Cardinality and the Pigeonhole Principle 


For finite sets, there is an interesting relationship among the concepts of one-to-one,
onto, one-to-one correspondence,* and the sizes of the domain and codomain. We
will state the properties as a theorem, and give a very informal explanation in terms
of arrow diagrams.





Theorem 7 Let f : A → B be a function, where A and B are finite sets of sizes
m and n, respectively.

1. If f is one-to-one, then m ≤ n.
2. If f is onto, then m ≥ n.
3. If f is a one-to-one correspondence, then m = n.





PROOF Envision an arrow diagram for the function f. Since A contains m
elements, there are exactly m arrows, one starting at each element of A.

1. If f is one-to-one, then the m arrows all lead to different elements of B. So B
has at least these m elements. Hence, n ≥ m.





*A one-to-one correspondence is the term for an invertible function that we will use in Chapter 5.










2. With only m arrows, they can end at no more than m different points. In an
onto function, every element of B is an ending point of an arrow, so there are
no more than m elements in B. Hence, n ≤ m.

3. This follows from parts (1) and (2), but we can also get a nice visualization.
For a one-to-one correspondence, every element of the codomain is pointed
to by one and only one of the m arrows from the domain elements. Hence, the
number n of elements in the codomain is exactly m. ∎
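The three size relationships in Theorem 7 can be checked exhaustively for small sets by enumerating every function from an m-element set to an n-element set. This is a sketch under the assumption that functions are represented as Python dicts; the helper names are made up for illustration, and the check covers only sizes 1 through 4.

```python
from itertools import product

def all_functions(A, B):
    """Yield every function f: A -> B as a dict (one arrow per element of A)."""
    for outputs in product(B, repeat=len(A)):
        yield dict(zip(A, outputs))

def is_one_to_one(f):
    """No two arrows end at the same point."""
    return len(set(f.values())) == len(f)

def is_onto(f, B):
    """Every element of B is the end of some arrow."""
    return set(f.values()) == set(B)

def check_theorem_7(m, n):
    """Assert the three parts of Theorem 7 for all functions between sets of sizes m and n."""
    A, B = list(range(m)), list(range(n))
    for f in all_functions(A, B):
        if is_one_to_one(f):
            assert m <= n
        if is_onto(f, B):
            assert m >= n
        if is_one_to_one(f) and is_onto(f, B):
            assert m == n
    return True

for m in range(1, 5):
    for n in range(1, 5):
        check_theorem_7(m, n)
```

An exhaustive check on small cases is evidence, not proof; the arrow-diagram argument is what covers all finite m and n.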











We will make heavy use of the third part of this theorem in Chapter 5. The
result contends that if we can find a one-to-one correspondence between two sets,
then we will know the two sets are the same size. We will use this fact to show that
different counting problems have the same answer without being able to answer
either problem! However, even the first two parts of the theorem are useful for
conclusions about the relative sizes of sets.


Example 6 Let A be the set of all three-element subsets of {1, 2, 3, 4, 5}, let B
be the set of all two-element subsets of {1, 2, 3, 4}, and let f : A → B be the function
so that for all X ∈ A, f(X) is the set obtained by removing the largest number from
the set X. For example, f({1, 3, 4}) = {1, 3}. Prove that f is onto and cite the part of
Theorem 7 that allows us to conclude that n(A) ≥ n(B).

SOLUTION Proof that f is onto. Let Y ∈ B be given. That is, Y is a two-element
subset of {1, 2, 3, 4}. Let X = Y ∪ {5}. Since 5 ∉ Y (after all, Y is a
subset of {1, 2, 3, 4}), we know that X is a three-element subset of {1, 2, 3, 4, 5}
whose largest element is 5. Hence, f(X) = Y, so Y is an output of the function f.

By the second part of Theorem 7, we can conclude that n(A) ≥ n(B). ∎
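Because the sets in Example 6 are small, the claim can also be confirmed by listing them. The sketch below assumes subsets are modeled as `frozenset` objects; it reproduces the function f (remove the largest element) and compares its set of outputs with B.

```python
from itertools import combinations

# A: three-element subsets of {1,...,5}; B: two-element subsets of {1,...,4}.
A = [frozenset(s) for s in combinations(range(1, 6), 3)]
B = [frozenset(s) for s in combinations(range(1, 5), 2)]

def f(X):
    """Remove the largest number from the set X."""
    return X - {max(X)}

outputs = {f(X) for X in A}

assert all(f(X) in B for X in A)   # f really maps A into B
assert outputs == set(B)           # every Y in B is an output, so f is onto
assert len(A) >= len(B)            # the conclusion of Theorem 7, part 2
print(len(A), len(B))
```

The enumeration confirms the inequality but, unlike the proof, it says nothing about why the construction X = Y ∪ {5} always works.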


Practice Problem 4 Let A be the set of all three-element subsets of {1, 2, 3, 4, 5},
let B be the set of all two- or three-element subsets of {1, 2, 3, 4}, and let f : A → B
have the rule f(S) = S − {5} for each S ∈ A. Prove that f is onto, and then write the
conclusion to the second part of Theorem 7 as an English sentence about subsets.





The first part of Theorem 7 is closely related to the basic pigeonhole principle 
from Section 2.5. In fact, we can more formally prove the contrapositive of this part 
of Theorem 7 using that result directly. 








Theorem 8 (The Pigeonhole Principle Revisited) Let f : A → B be
a function, where A and B are finite sets of sizes m and n, respectively. If m > n, then
f is not one-to-one.


PROOF Let f : A → B be a function, where A and B are finite sets of sizes
m and n, where m > n. Define n boxes labeled by the elements of B. For each
element i ∈ B, we put an object a from set A into box i if f(a) = i. According to
this rule, we have n boxes into which we are placing m > n objects. By the basic
pigeonhole principle, there must be a box z with at least two objects. This means
that there are two distinct elements x and y in A for which f(x) = z and f(y) = z. This,
in turn, means that f is not one-to-one. ∎
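The box-filling argument in this proof translates directly into code: drop each element a of A into the box labeled f(a) and stop as soon as some box holds two objects. This is a sketch only; `find_collision` is a hypothetical name, and the five-card hand is made-up sample data.

```python
from collections import defaultdict

def find_collision(f, A):
    """Place each a in A into the box labeled f(a).

    Returns a pair of objects that share a box, or None if f is one-to-one on A.
    """
    boxes = defaultdict(list)
    for a in A:
        boxes[f(a)].append(a)
        if len(boxes[f(a)]) == 2:
            return tuple(boxes[f(a)])
    return None

# Illustrative hand: rank followed by a suit letter (H, C, S, D).
hand = ["2H", "9C", "KS", "AD", "7C"]
suit = lambda card: card[-1]
print(find_collision(suit, hand))
```

When A has more elements than there are boxes, Theorem 8 guarantees the function returns a pair rather than None.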


Example 7 For each of the following statements, specify sets A and B and a
function f so that Theorem 8 applies and the fact that f is not one-to-one is equivalent
to the conclusion of the statement.

1. In a particular card game, a "hand" consists of five cards from a standard deck.
Prove that in every hand there is a suit that appears more than once.

2. Prove that the last four digits of my Social Security number (SSN) contain two
digits whose difference is divisible by 3.


SOLUTION 


1. Let A be the set of all cards in my hand, let B be the set {clubs, hearts, spades,
diamonds}, and let f : A → B be the function where f(x) is the suit of card
x. Since n(A) = 5 and n(B) = 4, we know from Theorem 8 that f is not
one-to-one, and hence there is some element (suit) of the codomain that is
mapped to by two elements (cards) in the domain (my hand).

2. Let A be the set of digits in the last four digits of my SSN, let B be the
set {0, 1, 2}, and let f : A → B be the function where f(z) is the remainder
obtained when z is divided by 3. (This definition of f guarantees that z − f(z)
is divisible by 3 no matter what z is, a fact proven in Section 2.2.)

Now you do not necessarily know the size of the set A because you do not
know how many different digits are in the last four digits of my SSN, so we
must proceed in two cases.

• Case 1: Suppose some digit appears twice in the last four digits of my
SSN. Then these two digits differ by 0, which is a multiple of 3, and we
are done.

• Case 2: Suppose the four digits at the end of my SSN are all different. In
this case, since n(A) = 4 and n(B) = 3, we know from Theorem 8 that f
is not one-to-one, and hence there is some element r of the codomain
that is mapped to by two elements w and x in the domain. That is,
f(x) − f(w) = 0. Because w − f(w) and x − f(x) are each divisible by
3, we can choose integers K and L so that w − f(w) = 3K and
x − f(x) = 3L. Then

w − x = (w − x) + (f(x) − f(w))
      = (w − f(w)) − (x − f(x))
      = 3K − 3L = 3(K − L).

Hence, w − x is divisible by 3. ∎
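The second statement of Example 7 involves only 10,000 possible four-digit endings, so it can be checked by brute force. The sketch below is an illustration, not part of the proof; `witness_pair` is a hypothetical helper that returns the positions of two digits whose difference is divisible by 3.

```python
from itertools import combinations, product

def witness_pair(digits):
    """Return two positions whose digits differ by a multiple of 3, or None."""
    for i, j in combinations(range(len(digits)), 2):
        if (digits[i] - digits[j]) % 3 == 0:
            return i, j
    return None

# Every possible sequence of four digits contains such a pair.
assert all(witness_pair(d) is not None for d in product(range(10), repeat=4))

# With only three digits the conclusion can fail, e.g. remainders 0, 1, 2.
print(witness_pair((0, 1, 2)))
```

Repeated digits (Case 1 of the solution) are handled automatically, since a difference of 0 is divisible by 3.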


Practice Problem 5 Prove that for every set of five positive integers {a, b, c, d, e},
at least two numbers in the set {3^a, 3^b, 3^c, 3^d, 3^e} have the same ones' digit.













Infinite Sets 


So far our discussion about the relative sizes of sets has been directed toward finite
sets, but the same ideas can be applied to infinite sets. This raises issues about the
foundations of mathematics that, while very interesting, are not really central to this
course as a whole. We end this section by addressing some of these issues in order to
illustrate some surprising ideas about infinity.








Definition Sets A and B have the same cardinality if there exists an invertible
function f : A → B.








Note that we use the word "cardinality" instead of "size" as we start down the
path to formal thinking about infinity. We informally think of a finite set being one
whose cardinality is a natural number, but this leads to a negative definition of infinity
(i.e., a set is infinite if it is not finite), and we have seen on more than one occasion
that negative definitions are difficult to work with in proofs. Now that we understand
the concepts of one-to-one and onto functions, another definition presents itself that
is actually much easier to work with.











Definition A set A is infinite if there exists a function f : A → A that is
one-to-one but not onto.











Example 8 Show that N is infinite, using the above definition.

SOLUTION Let f : N → N be defined by the rule f(x) = 2x. It is straightforward
to prove that f is one-to-one, so we leave that detail to the reader.
Since the outputs of f are always even integers, there are many examples (like
1 ∈ N) of codomain elements that are not outputs of f. Hence, f is not onto.
By our new definition of infinite, this proves that N is infinite. ∎
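A computer cannot examine all of N, but the two properties used in Example 8 can at least be spot-checked on a finite window of inputs. The window size of 1000 below is an arbitrary assumption, and the check is only a plausibility test, not a proof.

```python
def f(x):
    """The rule from Example 8: f(x) = 2x."""
    return 2 * x

window = range(0, 1000)            # arbitrary finite sample of natural numbers
images = [f(x) for x in window]

assert len(set(images)) == len(images)   # no two sampled inputs share an output
assert all(y % 2 == 0 for y in images)   # every output is even ...
assert 1 not in images                   # ... so 1 is never an output
```

The general facts (2x = 2y forces x = y, and 2x is never 1) are what the written argument supplies.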









Practice Problem 6 Show directly that Z and N have the same cardinality. 


This practice problem illustrates one shortcoming of our formal definition of
same cardinality. Constructing invertible functions between sets can be more
challenging than finding more natural one-to-one functions. For this reason, the famous
German mathematician Georg Cantor (1845-1918) proposed* the following result,
published in 1895, that makes it much easier to work formally with infinite sets. The
result is typically included in a first course in formal set theory, and even though the
idea is fairly intuitive, the proof is well beyond the scope of this book.














Theorem 9 (Cantor-Bernstein Theorem) Given sets A and B, if there
is a one-to-one function f : A → B and a one-to-one function g : B → A, then sets
A and B have the same cardinality.

*Cantor did not actually prove this. Felix Bernstein (1878-1956) gave the first correct proof.










Example 9 Use the Cantor-Bernstein theorem to show that Q^{≥0} has the same
size as the set N.

SOLUTION Since N ⊆ Q^{≥0}, we immediately know that the function f : N →
Q^{≥0} defined by f(x) = x is one-to-one. On the other hand, the function g :
Q^{≥0} → N can be defined as follows: Given r ∈ Q^{≥0}, there is a unique choice of
relatively prime natural numbers a and b such that r = a/b, so let g(r) = 2^a × 3^b.
Because the prime factorization of numbers is unique, the function g is also
one-to-one. ∎
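The encoding g(r) = 2^a × 3^b from Example 9 is easy to experiment with, since Python's `Fraction` type already stores a rational number in lowest terms. The sketch below assumes nonnegative rationals and tests injectivity only on a small, arbitrarily chosen sample.

```python
from fractions import Fraction

def g(r: Fraction) -> int:
    """Encode r = a/b (in lowest terms) as the natural number 2**a * 3**b."""
    a, b = r.numerator, r.denominator   # Fraction keeps a/b relatively prime
    return 2**a * 3**b

# Sample of nonnegative rationals with small numerators and denominators.
rationals = {Fraction(a, b) for a in range(0, 8) for b in range(1, 8)}
codes = {g(r) for r in rationals}

assert len(codes) == len(rationals)   # distinct rationals received distinct codes
print(g(Fraction(2, 4)), g(Fraction(1, 2)))   # the same rational gives the same code
```

Reducing to lowest terms first is what makes g well defined: 2/4 and 1/2 are the same input and must receive the same output.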


All this energy to show that two infinite sets have the same cardinality might
seem wasted if you have never thought hard about infinity before. After all, aren't
all infinite sets the same size? If you have never given it any thought, you are
in good company. Mathematics history suggests that no one had carefully considered
questions involving "infinity" until Georg Cantor in the twilight of the nineteenth
century. We have already seen one of his important ideas, but his most famous
theorem might leave you scratching your head. The basic idea in the proof
of this theorem has led to some of the most famous and important results about
the foundations of mathematics, including Kurt Gödel's famous incompleteness
theorems.








Theorem 10 (Cantor's Theorem) For every set A, A and P(A) do not
have the same size. (That is, no set is the same size as its power set.)


PROOF Let a set A and a function f : A → P(A) be given. We will show that f
is not invertible by showing that it is not onto. That is, we will produce a member
B of the codomain P(A) that is not equal to f(x) for any x ∈ A. We can define
the set B directly as follows:

B = {x ∈ A : x ∉ f(x)}

Since B ⊆ A, this means that B ∈ P(A), so B is a member of the codomain
of f. To see that f(x) ≠ B for all x ∈ A, we imagine what would happen
if there is a b ∈ A with f(b) = B. A contradiction then arises when considering
the question "Is b ∈ B?" To see this, we just consider both possible
answers:

• Case 1: If b ∈ B, then by the definition of B, this means that b ∉ f(b). But f(b)
is B, so this says that b ∉ B, a contradiction.

• Case 2: If b ∉ B, then this means b ∉ f(b), so according to the way the set B is
defined, b is one of the elements in B, a contradiction.

Thus, we have the strange contradiction that b ∈ B if and only if b ∉ B. Because
of this contradiction, we conclude that there is no value of x for which f(x) = B,
so f is not invertible. ∎
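For a small finite set, the construction in this proof can be run against every possible function f : A → P(A). The sketch below assumes A = {1, 2, 3} (an arbitrary choice that gives 8³ = 512 functions) and checks that the diagonal set B = {x ∈ A : x ∉ f(x)} is never an output; the helper names are hypothetical.

```python
from itertools import combinations, product

def power_set(A):
    """All subsets of A, as frozensets."""
    items = list(A)
    return [frozenset(c) for k in range(len(items) + 1)
            for c in combinations(items, k)]

def diagonal_set(f, A):
    """B = {x in A : x not in f(x)}, as defined in the proof."""
    return frozenset(x for x in A if x not in f[x])

A = [1, 2, 3]
subsets = power_set(A)

# Try every function f: A -> P(A), stored as a dict from elements to subsets.
for images in product(subsets, repeat=len(A)):
    f = dict(zip(A, images))
    B = diagonal_set(f, A)
    assert B not in f.values()   # B is missed by f, so f is not onto
```

The finite case is not surprising, as the text goes on to note; the value of the proof is that the same construction works when A is infinite.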


Example 10 Illustrate the proof of Cantor's theorem with a single example.

SOLUTION Suppose A = {1, 2, 3, 4} and f : A → P(A) is given below:

f(1) = {1, 4}
f(2) = …
f(3) = …
f(4) = …

Note that 1 ∈ f(1), 2 ∈ f(2), but 3 ∉ f(3) and 4 ∉ f(4), so the set B defined in
the proof is B = {3, 4}. Clearly in our example, there is no value of x for which
f(x) = B. ∎





We saw in Exercise 24 of Section 3.1 that a finite set with n elements has 2^n
subsets, so the fact that n ≠ 2^n is hardly a news flash. The significant thing about
Cantor's proof is that it applies to all sets, even infinite ones.


Corollary 11 N and P(N) do not have the same size.

The conclusion from this is the part that might make you scratch your head:
Cantor showed that there are different sizes of infinity! With this in mind, the set N
takes a special place representing the smallest possible infinity.





Definition An infinite set A is countable if it has the same cardinality as
N. An infinite set is uncountable if it is not countable.








With this terminology, our work in this section can be rephrased as follows: 


• N is countable (from the definition of countable).
• Z is countable (from Practice Problem 6).
• Q is countable (from Example 9).
• P(N) is uncountable (from Corollary 11).
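The countability of Z rests on an explicit invertible function between Z and N. The sketch below shows one standard choice, sending nonnegative integers to even numbers and negative integers to odd numbers. It assumes that N includes 0; the function names `z_to_n` and `n_to_z` are hypothetical, and the round-trip checks cover only a finite range.

```python
def z_to_n(r: int) -> int:
    """Map 0, 1, 2, ... to 0, 2, 4, ... and -1, -2, -3, ... to 1, 3, 5, ..."""
    return 2 * r if r >= 0 else -2 * r - 1

def n_to_z(k: int) -> int:
    """Inverse of z_to_n: even k comes from k/2, odd k comes from -(k + 1)/2."""
    return k // 2 if k % 2 == 0 else -(k + 1) // 2

# Round trips in both directions on a finite sample.
assert all(n_to_z(z_to_n(r)) == r for r in range(-50, 51))
assert all(z_to_n(n_to_z(k)) == k for k in range(100))

print(sorted(z_to_n(r) for r in range(-3, 4)))
```

Having an explicit inverse is exactly what the definition of "same cardinality" asks for; it shows the function is both one-to-one and onto.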


Solutions to Practice Problems 

1 Only the third function is invertible. Since the rule for
the fourth function is the same as the third,
the question of invertibility has to do with domain and
codomain as well as with the rule.

(a) There is no a ∈ Z for which a² = 2, so 2 is an element
of the codomain Z to which no domain element is
mapped. (This function is not onto.)
(b) s(3) = s(5) = 2, so more than one domain element
maps to the same codomain element. (This function
is not one-to-one.)
(c) This is invertible.
(d) g is not invertible since it has both kinds of problems:
it is not onto because there is no arrow pointing to
0 or 7 (or any odd number for that matter), and it
obviously is not one-to-one because there are multiple arrows
pointing to numbers like 2 and 4.

2 (We prove the contrapositive of the definition of one-to-one.)
Let n₁ ∈ N and n₂ ∈ N with f(n₁) = f(n₂) be
given. This means that

5/(n₁ + 2) = 5/(n₂ + 2).

Multiplying by (n₁ + 2) · (n₂ + 2) on both sides of this
equation gives us 5 · (n₂ + 2) = 5 · (n₁ + 2), which can
be algebraically simplified (divide both sides by 5 and
then subtract 2 from both sides) to n₂ = n₁.










3 We first argue that the function g : Q → Q with the
rule g(x) = x² + 5x − 10 is not onto. We will show that
there is no input r ∈ Q such that g(r) = −20, that is, with
r² + 5r + 10 = 0. The quadratic formula tells us that in
order to obtain r² + 5r + 10 = 0, we must have

r = (−5 ± √(5² − 4 · 10)) / 2,

and since 5² − 4 · 10 = −15 < 0, this is impossible for a
rational number r. Hence, no rational number input can
make g(x) = −20, which means that −20 ∈ Q is not an
output of g.

Claim The function f : Q → Q with the rule f(x) =
3x + 7 is onto.

Proof Let r ∈ Q be given. Create the number q = (r − 7)/3.
It is easy to see that q is in Q (since we used only subtraction
and division on the rational number r), and we
can check:

f(q) = 3 · ((r − 7)/3) + 7
     = (r − 7) + 7
     = r.

So we have produced an input q that will map to the
given rational number r. Hence, every rational number
is an output of f. ∎

4 Proof that f is onto. Let Y ∈ B be given. That is, Y is
a two-element subset of {1, 2, 3, 4} or a three-element
subset of {1, 2, 3, 4}. Since we do not know which, we
proceed to argue in two cases:

• Suppose Y is a two-element subset of {1, 2, 3, 4}. Let
X = Y ∪ {5}, and X will be a three-element subset of
{1, 2, 3, 4, 5} for which f(X) = X − {5} = Y. Hence,
f(X) = Y in this case.
• Suppose Y is a three-element subset of {1, 2, 3, 4}.
Let Z = Y, and Z will be a three-element subset of
{1, 2, 3, 4, 5} for which f(Z) = Z − {5} = Z. Hence,
f(Z) = Y in this case.

In either case, Y is an output of the function f, so we
conclude that f is onto. ∎

By the second part of Theorem 7, we can conclude
that n(A) ≥ n(B). That is, there are at least as many
three-element subsets of {1, 2, 3, 4, 5} as there are two-element
subsets and three-element subsets of {1, 2, 3, 4}
put together.

5 Let A be the set {a, b, c, d, e} alluded to in the problem.
With a little investigation, we can determine that the
only ones digits that can occur in a number of the form
3^k are digits from the set {3, 9, 7, 1}, so let this be the set
B, and let f : A → B be the function where f(x) is the
ones digit of 3^x. Since n(A) = 5 and n(B) = 4, we know
from Theorem 8 that f is not one-to-one, and hence
there is some element (ones digit) of the codomain that
is mapped to by two elements (numbers) in the domain
(my set of positive integers).

6 We will use a rule that associates positive inputs with
even natural numbers and associates negative inputs
with odd natural numbers:

f(r) = 2r        if r ≥ 0
f(r) = −2r − 1   if r < 0

Exercises for Section 4.3

1. Each of the following functions is not onto. To demonstrate
this, provide an example of an element in the
codomain and explain why no element of the domain is
associated with it.

(a) f : R → R with the rule f(x) = …
(b) g : Q⁺ → Q⁺ with the rule g(x) = …
(c) h : [1, ∞) → [1, ∞) with the rule h(x) = …

2. Each of the following functions is not one-to-one. To
demonstrate this, provide an example of two elements
of the domain that are associated with the same element
of the codomain.

(a) f : R → R with the rule f(x) = …
(b) g : Q⁺ → Q⁺ with the rule g(x) = …
(c) h : … → Q with the rule h(x) = …

3. For each of the following functions, decide if it is one-to-one,
onto, invertible, or none of these:

(a) f : Q → Q with the rule f(x) = 3x + 1.

(b) g : Q⁺ → Q⁺ with the rule g(x) = 7x + 5.
(c) f : R⁺ → R⁺ with the rule f(x) = …
(d) g : (1, ∞) → (1, ∞) with the rule g(x) = …

4. For each of the following functions, decide if it is one-to-one,
onto, invertible, or none of these:

(a) f : N × N → N with the rule f(a, b) = …
(b) f : N × N → N with the rule f(a, b) = …

5. Let S = {a, b, c}, and define a function c : P(S) → P(S)
by the rule c(A) = S − A.

(a) Is the function c one-to-one?
(b) Is the function c onto?
(c) If the function c is invertible, then describe the inverse
of c. If c is not invertible, explain why not.

6. Let S = {a, b, c} and let c be the function of Exercise 5.
Let n : P(S) → {0, 1, 2, 3} be defined so that n(X) is the
number of elements in X, and let s : {0, 1, 2, 3} → P(S)
be defined by Table 4-12.

[Table 4-12: Table for Exercise 6]

(a) Which composition is defined, c ∘ s or s ∘ c? For
the one that is defined, describe the domain and
the codomain, and give an arrow diagram for the
function.

(b) Give an arrow diagram for s ∘ n. Is s ∘ n one-to-one?
Is s ∘ n onto?

(c) Give an arrow diagram for n ∘ s. Is n ∘ s one-to-one?
Is n ∘ s onto?

7. Fill in the blanks in the following proof:

Proposition If f : R → R is one-to-one, then the
function h : R → R with the rule h(x) = f(2 · x) is also
one-to-one.

Proof We prove that h is one-to-one using the contrapositive
of the formal definition of one-to-one. Let
x₁, x₂ ∈ R be given such that ______. By the definition
of h, this means that

f(2 · x₁) = f(2 · x₂).

However, since ______, the only way for f(2 · x₁) =
f(2 · x₂) to be true is if ______ is true. Dividing
by 2 on both sides of this equation, we conclude
that ______, completing the proof.

8. Fill in the blanks in the following proof: 

F Proposition If f :R — R is onto, then the function 
h: R — R with the rule h(x) = f(2 +) is also onto. 
Proof Let y € R be given. Since . there 
must be an x € R with f(x)=y. If we take z 

ER, it follows that 
he JRD f 
From this it follows that A completing the 
proot. . 
9. Generalize the previous exercise to prove that if f :
A → B and g : B → C are both onto, then the composition
(g ∘ f) : A → C is onto.

10. For each of the following statements, specify sets A and
B and a function f so that Theorem 8 applies and the
fact that f is not one-to-one is equivalent to the conclusion
of the statement.

(a) In a particular card game, an entire standard 52-card
deck is divided as evenly as possible among all
players at the start of the game. Prove that if there
are three players playing, everyone must have at
least two cards of the same value at the start of the
game.

(b) The Boatsville Youth Basketball League consists of
10 teams, each playing once a week for a total of 10
weeks. Conclude that every team must play some
team twice.

(c) Prove that for every set of five positive integers
{a, b, c, d, e}, at least two numbers in the set
{7^a, 7^b, 7^c, 7^d, 7^e} have the same ones' digit.

(d) Prove that in any set of five integers, there must be
at least two whose difference is divisible by 4.

(e) Prove that in anyone's seven-digit telephone number,
there must be two digits whose difference is
divisible by 6.

(f) Each person living in New York City has between
0 and 500,000 hairs on his or her head. According
to the 2000 census, there are about 8,010,000 people
living in the five boroughs of New York City.
Conclude that there are at least two people of the
same sex living in the same borough of New York
City with exactly the same number of hairs on their
heads.

11. Prove that the function f : Z → Z with f(x) = 2x + 3
from Example 2 is one-to-one.

12. Prove that the following function from Example 2 is
onto: h : N → N, where h(n) is the sum of the digits in
the (base ten) numeral n.

13, Prove that the function g : Z — N with the rule 


o 


z>0 








sia [37% 





from Example 2 is one-to-one and onto. 
14, Let f :[0, 00) — [4,00) be the function with the rule 
F nose 
(a) Prove that f is one-to-one, 
b) Prove that f is onto, and conclude that / is invert- 
ible. 
(©) Demonstrate that f is invertible directly by finding. 
an algebraic formula to describe the rule for f. 
15. Let g:R — R be the function with the rule g(x) = 
Fs 
(a) Prove that g is one-to-one, 
(b) Prove that g is onto, and conclude that g is invert- 
ible. 
(©) Demonstrate that g is invertible directly by finding. 
an algebraic formula to describe the rule for g“ 
16. Let f:R — R be the function with the rule f(x 
f P-2 
(a) Prove that f is one-to-one. 
(b) Prove that f is onto, and conclude that f is invert- 
ible. 



















(c) Demonstrate that f is invertible directly by finding 
an algebraic formula to describe the rule for f-'. 

17. Let h: [2,00) — (0. 1] be the function with the rule 

F osh 
(a) Prove that A is one-to-one. 

(b) Prove that / is onto, and conclude that A is invert- 
‘ible 
e) Demonstrate that his invertible directly by finding 
an algebraic formula to describe the rule for 4. 
IR. Let f : (1, oc) — (1, x) be the function with the rule 
F i=. 
(a) Prove that / is one-to-one, 
(b) Prove that / is onto, and conclude that f is invert- 
ible. 
(c) Demonstrate that / is invertible directly by finding 
an algebraic formula to describe the rule for f 

19. Let A = {0, 1, 2} × {0, 1, 2, 3} and B = {n : n is a positive
factor of 500}, and let f : A → B be the function
with the rule f(a, b) = 2^a × 5^b.

(a) Prove that f is one-to-one.

(b) Prove that f is onto.

(c) What theorem in this chapter guarantees that A and
B have the same cardinality?

(d) Which set (A or B) is it easier to determine the
cardinality of? What is the cardinality?

20. Let A = {0, 1, 2, 3, 4} × {0, 1, 2} and B = {n : n is a positive
factor of 144}, and let f : A → B be the function
with the rule f(a, b) = 2^a × 3^b.

(a) Prove that f is one-to-one.

(b) Prove that f is onto.

(c) What theorem in this chapter guarantees that A and
B have the same cardinality?

(d) Which set (A or B) is it easier to determine the
cardinality of? What is the cardinality?

For Exercises 21 to 23, we will call a set Σ of characters
an alphabet and a string using only characters from
this set a word over the alphabet Σ. For example, if the
alphabet is the set {a, 1, z}, then some possible words
are 1a1a, 1za, and za1.

21. Let A be the set of words over the alphabet {a, b, c} with
length 5 or less, let B be the set of words over the alphabet
{a, b} with length 4 or less, and let f : A → B be the
function so that for every word W, f(W) is the word W
with every occurrence of the character c deleted. Prove
f is one-to-one or demonstrate it is not. Prove f is onto
or demonstrate that it is not. Which set is bigger, A or
B?

22. With this notation, a binary string (from Section 4.1)
is simply a word over the alphabet {0, 1}. Let A be the
set of all binary strings of length 6 consisting of exactly
two 0's and four 1's, let B be the set of all binary strings
of length 5 consisting of two 0's and three 1's, and let
f : A → B be the function so that for all W ∈ A, f(W)
is the result of changing the leftmost 1 in W into a 0 and
then removing the leftmost character altogether. For
example, f(101101) = 01101 and f(011110) = 01110.
Prove f is one-to-one or demonstrate it is not. Prove f
is onto or demonstrate that it is not. Which set is bigger,
A or B?

23. We will call a word over a given alphabet a permutation
if the word does not use the same character twice. For
example, abc and bcad are permutations over the alphabet
{a, b, c, d}. Let A be the set of all permutations
of length 4 over the alphabet Σ = {a, b, c, d, x, y, z},
let B be the set of all subsets of Σ of size 4, and let
f : A → B be the function so that for all p ∈ A, f(p)
is the set of characters that occur in p. For example,
f(bcxa) = {a, b, c, x}. Prove f is one-to-one or demonstrate
it is not. Prove f is onto or demonstrate that it is
not. Which set is bigger, A or B?

24. Consider the function f : P({1, 2, 3}) → P({1, 2}) with
the rule f(A) = A − {3}.

(a) Prove that f is onto.

(b) Demonstrate that f is not one-to-one.

(c) Demonstrate that every B in the codomain of f is
associated with exactly two elements in the domain
of f.

(d) Explain in your own words how the above facts imply
that the domain of f is twice the size of the
codomain of f.

25. Generalize the argument in the previous exercise to
show that P({1, 2, 3, ..., n − 1, n}) is twice the cardinality
of P({1, 2, 3, ..., n − 1}).

26. Use Exercise 25 as part of a proof by induction that the
cardinality of P({1, 2, 3, ..., n}) is 2^n for all n ≥ 1.

27. Prove Part (2) of Theorem 7.

2%, Find a one-to-one function from R to R°. What result 
in this chapter allows you to conclude that these two. 
sets have the same size? 

29. Suppose A = la, b,c, d] and f A.» P(A)isgivenbe- 
low: 












f(a) 

F(b) = (b,c. d) 
S(O) = la. d] 
fd) = (b.d) 
Fle) = a. b.c, d) 





What is the set B constructed by the proof of Cantor's 
theorem for which f(x) = B is not true for any x € A? 










30. Suppose A = la. b, c, d.e) and f :A — P(A)isgiven 31. Suppose A = (a.b, c. d, e] and f : A — P(A) is given 





below: below: 

fa) = (b.d.e} sla) 

JO) = ta,c.d] 

Sle) = ta.d.el 

Fid) = {be} 

fou 
‘What is the set B constructed by the proof of Cantors What is the set B constructed by the proof of Cantor's 
theorem for which f(x) = Bis not rue foranyx eA? theorem for which f(x) = B is not true for any x € A? 


4.4 Properties of Relations





One moral of the previous section is the fact that when studying abstract mathematical
objects like functions, understanding special properties of the objects (like
one-to-one or onto) allows us to apply them to other mathematical problems, just as
we did to set cardinality problems at the end of that section. This is a major reason
that we study mathematical structures abstractly, as opposed to focusing only on
specific functions on familiar domains, for example. We will continue this theme with
the more general mathematical structure of binary relations in this section.

The properties of binary relations that will be studied are geared toward two
classes of examples of binary relations. We will broadly refer to these as "order
relations" and "equivalence relations." The former class should consist of relations that
compare objects through phrases like "a is no bigger than b," where the word "bigger"
can be interpreted in many ways. Similarly, the latter class consists of relations
that equate objects through phrases like "a is the same as b," where the word "same"
can have many interpretations.

Of course, not every relation belongs to one of these two classes, but the relevance
of these classes to other mathematical topics makes them worthy of our
attention. In this section, we will address order relations and discuss proofs of the
properties relevant to them. We will discuss equivalence relations in the subsequent
section.








Order Relations 


We already know several examples of order relations from this course and before. 
We will start with these to discover the common properties that are always present 
in notions of “order.” 


Example 1 Draw arrow diagrams for each of the following relations:

1. The relation R₁ on the set {1, 2, 3, 4} with the rule "(x, y) ∈ R₁ if x ≤ y."
2. The relation R₂ on the set P({1, 2, 3}) with the rule "(x, y) ∈ R₂ if x ⊆ y."
3. The relation R₃ on the set {1, 2, 3, 6} with the rule "(x, y) ∈ R₃ if x divides y."

SOLUTION The complete arrow diagrams are shown in Figure 4-48. ∎





[Figure 4-48: arrow diagrams for Relation R₁ and Relation R₂]





We can simplify the arrow diagram for order relations in several ways, and the
complexity of the diagram for R₂ certainly gives us ample motivation to do so. Each
of the three examples above satisfies three properties, and these properties allow us
to simplify the arrow diagrams significantly.


Let R be a binary relation on a set A. 


1. R is said to be reflexive if (a, a) ∈ R for all a ∈ A. In terms of the arrow
diagram, this means that every node has a loop.

2. A relation R is called antisymmetric if for all a, b ∈ A, if a ≠ b and (a, b) ∈ R,
then (b, a) ∉ R. In terms of the arrow diagram, this means that arrows only
go in one direction.

3. A relation R is called transitive if whenever (a, b) ∈ R and (b, c) ∈ R, it must
also be the case that (a, c) ∈ R. In terms of the arrow diagram, this means
that whenever you can follow two arrows to get from node a to node c, you
can also get there along a single arrow.
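For a relation on a finite set, each of the three properties can be tested directly from its definition. The sketch below assumes a relation is stored as a Python set of ordered pairs; the function names are illustrative, and the two sample relations are "divides" on {1, 2, 3, 6} and "differs by at most 1" on {1, 2, 3, 4, 5}.

```python
def is_reflexive(R, A):
    """Every node has a loop: (a, a) is in R for all a in A."""
    return all((a, a) in R for a in A)

def is_antisymmetric(R):
    """Arrows between distinct nodes go in one direction only."""
    return all(a == b or (b, a) not in R for (a, b) in R)

def is_transitive(R):
    """Whenever (a, b) and (b, c) are in R, so is (a, c)."""
    return all((a, d) in R for (a, b) in R for (c, d) in R if b == c)

A = [1, 2, 3, 6]
divides = {(x, y) for x in A for y in A if y % x == 0}
print(is_reflexive(divides, A), is_antisymmetric(divides), is_transitive(divides))

C = [1, 2, 3, 4, 5]
close = {(a, b) for a in C for b in C if abs(b - a) <= 1}
print(is_reflexive(close, C), is_antisymmetric(close), is_transitive(close))
```

The second relation fails antisymmetry (both (1, 2) and (2, 1) are present) and transitivity ((1, 2) and (2, 3) are present but (1, 3) is not).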





These properties lead to a simplification of the arrow diagram that is named for
the German mathematician Helmut Hasse (1898-1979). Of course, when we use a
Hasse diagram, we must convey separately that the relation being shown satisfies
these three properties, so we will use the following definition to make this easy:





Definition A relation R on a set A is called a partial order on A if R is 
antisymmetric, transitive, and reflexive. 











In this case, we will simplify the arrow diagram and refer to it as the Hasse diagram
of the partial order. A Hasse diagram will simplify the arrow diagram in three ways.

1. We will suppress the loops since these are not necessary when we know in advance
that the relation is reflexive.











[Figure 4-48, continued: arrow diagram for Relation R₃]

[Figure 4-50: Diagram for Practice Problem 1 (arrow diagram for Relation D)]

[Figure 4-49: Solution to Example 2]


2. We will use line segments instead of arrows where the direction of the relation is 
always from a node lower on the page to one that is higher on the page. Since we 
know in advance that the relation is antisymmetric, the arrows would only go in 
one direction anyway. 

3. We only draw a line segment from the (lower) node a to the (higher) node b if
(a, b) ∈ R and there is no c ∈ A that lies "between" a and b. Since we know in
advance that the relation is transitive, any time we see a line from a to b and a line
from b to c, we know that the regular arrow diagram would have an arrow from a
to c; we always suppress this extra arrow in a Hasse diagram.
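
The three simplifications can be sketched in code. The following Python fragment is a minimal illustration, assuming a partial order stored as a set of pairs; the function name hasse_edges and the sample set (the divisors of 12) are choices made here, not taken from the text.

```python
def hasse_edges(A, R):
    """Pairs (a, b) that a Hasse diagram of the partial order R would draw."""
    edges = set()
    for (a, b) in R:
        if a == b:
            continue  # loops are suppressed
        # Suppress (a, b) if some c lies strictly "between" a and b.
        between = any(c != a and c != b and (a, c) in R and (c, b) in R
                      for c in A)
        if not between:
            edges.add((a, b))
    return edges

A = {1, 2, 3, 4, 6, 12}
divides = {(a, b) for a in A for b in A if b % a == 0}

print(sorted(hasse_edges(A, divides)))
# [(1, 2), (1, 3), (2, 4), (2, 6), (3, 6), (4, 12), (6, 12)]
```

Each remaining pair (a, b) would be drawn as a line segment with a below b, so the direction of the arrow is carried by position on the page.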





Example 2 Draw the Hasse diagrams for the three partial orders in Example 1.


SOLUTION The Hasse diagrams shown in Figure 4-49 are significantly simpler
than the arrow diagrams we saw in Example 1.


Practice Problem 1 Draw the Hasse diagram for each of the following partial 
orders: 


(a) The relation D on the set E = {John, Jessica, Doug, Amy, Don, Billie, Audrey,
Jack} with the rule (a, b) ∈ D if a is a descendant of b. Assuming we allow that
people are descendants of themselves, the relation whose arrow diagram is shown
in Figure 4-50 is a partial order.

(b) Let R be the relation on the set A = {1, 2, 3, 6, 9, 18} given by the rule (a, b) ∈ R
if a divides b.


To prove properties of binary relations, we must first be sure that we adequately 
understand the definitions of the properties. Hence, before our discussion turns to 
proofs, let's consider several examples of relations that may or may not have the 
three special properties we have defined so far.


Example 3 Let A = {1, 2, 3, 4, 5}. Draw an arrow diagram for each of the
two relations on A described below, and then give a specific example that justifies each
statement that follows.













Figure 4-51 Diagram for Example 3 (Relation R and Relation S).


• Relation R is defined so that x R y means x · y is even.
• Relation S is defined so that a S b means |b − a| ≤ 1.


1. Relation R is not reflexive.
2. Relation S is not transitive.
3. The relations R and S are not antisymmetric.


SOLUTION We give the two diagrams in Figure 4-51, and then address the 
statements below. Notice how we use "double arrows" to keep the diagrams 
less cluttered. 


1. Since 1 ∈ A and (1, 1) ∉ R, R is not reflexive.
2. Since (1, 2) ∈ S, (2, 3) ∈ S, and (1, 3) ∉ S, we see that S is not transitive.
3. Since both (1, 2) ∈ R and (2, 1) ∈ R, we see that R is not antisymmetric. The
same example shows that S is not antisymmetric either.
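
Counterexamples like these can also be found by exhaustive search. The sketch below assumes the two relations are read as "x · y is even" for R and "|b − a| ≤ 1" for S, which are the readings consistent with the solution; the helper names are invented here for illustration.

```python
from itertools import product

A = range(1, 6)
R = {(x, y) for x, y in product(A, A) if (x * y) % 2 == 0}
S = {(a, b) for a, b in product(A, A) if abs(b - a) <= 1}

def not_reflexive_witness(A, rel):
    # First a with no loop, or None if the relation is reflexive.
    return next((a for a in A if (a, a) not in rel), None)

def not_transitive_witness(A, rel):
    # First (a, b, c) with a -> b and b -> c but no arrow a -> c.
    return next(((a, b, c) for a, b, c in product(A, A, A)
                 if (a, b) in rel and (b, c) in rel and (a, c) not in rel), None)

def not_antisymmetric_witness(A, rel):
    # First pair of distinct elements related in both directions.
    return next(((a, b) for a, b in product(A, A)
                 if a != b and (a, b) in rel and (b, a) in rel), None)

print(not_reflexive_witness(A, R))       # 1
print(not_transitive_witness(A, S))      # (1, 2, 3)
print(not_antisymmetric_witness(A, R))   # (1, 2)
print(not_antisymmetric_witness(A, S))   # (1, 2)
```

A single witness is all that is needed to show that a property fails, which is why each helper stops at the first one it finds.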


Practice Problem 2 For each of the following relations, decide if it is a partial
ordering. If it is not, give specific examples to show which of the three properties are
not satisfied.


(a) Let A = P({1, 2, 3}) and define the relation R1 on A so that s R1 t means n(s ∩ t) ≠ 0.

(b) Let B = {1, 2, 3, 4, 5}, and define the relation R2 on B so that x R2 y means x · y is
odd.

(c) Let C = {Rap, Ram, Cram, Map, Arm, Ramp, Camp, Car, Trap, Race, Part}, and
define the relation R3 on C so that x R3 y means every letter in word x is also in
word y.


Proofs About Properties of Relations 

As usual, the form of a proof about an abstract object is directly tied to the formal 
definitions involved. In the case of the reflexive and transitive properties for relations, 
the definitions are simple if, then statements, allowing us to incorporate the direct 
proof structure with which we have become familiar. 








The reflexive and transitive properties


Example 4 Let A = {1, 2, 3, 6, 9, 18}, and let R be the relation on A given by
R = {(x, y) ∈ A × A : y is a multiple of x}. Prove that R is reflexive.
SOLUTION To show that R is reflexive, we must prove to the READER that
"for every a ∈ A, (a, a) ∈ R." We do this by asking the READER to choose any
element of A at all and call it a, and then explaining to her how we know (even
though we do not know what value of a she picked) that (a, a) ∈ R.
Proof that R is reflexive. Let a ∈ A be given. We know that a = 1 · a, so
certainly a is a multiple of a. By the definition of R, this means that (a, a) ∈ R.
Hence, R is reflexive.











In general, a proof that a relation R on a set A is reflexive always has the following
form:


Claim. R is reflexive.


Proof Let a ∈ A be given.


Use the definition of R to conclude that (a, a) ∈ R.


Practice Problem 3 Let R be the relation on Z given by the rule R = {(a, b) ∈
Z × Z : a − b is even}. Prove that R is reflexive.


Example 5 Let R be the relation on Z given by the rule R = {(a, b) ∈ Z × Z :
a − b is even}. Prove that R is transitive.
SOLUTION The formal definition says that R is transitive means, "If (a, b) ∈
R and (b, c) ∈ R, then (a, c) ∈ R," so we can once again prove this if, then
statement directly.
Proof that R is transitive. Let a, b, and c be given so that (a, b) ∈ R and
(b, c) ∈ R. This means (by the definition of R) that a − b is even and b − c is
even. In other words, a − b = 2 · K for some integer K, and b − c = 2 · L for
some integer L. But in this case,

a − c = (a − b) + (b − c)
      = 2K + 2L
      = 2(K + L)

Since K + L is an integer, we can conclude that a − c is even. Therefore, by the
definition of R, (a, c) ∈ R. This means that R is transitive.


In general, a proof that a relation R on a set A is transitive always has the 
following form: 













Claim. R is transitive.
Proof Let a, b, c ∈ A be given such that (a, b) ∈ R and (b, c) ∈ R. Use the
definition of R to state what this means about a, b, and c.


Use the definition of R to conclude that (a, c) ∈ R.
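
A computer search over finitely many integers is not a proof, but it can serve as a sanity check on a claim such as "a − b is even defines a reflexive and transitive relation" before the proof is written. The sketch below assumes an arbitrary window of integers from −6 to 6, chosen here only for illustration, and mirrors the key step a − c = 2(K + L).

```python
window = range(-6, 7)   # arbitrary finite slice of Z, chosen for illustration
R = {(a, b) for a in window for b in window if (a - b) % 2 == 0}

reflexive_on_window = all((a, a) in R for a in window)

transitive_on_window = True
for (a, b) in R:
    for (b2, c) in R:
        if b == b2:
            K, L = (a - b) // 2, (b - c) // 2   # a - b = 2K and b - c = 2L
            # The proof's key step: a - c = 2(K + L), so (a, c) is in R.
            if a - c != 2 * (K + L) or (a, c) not in R:
                transitive_on_window = False

print(reflexive_on_window, transitive_on_window)
```

The check only confirms the pattern on the chosen window; the general argument still has to be made for arbitrary integers, as in the proof forms above.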





Practice Problem 4 Let A = {1, 2, 3, 4, 6, 9, 12, 18, 36}, and let R be the relation
on A with the rule R = {(a, b) : b is a multiple of a}. Prove that R is transitive.





The antisymmetric property 


Proofs about the antisymmetric property seem somewhat less direct because the 
definition of antisymmetric is a bit more complex. Recall that R is antisymmetric if 
the following holds: 


For all a, b ∈ A, if a ≠ b and (a, b) ∈ R, then (b, a) ∉ R.


The additional complexity in this case comes primarily from the fact that the definition
involves negative conditions, so we must rewrite it as an equivalent statement that is
easier to apply our proof strategies to.

We discussed earlier that the informal meaning of antisymmetry is that the
arrows only go in one direction, so it seems like the statement "For all a, b ∈ A, if
(a, b) ∈ R, then (b, a) ∉ R" would be just as suitable as the definition of antisymmetry.
There is only one problem. If a = b in this statement (remember that using two
different variable names does not necessarily mean that the variables cannot have the
same value), then the statement would read, "If (a, a) ∈ R, then (a, a) ∉ R." Hence,
no antisymmetric relation could be reflexive, and we do not want these properties to
have that kind of interdependence. (See Exercise 14.)

In light of this distinction, we could rephrase the informal meaning to say, "In
the arrow diagram for an antisymmetric relation, the only arrows that go in both
directions are loops." The formalization of this statement will be the form on which
we will base proofs about antisymmetry.





Proposition 1 A relation R on a set A is antisymmetric if and only if the following
statement is true:


For all a, b ∈ A, if (a, b) ∈ R and (b, a) ∈ R, then a = b.





PROOF The proof is simply a matter of verifying that this rewording of the
definition is logically equivalent to the original statement. (See Exercise 16.)


Example 6 Let A = {1, 2, 3, 4, 6, 9, 12, 18, 36} and let R be the relation on A
given by R = {(x, y) ∈ A × A : y is a multiple of x}. Prove that R is antisymmetric.
SOLUTION Proof Let a, b ∈ A be given such that (a, b) ∈ R and (b, a) ∈ R.
From the definition of R, this means that b is a multiple of a and a is a multiple
of b. This, in turn, means that a = K · b and b = L · a for some integers K and
L. Combining these equations tells us that a = K · (L · a), from which it follows
that K · L = 1. Since the elements in A are all positive, then K and L must be
positive integers, so K · L = 1 implies that K = L = 1. This, in turn, implies that
a = b.
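
The form of proof used here rests on a purely logical equivalence between the original definition of antisymmetry and its rewording. Short of the formal argument, one way to see it is to compare the two statements under all eight truth assignments. In this sketch p, q, and r stand for "a = b," "(a, b) ∈ R," and "(b, a) ∈ R"; the helper implies and the variable names are choices made here for illustration.

```python
from itertools import product

def implies(x, y):
    # Truth value of "if x, then y".
    return (not x) or y

rows = []
for p, q, r in product([True, False], repeat=3):
    original = implies((not p) and q, not r)   # if a != b and (a,b) in R, then (b,a) not in R
    reworded = implies(q and r, p)             # if (a,b) in R and (b,a) in R, then a = b
    rows.append((p, q, r, original, reworded))

for row in rows:
    print(row)

equivalent = all(original == reworded for *_, original, reworded in rows)
print(equivalent)
```

Both statements are false only in the row where a ≠ b while (a, b) and (b, a) are both in R, so their columns agree in every row.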








So even though it is not the most intuitive definition of antisymmetry, our proof
that a relation R on a set A is antisymmetric will always have the following form:


Claim. R is antisymmetric.
Proof Let a, b ∈ A be given such that (a, b) ∈ R and (b, a) ∈ R. Use the
definition of R to state what this means about a and b.


Conclude that a = b.





Practice Problem 5 Let A = P({1, 2, 3, 4}), and let R be the relation on A with
the rule R = {(s, t) ∈ A × A : s ⊆ t}. Prove that R is antisymmetric.





Other Types of Orders 


Transitivity and antisymmetry are the essential properties of "order relations." For
example, if a is smaller than b and b is smaller than c, then it will follow that a is
smaller than c. Similarly, if a is smaller than b, then it cannot be the case that b is
smaller than a. This is significant because it will be true whether "smaller" refers to
sizes of numbers, areas of triangles, or heights of people.





Example 7 Each of the following relations conveys a reasonable notion of
"order" among the elements of A. Explain why each is not a partial ordering.


1. Let A = {1, 2, 3, 4, 5} and let R = {(a, b) ∈ A × A : a < b}.
2. Let A be the set of classes required for graduation, and let P = {(c, d) ∈ A × A :
class c must be completed before starting class d}.


SOLUTION Relation R is not reflexive because, for example, the statement
"1 < 1" is not true. Similarly, relation P is not reflexive because, for example,
the statement "Calculus I must be completed before starting Calculus I" is not
true.


In each of these examples, it is the reflexive property that fails. This means that
reflexivity is not essential for a relation to seem like an "ordering." However, in
each of these examples, reflexivity not only fails but fails spectacularly. The arrow
diagrams for each of the previous relations have no loops at all. We will call relations
like this strict partial orders.







Definition A relation R on A is irreflexive if for all a ∈ A, (a, a) ∉ R. (That
is, the arrow diagram of an irreflexive relation R has no loops.) A strict partial
ordering on the set A is a relation R on A that is transitive, antisymmetric, and
irreflexive.





Practice Problem 6 Let A = {1, 2, 3, 4, 6, 12} and let
R = {(m, n) ∈ A × A : m is a proper factor of n}.


Prove that R is irreflexive.
Note that "irreflexive" does not merely mean "not reflexive," as the following
example shows.

Example 8 Draw an arrow diagram of a relation on A = {1, 2, 3, 4} that is
transitive and antisymmetric but neither reflexive nor irreflexive.

SOLUTION The relation in Figure 4-52 is not reflexive since there is no loop
at node 2. However, it is not irreflexive since there is a loop at node 1.

Figure 4-52 Solution to Example 8.


Partial orders (strict or otherwise) are relevant in many applications in the science
and business world. Scheduling problems provide one such use of these structures
in task management, a topic of common interest to businessmen, engineers,
and computer scientists. In its simplest form, we define a relation on a set of tasks so
that "a is related to b" means "Task a must be completed before task b can begin."
The following particular example shows how this situation is relevant to students at
a college.


Example 9 Let P be the relation on the set M = {211, 212, 225, 320, 322, 326,
329, 333} of math courses defined by the rule (a, b) ∈ P if course a must be completed
before beginning course b, where this "prerequisite structure" is given by the arrow
diagram in Figure 4-53. Is P a strict partial order? If so, draw the Hasse diagram* for
the relation.

Figure 4-53 Diagram for Example 9 (Relation P).


SOLUTION Since there are no loops, the relation is irreflexive. Since there
is no pair of nodes with arrows in both directions, the relation is antisymmetric.
Transitivity can also be checked by noting that any time there is a "two-arrow"
path from a node a to a node b (like 211 → 212 → 322) there is an arrow
directly from a to b. The Hasse diagram for this strict partial ordering is given
in Figure 4-54.
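
In scheduling terms, a strict partial order of prerequisites always admits at least one order in which the tasks can be completed. The sketch below uses Python's standard graphlib module (Python 3.9 or later) to produce one such order. Apart from the chain 211 → 212 → 322 mentioned in the solution, the prerequisite pairs listed are hypothetical stand-ins, since the arrow diagram itself is not reproduced here.

```python
from graphlib import TopologicalSorter

# Pairs (a, b): course a must be completed before course b.
# Only 211 -> 212 -> 322 comes from the text; the rest are hypothetical.
prereq = {(211, 212), (212, 322), (211, 322),
          (211, 225), (225, 320), (211, 320)}

# TopologicalSorter expects a mapping: node -> set of its predecessors.
graph = {}
for a, b in prereq:
    graph.setdefault(a, set())
    graph.setdefault(b, set()).add(a)

schedule = list(TopologicalSorter(graph).static_order())
print(schedule)   # one valid order; several orders may be possible
```

Because some pairs of courses are incomparable, the schedule is generally not unique, which is the "partial" in partial order showing up in practice.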


Another variation on the relations we have studied comes from considering the
difference between relations R1 and R2 in Example 1:


Figure 4-54 Solution to Example 9 (the Hasse diagram for P).

* When we give such a Hasse diagram, we must make it clear that the relation is a strict order so
that the reader knows whether the relation is reflexive or irreflexive. This will not be apparent from the
diagram alone!













• The relation R1 on the set {1, 2, 3, 4} with the rule "(x, y) ∈ R1 if x ≤ y."
• The relation R2 on the set P({1, 2, 3}) with the rule "(x, y) ∈ R2 if x ⊆ y."





Relation R1 forces a decision about every pair of numbers, while R2 allows for
the possibility that some pairs of sets are completely unrelated. An unrelated pair in
a partial ordering is called an incomparable pair, and it is the existence of these pairs
that makes the ordering "partial." If we know for a fact that every pair is comparable,
then we can say that the ordering is total instead of partial.

















Definition We say that R is a total ordering on A if R is a reflexive, transitive,
and antisymmetric relation on A that also satisfies the property


For all a, b ∈ A, if a ≠ b, either (a, b) ∈ R or (b, a) ∈ R.


As with partial orderings, a strict total ordering has the same properties except
that it is irreflexive instead of reflexive.
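
The four kinds of ordering differ only in which properties hold, so a finite relation can be classified by checking them in turn. The following sketch is an illustration under the assumption that a relation is stored as a set of pairs; the function classify and its return strings are invented here.

```python
from itertools import combinations

def classify(A, R):
    A = list(A)
    pairs = list(combinations(A, 2))          # unordered pairs of distinct elements
    reflexive = all((a, a) in R for a in A)
    irreflexive = all((a, a) not in R for a in A)
    antisymmetric = all(not ((a, b) in R and (b, a) in R) for a, b in pairs)
    transitive = all((a, c) in R for (a, b) in R for (b2, c) in R if b == b2)
    comparable = all((a, b) in R or (b, a) in R for a, b in pairs)
    if not (antisymmetric and transitive and (reflexive or irreflexive)):
        return "none of these"
    kind = "total ordering" if comparable else "partial ordering"
    return kind if reflexive else "strict " + kind

N = [1, 2, 3, 4]
subsets = [frozenset(c) for k in range(4) for c in combinations([1, 2, 3], k)]

leq = {(x, y) for x in N for y in N if x <= y}
subset_of = {(s, t) for s in subsets for t in subsets if s <= t}
proper_subset_of = {(s, t) for s in subsets for t in subsets if s < t}

print(classify(N, leq))                     # total ordering
print(classify(subsets, subset_of))         # partial ordering
print(classify(subsets, proper_subset_of))  # strict partial ordering
```

The subset relation is only partial because pairs such as {1} and {2} are incomparable, while every pair of numbers is comparable under ≤.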











Example 10 Let A = P({1, 2, 4, 8}). Identify each of the following relations on
A as a partial ordering, a total ordering, a strict partial ordering, a strict total ordering,
or none of these:


1. The relation R1 is defined so that (S, T) ∈ R1 means every element of S is less than
or equal to every element of T.

2. The relation R2 is defined so that (S, T) ∈ R2 means n(S) < n(T).

3. The relation R3 is defined so that (S, T) ∈ R3 means the sum of the elements in S is
less than or equal to the sum of the elements in T.


SOLUTION Relation R1 is a partial ordering, R2 is a strict partial ordering,
and R3 is a total ordering. If any of these surprise you, draw the appropriate
Hasse diagram to check!


Practice Problem 7 Let B be the set of binary strings of length exactly 4. Label
each of the following relations on B as a partial order, a strict partial order, a total
order, a strict total order, or none of these.


(a) R1 = {(α, β) ∈ B × B : α has fewer 1's than β has}.

(b) (For those who are familiar with Section 2.6.) Let V : B → N be the function such
that V(α) is the value of the binary numeral α. For example, V(1100) = 12 and
V(0011) = 3. Let R2 = {(α, β) ∈ B × B : V(α) ≤ V(β)}.


Summary 


There are two important classes of binary relations: "order relations" and "equivalence
relations." We have seen several types of order relation in this section, and
discussed how their behavior can be explained in terms of abstract properties such
as reflexivity, antisymmetry, and transitivity. We have also continued practicing
our logic and proof-writing skills by proving these properties do or do not hold
for particular relations.


Figure 4-55 Solution to Practice Problem 1 (Hasse diagrams for Relation D and Relation R).


Solutions to Practice Problems 


1. The two Hasse diagrams are given in Figure 4-55.

2. None of the relations are partial orders.

(a) R1 is not antisymmetric since ({1}, {1, 2}) ∈ R1
and ({1, 2}, {1}) ∈ R1. R1 is not transitive since
({1, 2}, {2, 3}) ∈ R1 and ({2, 3}, {3}) ∈ R1, but
({1, 2}, {3}) ∉ R1. R1 is also not reflexive since
(∅, ∅) ∉ R1.

(b) R2 is not antisymmetric since (1, 3) ∈ R2 and (3, 1) ∈
R2. R2 is not reflexive since (2, 2) ∉ R2.

(c) R3 is not antisymmetric since (Arm, Ram) ∈ R3 and
(Ram, Arm) ∈ R3.

3. Proof. Let a ∈ Z be given. Clearly, a − a = 0, and 0 is
even since 0 = 2 · 0. Thus, it follows from the definition
of R that (a, a) ∈ R. Hence, R is reflexive.

4. Proof. Let a, b, c ∈ A be given so that (a, b) ∈ R and
(b, c) ∈ R. This means (by the definition of R) that b is
a multiple of a and c is a multiple of b. In other words,
b = a · K for some integer K, and c = b · L for some
integer L. But in this case,

c = bL = (aK)L = a(KL)

Since KL is an integer, we can conclude that c is
a multiple of a. Therefore, by the definition of R,
(a, c) ∈ R.

5. Proof that R is antisymmetric. Let S, T ∈ P({1, 2, 3, 4})
be given such that (S, T) ∈ R and (T, S) ∈ R. By the
definition of R, this means that S ⊆ T and T ⊆ S. But
by the definition of equality of sets, this means that
S = T.

6. Proof that R is irreflexive. Let a ∈ {1, 2, 3, 4, 6, 12} be
given. According to the definition of "proper factor," a
proper factor of a must be smaller than a. Hence, a cannot
be a proper factor of a. Therefore, by the definition
of R, (a, a) ∉ R. We can conclude from this that R is
irreflexive.

7. R1 is a strict partial order, and R2 is a total order.


Exercises for Section 4.4


1. Complete the arrow diagram (Figure 4-56) for each of
the following relations on A = {1, 2, 3, 4, 5, 6, 7, 8}, and
decide if it has any of the reflexive, antisymmetric, or
transitive properties. For each property a relation does
not have, illustrate this failure with a specific example.
(a) R1 = {(1, 1), (1, 2), (1, 4), (1, 8), (2, 2), (2, 4), (2, 8),
(3, 3), (3, 6), (4, 4), (4, 8), (5, 5), (6, 6), (7, 7), (8, 8)}

(b) R2 = {(2, 2), (2, 4), (2, 6), (2, 8), (3, 1), (4, 4), (4, 6),
(4, 8), (5, 1), (5, 3), (6, 6), (6, 8), (7, 1), (7, 3), (7, 5),
(8, 8)}











(©) Ry = ((1, 1). 0,3) 0,5). 0,7). (2 2), 24 
(3,3), (3.5), 3,7), (4, 2), (4, 4), (4,8). (5.3) 
4) 


a. L8). 
a. 
(6.6). (6.8), (8,2), (8,4). (8, 8)] 
a. 
" 


), (2. 
).(5.7). 





(1,3). 0.5), (1,7). (2,2), (2, 4), (2.6). (2.8), 


p 
7). 4.4). (4,6), (4, 8). (5. 7). (6.6). (6.8). 





Figure 4-56 Diagrams for Problem 1.

2. For each of the following relations on Z, decide if the
relation is reflexive or irreflexive. If it does not have one
(or both) of these properties, give a specific example to
illustrate this.
(a) R1 = {(a, b) ∈ Z × Z : a + b is even}
(b) R2 = {(a, b) ∈ Z × Z : a + b is odd}
(c) R3 = {(a, b) ∈ Z × Z : a + 2b is even}
3. For each of the following relations on Z, decide if the
relation is antisymmetric. If it is not, give a specific example
to illustrate this.
(a) R1 = {(a, b) ∈ Z × Z : a + b is even}
(b) R2 = {(a, b) ∈ Z × Z : a² + b is odd}
(c) R3 = {(a, b) ∈ Z × Z : ab + b is odd}
4. For each of the following relations on Z, decide if the
relation is transitive. If it is not, give a specific example
to illustrate this.
(a) R1 = {(a, b) ∈ Z × Z : a + b is even}
(b) R2 = {(a, b) ∈ Z × Z : a + b is odd}
(c) R3 = {(a, b) ∈ Z × Z : a + 2b is even}
5. Consider the two relations whose diagrams are shown
in Figure 4-57.
(a) Describe each of the relations by filling in the
blanks:
• R1 = {(a, b) ∈ A × A : ______}, where A = ______
• R2 = {(a, b) ∈ B × B : ______}, where B = ______
(b) Are either of these relations reflexive? If not, add
arrows to the diagram to make the relation reflexive.
Give a description in words of this new relation
in each case.
(c) Are either of these relations transitive? If not, add
arrows to the diagram to make the relation transitive.
Give a description in words of this new relation
in each case.

Figure 4-57 Diagrams for Problem 5.


6. Let A be the set of letters in the English alphabet. For
each of the following relations on A, decide if it is reflexive,
irreflexive, transitive, or antisymmetric. (Each
can satisfy more than one of these properties.)

(a) R1 = {(α, β) ∈ A × A : α immediately precedes β
in alphabetical order}

(b) R2 = {(α, β) ∈ A × A : α comes before β in alphabetical
order}

7. Let P be the set of people who have ever lived. For each
of the following relations on P, decide if it is reflexive, irreflexive,
transitive, or antisymmetric. (Each can satisfy
more than one of these properties.)

(a) R1 = {(α, β) ∈ P × P : α is a child of β}
(b) R2 = {(α, β) ∈ P × P : α is a descendant of β}

8. Let C be the set of airports in the world. For each of
the following relations on C, decide if it is reflexive,
irreflexive, transitive, or antisymmetric. (Each can satisfy
more than one of these properties.)

(a) R1 = {(α, β) ∈ C × C : There is a direct flight from
α to β}










(b) R2 = {(α, β) ∈ C × C : There is a direct flight between
α and β}

(c) R3 = {(α, β) ∈ C × C : One can travel between
cities α and β by airplane(s)}

9. For a non-empty finite set of numbers S, define max S to
be the largest number in the set and min S the smallest
number in the set S.

For each of the following relations on A =
P({1, 2, 3, 4}) − {∅}, decide if it is reflexive, irreflexive,
transitive, or antisymmetric. (Each can satisfy more
than one of these properties.)

(a) R1 = {(S, T) ∈ A × A : max S < max T}

(b) R2 = {(S, T) ∈ A × A : max S < min T}

(c) R3 = {(S, T) ∈ A × A : max S < max T}

For each property that a relation fails to satisfy, give a
specific example to illustrate this.

10. For a finite set of numbers S, define σ(S) to be the sum
of the numbers in the set S, where σ(∅) = 0.

For each of the following relations on A =
P({1, 2, 3, 4}), decide if it is reflexive, irreflexive, transitive,
or antisymmetric. (Each can satisfy more than one
of these properties.)

(a) R1 = {(S, T) ∈ A × A : σ(S) < σ(T)}

(b) R2 = {(S, T) ∈ A × A : σ(S) < σ(T)}

(© Rye (S.T) £A x Aso(S—T) < (SAT) 

For each property that a relation fails to satisfy, give a 
specific example to illustrate this. 

11. Decide whether the relation R on the set {1, 2, 3, 4, 5}
defined below is reflexive, antisymmetric, or transitive.
Predict what this property (or properties) will imply
about the diagram of R. Draw the diagram to see if you
are correct.


R = {(1, 1), (1, 2), (1, 4), (2, 2), (2, 1), (2, 4), (3, 3),
(3, 4), (4, 4)}
12. Let the relation R on the set E = {David, Mary, Doug,
Rob, Don, Billie, John, Jessica, Andrew, Allen, Jacob,
Christopher, Saache, Tommy} be given by

R = {(a, b) : a is a son of b}

For those of you who do not know these people, this
means

R = {(David, Don), (Doug, Don), (Rob, Don),
(David, Billie), (Doug, Billie), (Rob, Billie),
(Andrew, Mary), (Allen, Mary), (Jacob, Mary),
(John, Doug), (Christopher, Rob),
(Tommy, David)}

If we think of these people as being numbered
1, 2, 3, ..., 12, 13, 14 in the order they appear above,
what is the diagram for R? Does this relation have reflexive,
irreflexive, antisymmetric, or transitive properties?

13. Write one sentence about what property shows up in
the arrow diagram
(a) Of a reflexive relation.
(b) Of an antisymmetric relation.
(c) Of a transitive relation.

14. Let A = {1, 2, 3}. Give an example of a relation R on A
that is
(a) Transitive and reflexive but not antisymmetric.
(b) Antisymmetric and reflexive but not transitive.
(c) Antisymmetric and transitive but not reflexive.

15. Let S = {1, 2, 3}. For each of the following relations on
P(S), draw the arrow diagram and decide if the relation
is reflexive, antisymmetric, or transitive. If it fails
any of these properties, give a specific example to illustrate
this. If it has all three properties (i.e., if it is a partial
ordering), give the corresponding Hasse diagram.

(a) R1 = {(A, B) ∈ P(S) × P(S) : A ⊆ B}

(b) R2 = {(A, B) ∈ P(S) × P(S) : B − A = {1}}
40) Ry ALB) € P) x PIS) à 1B VI 
(d) R4 = {(A, B) ∈ P(S) × P(S) : A ∩ B ≠ ∅}
(e) R5 = {(A, B) ∈ P(S) × P(S) : n(A) < n(B)}

16. Fill in the missing details in the following proof of
Proposition 1.

Proof. Let a relation R on a set A be given, and let
a, b ∈ A be given as well. The equivalence of the two
characterizations of antisymmetry is based on the logical
equivalence of the statements rather than on context.
Consider the following propositional variables:
• Let p be the statement "a = b."
• Let q be the statement "(a, b) ∈ R."
• Let r be the statement "(b, a) ∈ R."

In terms of p, q, and r, the compound proposition "If
a ≠ b and (a, b) ∈ R, then (b, a) ∉ R" is written as

(¬p ∧ q) → ¬r        (*)

Similarly, the compound proposition "If (a, b) ∈ R and
(b, a) ∈ R, then a = b" is written as

______

Place this statement in the rightmost column of the
truth table in Table 4-13, and show that it is equivalent
to (*). Since the compound propositions are equivalent
regardless of the choice of a and b, it follows
that the two characterizations of antisymmetry are
equivalent.













Table 4-13 Table for Exercise 16 (a truth table with a column for (¬p ∧ q) → ¬r and a blank rightmost column)


17. Prove or give a counterexample to each of the following
statements:
(a) If R is a reflexive relation on A, then R⁻¹ is a reflexive
relation on A.
(b) If R is an antisymmetric relation on A, then R⁻¹ is
an antisymmetric relation on A.
(c) If R is a transitive relation on A, then R⁻¹ is a transitive
relation on A.
18. Prove or give a counterexample to each of the following
statements:
(a) If R is a reflexive relation on A, then R ∘ R is a
reflexive relation on A.
(b) If R is an antisymmetric relation on A, then R ∘ R
is an antisymmetric relation on A.
(c) If R is a transitive relation on A, then R ∘ R is a
transitive relation on A.
19. Give two different examples of a relation R on the set
A = {a, b, c, d} satisfying R ∘ R = R.
20. Since the rule for a relation on A is technically a
subset of A × A, we can use the usual set operations
to combine relations on the same set. Let
A = {0, 1, 2, 3, 4} and define relations R1 and R2
by R1 = {(0, 1), (1, 2), (0, 3), (1, 4), (0, 0)} and R2 =
{(0, 0), (0, 2), (2, 1), (2, 2)}.

(a) What is R1 ∪ R2?
(b) What is R1 ∩ R2?
(c) What is R1 − R2?

21. Prove or give a counterexample to each of the following
statements for a given set A:

(a) If R1 and R2 are reflexive relations on A, then the
relation R1 ∪ R2 is reflexive.
(b) If R1 and R2 are reflexive relations on A, then the
relation R1 ∩ R2 is reflexive.

(c) If R1 and R2 are antisymmetric relations on A, then
the relation R1 ∪ R2 is antisymmetric.

(d) If R1 and R2 are antisymmetric relations on A, then
the relation R1 ∩ R2 is antisymmetric.

22. Prove that for any relation R on a set A, if R is transitive,
then R ∘ R ⊆ R.

23. Prove that for any relation R on a set A, if R ∘ R ⊆ R,
then R is transitive.














4.5 Equivalence Relations 





There are many times when we would like to express a relationship in which objects
in a set are thought of as being "the same" for some particular application. For
example, in geometry, we might treat two triangles as being "the same" if they are
similar triangles. In arithmetic, we treat two fractions a/b and c/d as "the same"
if ad = bc. Or perhaps a teacher assigns a group project and gives the same grade
to everyone in the same group. In each of these examples, there is an underlying
binary relation for which the phrase "a is related to b" loosely means that "a and b
are treated the same."










A relation of this type is called an equivalence relation.* In this section, we give
a formal definition of equivalence relation, discuss the properties that characterize
these relations, and continue to develop our proof-writing skills with these abstract
mathematical objects.


Equivalence Relations and Partitions 


We study binary relations on a set because they are able to capture important information
within a very simple structure. For example, the two relations below reflect
basic properties of positive integers as they apply to the numbers in the set
A = {1, 2, 3, 4, 5, 6}.


Example 1 Draw the arrow diagram for each of the following relations:


1. R1 = {(a, b) ∈ A × A : b ≤ a}
2. R2 = {(a, b) ∈ A × A : a − b is even}


SOLUTION The diagrams for R1 and R2 are shown in Figure 4-58.


Relation R2 is our first example of an equivalence relation. In this case, two
numbers are treated as being "the same" if they have the same parity, that is, if
they are both even or both odd. In this way, the relation R2 divides the set A into
two disjoint (i.e., nonoverlapping) subsets: {1, 3, 5}, every two elements of which are
related by R2, and {2, 4, 6}, every two elements of which are related by R2. We call
this collection of subsets a partition of A. This is the central feature of our definition
of an equivalence relation.











Figure 4-58. Diagrams for Example 1.


* Here we think of "equivalence" as being specific to the application at hand, always to be distinct from
the notion of "equal" that we will continue using to mean "identical."










Definition For a set A, a partition of A is a set S = {S_1, S_2, S_3, ...} of
subsets of A (each set S_i is called a part of S) such that


1. For all i, S_i ≠ ∅. That is, each part is nonempty.

2. For all i and j, if S_i ≠ S_j, then S_i ∩ S_j = ∅. That is, different parts have
nothing in common.

3. S_1 ∪ S_2 ∪ S_3 ∪ ··· = A. That is, every element in A is in some part.











Definition For a set A and a relation R on A, R is an equivalence relation
on A if there is a partition S of A such that (x, y) ∈ R if and only if x and y
are in the same part of S. In this case, we will refer to S as the partition of A
induced by R.
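
For a finite set, this definition suggests a direct test: collect, for each x, the set of elements related to x, and check whether those sets form a partition that reproduces R. The following Python sketch illustrates the idea; the function name induced_partition is invented here, and the second relation (sharing a common factor greater than 1) is included only as an example that fails the test.

```python
from math import gcd

def induced_partition(A, R):
    """Return the partition of A induced by R, or None if R is not an
    equivalence relation in the partition sense of the definition."""
    # Candidate part containing x: everything x is related to.
    parts = {frozenset(y for y in A if (x, y) in R) for x in A}
    # Each part must be nonempty.
    if any(len(p) == 0 for p in parts):
        return None
    # Parts must cover A and be pairwise disjoint (sizes add up to |A|).
    if set().union(*parts) != set(A) or sum(len(p) for p in parts) != len(A):
        return None
    # (x, y) is in R exactly when x and y lie in the same part.
    same_part = {(x, y) for p in parts for x in p for y in p}
    return parts if same_part == set(R) else None

A = range(1, 7)
parity = {(a, b) for a in A for b in A if (a - b) % 2 == 0}
print(induced_partition(A, parity))

B = range(1, 10)
common_factor = {(a, b) for a in B for b in B if gcd(a, b) > 1}
print(induced_partition(B, common_factor))   # None
```

The parity relation yields the two parts {1, 3, 5} and {2, 4, 6}, while the common-factor relation yields no partition at all.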











Example 2 Imagine a simplified card game where a player is dealt two face-up
cards from a "deck" of five cards numbered 1, 2, 3, 4, 5. Figure 4-59 shows every
possible "hand." If the rules of the game call for the value of your hand to be based on
which cards you have and not the order in which they are received, then some of these
hands (like hand 1 and hand 5, e.g.) would be considered the same. If the relation R on
the set of hands shown above is defined so that (x, y) ∈ R means hands x and y are
considered the same in the game, then what is the partition induced by this relation?


SOLUTION If we use the shorthand H_i for "hand i," we can efficiently write
this partition as follows:
{{H1, H5}, {H2, H9}, {H3, H13}, {H4, H17}, {H6, H10},
{H7, H14}, {H8, H18}, {H11, H15}, {H12, H19}, {H16, H20}}
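
A partition like this one can be generated by grouping hands that contain the same two cards. The sketch below assumes the hands are numbered in the order (1, 2), (1, 3), ..., (5, 4), that is, by first card and then by second card; that numbering is an assumption made here, chosen so that hand 1 and hand 5 hold the same cards, as the example states.

```python
from itertools import permutations

cards = [1, 2, 3, 4, 5]

# Assumed numbering: hand 1 = (1, 2), hand 2 = (1, 3), ..., hand 20 = (5, 4).
hands = {i: h for i, h in enumerate(permutations(cards, 2), start=1)}

# Two hands are "the same" when they contain the same set of cards.
parts = {}
for i, hand in hands.items():
    parts.setdefault(frozenset(hand), []).append(i)

partition = sorted(parts.values())
print(partition)
# [[1, 5], [2, 9], [3, 13], [4, 17], [6, 10], [7, 14], [8, 18], [11, 15], [12, 19], [16, 20]]
```

Changing the key used for grouping (for example, to some other feature of the hand) would produce the partition induced by a different notion of "the same."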
Practice Problem 1 In Example 2, suppose we play a game in which the rules
treat two hands as the same if the sum of the values on the cards is the same. What is
the partition induced by this relation?


Example 3 Determine which of the following relations are equivalence relations
on the given sets by describing the induced partition:


1. A = {1, 2, 3, 4, 5, 6, 7, 8, 9}, and the relation R on A given by the rule (a, b) ∈ R
means that a and b have a common factor greater than 1.

2. The relation R on Z given by the rule (x, y) ∈ R means that x − y is divisible by 4.

3. A = {0, 1, 2, 3, 4, 5, 6} × {1, 2, 3}, and the relation R on A given by the rule where
(a, b) R (c, d) means that ad = bc.





SOLUTION The first relation is not an equivalence relation since in a parti- 
tion, it would be impossible for 4 and 6 to be in the same part, 6 and 9 to be in 
the same part, and 4 and 9 not to be in the same part. The other two relations 
are equivalence relations with the following induced partitions. 








Figure 4-59 Possible card hands for Example 2.




















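2. The set $\mathbb{Z}$ of integers is partitioned into four parts by this relation:

$$\{\ldots, -4, 0, 4, 8, \ldots\}$$
$$\{\ldots, -3, 1, 5, 9, \ldots\}$$
$$\{\ldots, -2, 2, 6, 10, \ldots\}$$
$$\{\ldots, -1, 3, 7, 11, \ldots\}$$

Another way to describe this partition of the infinite set $\mathbb{Z}$ is as the set $\{P_0, P_1, P_2, P_3\}$, where

• $P_0 = \{a \in \mathbb{Z} : a = 4k \text{ for some } k \in \mathbb{Z}\}$
• $P_1 = \{a \in \mathbb{Z} : a = 4k + 1 \text{ for some } k \in \mathbb{Z}\}$
• $P_2 = \{a \in \mathbb{Z} : a = 4k + 2 \text{ for some } k \in \mathbb{Z}\}$
• $P_3 = \{a \in \mathbb{Z} : a = 4k + 3 \text{ for some } k \in \mathbb{Z}\}$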


3. The induced partition of $A$ is the set

$$\{S_0, S_1, S_2, S_3, S_4, S_5, S_6, S_{1/2}, S_{3/2}, S_{5/2}, S_{1/3}, S_{2/3}, S_{4/3}, S_{5/3}\}$$

where

$S_0 = \{(0,1), (0,2), (0,3)\}$        $S_{1/2} = \{(1,2)\}$
$S_1 = \{(1,1), (2,2), (3,3)\}$        $S_{3/2} = \{(3,2)\}$
$S_2 = \{(2,1), (4,2), (6,3)\}$        $S_{5/2} = \{(5,2)\}$
$S_3 = \{(3,1), (6,2)\}$               $S_{1/3} = \{(1,3)\}$
$S_4 = \{(4,1)\}$                      $S_{2/3} = \{(2,3)\}$
$S_5 = \{(5,1)\}$                      $S_{4/3} = \{(4,3)\}$
$S_6 = \{(6,1)\}$                      $S_{5/3} = \{(5,3)\}$

Figure 4-60 Diagram of the symmetric relation in Example 4.

4.5 Equivalence Relations 317


Practice Problem 2 Draw arrow diagrams for the following equivalence relations. In each case, give the partition on $A$ induced by the relation.

(a) For $A = \{0, 1, 2, 3, 4, 5, 6\}$, $R_1 = \{(x, y) \in A \times A : (x - 2.5)(y - 2.5) > 0\}$
(b) For $A = \{0, 1, 2, 3, 4, 5, 6\}$, $R_2 = \{(x, y) \in A \times A : x^3 - y^3 \text{ is divisible by } 7\}$





Two properties of equivalence relations should be immediately apparent. For any reasonable notion of "the same," we should know that any element is "the same" as itself, and we should know that the statement "a is the same as b" is equivalent to the statement "b is the same as a." The first of these properties should look familiar: An equivalence relation must be reflexive. The latter property is one we have not discussed before, but it is related to the notion of antisymmetry that we studied in our discussion of order relations.

















Definition A relation $R$ on set $A$ is said to be symmetric if for all $a, b \in A$, if $(a, b) \in R$, then $(b, a) \in R$.











In terms of arrow diagrams, a symmetric relation has the property that every 
pair of nodes connected by an arrow is actually connected by two arrows, one in each 
direction. So only loops and “double arrows" will be used in the arrow diagrams of 
symmetric relations. 


Example 4 Let $A = \{\text{Tommy}, \text{Christopher}, \text{Allen}, \text{Jacob}, \text{Andrew}\}$, and define the relation $R$ on $A$ so that $x\, R\, y$ means $x$ and $y$ share at least one letter of their names. Draw the arrow diagram for the relation $R$.


SOLUTION The arrow diagram is given in Figure 4-60.


As with other properties, we can write proofs about the symmetric property based on the straightforward if, then structure of the definition.


Example 5 For the relation $R$ on the set $\mathbb{Z}$ given by $R = \{(x, y) \in \mathbb{Z} \times \mathbb{Z} : x - y \text{ is even}\}$, prove that $R$ is symmetric.


SOLUTION To show that $R$ is symmetric, we must show that, "If $(a, b) \in R$, then $(b, a) \in R$," so we can prove this if, then statement directly by paying close attention to the definition of $R$.

Proof. Let $a, b \in \mathbb{Z}$ be given so that $(a, b) \in R$. This means (by the definition of $R$) that $a - b$ is even. In other words, $a - b = 2 \cdot K$ for some integer $K$. But in this case, $b - a = -(a - b) = 2 \cdot (-K)$, and since $-K$ is an integer too, we can conclude that $b - a$ is even as well. Therefore, by the definition of $R$, $(b, a) \in R$. We conclude that $R$ is symmetric.

















In general, a proof that a relation $R$ on a set $A$ is symmetric always has the following form.













Claim. R is symmetric. 
Proof. Let $a, b \in A$ be given such that $(a, b) \in R$. Use the definition of $R$ to say what this means about $a$ and $b$.


Use the definition of R to conclude that (b, a) € R. 
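For a relation on a finite set, the condition in the definition of symmetric can also be tested exhaustively. The sketch below is illustrative only: the function name `is_symmetric` and the two sample relations are assumptions, and a finite check of this kind is a sanity test, not a substitute for the proof template above when the set is infinite.

```python
def is_symmetric(relation):
    """Return True when (b, a) is in the relation for every (a, b) in it."""
    return all((b, a) in relation for (a, b) in relation)

# "x - y is even", restricted to a finite window of integers.
window = range(-5, 6)
even_difference = {(x, y) for x in window for y in window if (x - y) % 2 == 0}

# "x divides y" on {1, ..., 6}, which is not symmetric: 1 divides 2 but 2 does not divide 1.
divides = {(x, y) for x in range(1, 7) for y in range(1, 7) if y % x == 0}

print(is_symmetric(even_difference))
print(is_symmetric(divides))
```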





Practice Problem 3 Let $A = \{0, 1, 2, 3, 4, 5\}$ and let $R$ be the relation on $A$ given by the rule $R = \{(a, b) : a^2 - b^2 \text{ is a multiple of } 3\}$.





(a) Prove that R is reflexive. 
(b) Prove that R is symmetric. 





It seems reasonable that any equivalence relation should be reflexive and symmetric. The next example shows that to be an equivalence relation, a relation must satisfy more than just these two properties.


Example 6 Let $S$ be the set of all students at your school this semester, and let the relation $R$ be given so that $(x, y) \in R$ means students $x$ and $y$ have a class together this semester. Explain why $R$ is reflexive and symmetric but is not an equivalence relation on $S$.


SOLUTION Certainly, any student enrolled this semester has a class with 
him or herself, so R is reflexive. The relation R is clearly symmetric because the 
statements “x and y have a class together” and "y and x have a class together” 
express the same thing. However, it is easy to imagine students like Amy, Beth, 
and Carolyn, where Amy and Beth have a class together, Beth and Carolyn 
have a different class together, and Amy and Carolyn do not have any classes 
together. Since the parts of a partition may not overlap, it is impossible for 
there to be a partition P where Amy and Beth are in the same part, Beth and 
Carolyn are in the same part, but Amy and Carolyn are not in the same part.


In a plot twist worthy of an action thriller, it turns out the property that will 
alleviate this particular problem is our old friend, transitivity. 


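Example 7 Each of the following relations on $\mathbb{Z}$ is reflexive and symmetric. Which of them is also transitive?


1. $T_1 = \{(a, b) \in \mathbb{Z} \times \mathbb{Z} : b - a \text{ is divisible by } 5\}$
2. $T_2 = \{(a, b) \in \mathbb{Z} \times \mathbb{Z} : a^2 - b^2 \text{ is divisible by } 5\}$
3. $T_3 = \{(a, b) \in \mathbb{Z} \times \mathbb{Z} : |a - b| \le 2\}$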





SOLUTION The relations $T_1$ and $T_2$ are both transitive (see Exercise 9), but relation $T_3$ is not transitive. To see this, simply notice that $(1, 3) \in T_3$ and $(3, 4) \in T_3$ but $(1, 4) \notin T_3$.
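A failure of transitivity like the one for $T_3$ can be found by brute force on a finite piece of the relation. The sketch below is hedged accordingly: the function name `transitivity_counterexample` and the finite windows are assumptions for illustration, and finding no counterexample in a window does not by itself prove transitivity on all of $\mathbb{Z}$.

```python
def transitivity_counterexample(relation):
    """Return (a, b, c) with (a, b), (b, c) in the relation but (a, c) not, or None."""
    pairs = sorted(relation)              # sorted so the search order is repeatable
    for (a, b) in pairs:
        for (b2, c) in pairs:
            if b2 == b and (a, c) not in relation:
                return (a, b, c)
    return None

window = range(-10, 11)
T1 = {(a, b) for a in window for b in window if (b - a) % 5 == 0}
T3 = {(a, b) for a in window for b in window if abs(a - b) <= 2}

print(transitivity_counterexample(T1))    # no counterexample exists in this window
print(transitivity_counterexample(T3))    # some triple witnessing the failure
```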













Practice Problem 4 Let $C = \{2, 3, 4, 5, 6, 7, 8, 9, 10\}$, and define the relation $R$ on $C$ so that $(a, b) \in R$ means the greatest common divisor of $a$ and $b$ is greater than 1. Is this relation transitive? Is it reflexive or symmetric?


To show that transitivity really is the last piece of the puzzle, we will formally prove that a binary relation that is reflexive, symmetric, and transitive must be an equivalence relation.


Theorem 1 If the relation $R$ on the set $A$ is reflexive, symmetric, and transitive, then $R$ is an equivalence relation.





PROOF For each element $a \in A$, define the set $P_a = \{x \in A : (x, a) \in R\}$. That is, $P_a$ is the set of elements in $A$ that are related to $a$. To prove that the set $\{P_a : a \in A\}$ is a partition of $A$, we must verify each of the three properties in the definition of partition from page 315.


1. Claim: Each $P_a$ is nonempty.
Proof. For each $a \in A$, $(a, a) \in R$ because $R$ is reflexive. From this, it follows that $a \in P_a$; hence, $P_a$ is nonempty.

2. Claim: For all $a, b \in A$, if $P_a \cap P_b \neq \emptyset$, then $P_a = P_b$. (Note that this is actually the contrapositive of the statement given in the definition.)
Proof. Let $a, b \in A$ be given such that $P_a \cap P_b \neq \emptyset$. This means that there is an element $c \in P_a \cap P_b$. This, in turn, means that $(c, a) \in R$ and $(c, b) \in R$. Now the symmetry of $R$ tells us that $(a, c) \in R$ and $(b, c) \in R$. We can now use the fact that $R$ is transitive. From our knowledge that $(a, c) \in R$ and $(c, b) \in R$, we can conclude (by transitivity) that $(a, b) \in R$. Similarly, from our knowledge that $(b, c) \in R$ and $(c, a) \in R$, we can conclude (by transitivity) that $(b, a) \in R$. See Exercise 5 for details on how to conclude that $P_a = P_b$.

3. Claim: The union of all the $P_a$'s is the set $A$.
Proof. For each $a \in A$, $(a, a) \in R$ because $R$ is reflexive. From this, it follows that for any given $a \in A$, we must have $a \in P_a$, and so every element of $A$ is certainly in the union of all the $P_a$'s.
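The construction in this proof is easy to try on a small example. The sketch below builds the sets $P_a$ for one reflexive, symmetric, and transitive relation and checks the three partition properties. The helper name `related_sets` and the choice of relation (difference divisible by 4 on $\{0, \ldots, 11\}$) are assumptions made only for illustration.

```python
def related_sets(A, R):
    """Map each a in A to P_a = {x in A : (x, a) in R}, as in the proof."""
    return {a: frozenset(x for x in A if (x, a) in R) for a in A}

A = range(12)
R = {(x, y) for x in A for y in A if (x - y) % 4 == 0}

P = related_sets(A, R)
parts = set(P.values())                   # equal P_a's collapse to a single part

nonempty = all(len(p) > 0 for p in parts)                              # claim 1
disjoint_or_equal = all(p == q or not (p & q) for p in parts for q in parts)  # claim 2
covers_A = set().union(*parts) == set(A)                               # claim 3

print(sorted(sorted(p) for p in parts))
print(nonempty, disjoint_or_equal, covers_A)
```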

" 


In Theorem 1, we established that any relation on a set A that is reflexive, sym- 
metric, and transitive must be an equivalence relation—that is, the relation induces 
a partition on A. Now that we have the tools for proving that a relation has these 
properties, we can show that the converse of Theorem 1 is true. We illustrate the idea 
with an example, and leave the more general proof as an exercise.











Example 8 Let $S$ be the set of students in a class where a group project is assigned, and define the relation $G$ on $S$ with the rule

$(a, b) \in G$ if $a$ and $b$ are in the same group.

Assume that every student is part of exactly one group. Explain why $G$ is reflexive, symmetric, and transitive.


SOLUTION We have to show that G is reflexive, symmetric, and transitive, 
so we take one at a time. 







• Claim: $G$ is reflexive.
Proof. Let $a \in S$ be given. Certainly $a$ is in the same group as him or herself. Hence, $(a, a) \in G$.
• Claim: $G$ is symmetric.
Proof. Let $(a, b) \in G$ be given. The definition of $G$ tells us that this means $a$ and $b$ are in the same group. But in this case, we can say $(b, a) \in G$ as well.
• Claim: $G$ is transitive.
Proof. Let $(a, b) \in G$ and $(b, c) \in G$ be given. The definition of $G$ tells us that this means $a$ and $b$ are in the same group, and $b$ and $c$ are in the same group. Since each student is part of only one group, this means that $a$ and $c$ are in the same group. Hence, we can conclude that $(a, c) \in G$.

Theorem 2 An equivalence relation R on a set A is reflexive, symmetric, and 
transitive. 

PROOF Generalize the solution to Example 8. See Exercise 23.


Since we have established this statement and its converse, we have completely 
characterized equivalence relations in terms of their abstract properties. 


Theorem 3 A relation $R$ on a set $A$ is an equivalence relation on $A$ if and only if $R$ is reflexive, symmetric, and transitive.


Practice Problem 5 We have seen that the following relations are reflexive, symmetric, and transitive. Describe the partition of $\mathbb{Z}$ induced by each equivalence relation.

(a) $T_1 = \{(a, b) \in \mathbb{Z} \times \mathbb{Z} : b - a \text{ is divisible by } 5\}$
(b) $T_2 = \{(a, b) \in \mathbb{Z} \times \mathbb{Z} : a^2 - b^2 \text{ is divisible by } 5\}$





Solutions to Practice Problems 


1. The partition of the set of hands (using the notation from Example 2) is

$$\{\{H_1, H_5\}, \{H_2, H_9\}, \{H_3, H_6, H_{10}, H_{13}\}, \{H_4, H_7, H_{14}, H_{17}\}, \{H_8, H_{11}, H_{15}, H_{18}\}, \{H_{12}, H_{19}\}, \{H_{16}, H_{20}\}\}$$

2. The arrow diagrams are given in Figure 4-61. The partition of $A$ induced by $R_1$ is $\{\{0, 1, 2\}, \{3, 4, 5, 6\}\}$. The partition of $A$ induced by $R_2$ is $\{\{0\}, \{1, 2, 4\}, \{3, 5, 6\}\}$.

3. This requires two separate proofs.

(a) Proof. Let $a \in A$ be given. Since $a^2 - a^2 = 3 \cdot 0$, we know that $a^2 - a^2$ is a multiple of 3, and hence, $(a, a) \in R$.

(b) Proof. Let $a$ and $b$ be given so that $(a, b) \in R$. This means (by the definition of $R$) that $a^2 - b^2$ is a multiple of 3. In other words, $a^2 - b^2 = 3 \cdot K$ for some integer $K$. But in this case, $b^2 - a^2 = -(a^2 - b^2) = 3 \cdot (-K)$. Since $-K$ is an integer, we can conclude that $b^2 - a^2$ is a multiple of 3 as well. Therefore, by the definition of $R$, $(b, a) \in R$.

4. This relation is reflexive since the greatest common divisor of $a$ and itself is $a$, and all elements of $C$ are greater than 1. The relation is also symmetric since the greatest common divisor of two numbers does not depend on the order in which the two numbers are given. However, the given relation is not transitive since, for example, 4 and 6 are related (their greatest common divisor is 2) and 6 and 9 are related (their greatest common divisor is 3), but 4 and 9 are not related.

5. We learned in Section 2.2 that for integers $a$ and $b$, $a - b$ is divisible by 5 if and only if $a$ and $b$ leave the same remainder when divided by 5. Hence, there are five parts in the partition of $\mathbb{Z}$ induced by equivalence relation $T_1$. That is, $T_1$ induces the partition $\{P_0, P_1, P_2, P_3, P_4\}$ of $\mathbb{Z}$, where

• $P_0 = \{n \in \mathbb{Z} : n \text{ is divisible by } 5\} = \{\ldots, -10, -5, 0, 5, 10, \ldots\}$
• $P_1 = \{n \in \mathbb{Z} : n - 1 \text{ is divisible by } 5\} = \{\ldots, -9, -4, 1, 6, 11, \ldots\}$
• $P_2 = \{n \in \mathbb{Z} : n - 2 \text{ is divisible by } 5\} = \{\ldots, -8, -3, 2, 7, 12, \ldots\}$
• $P_3 = \{n \in \mathbb{Z} : n - 3 \text{ is divisible by } 5\} = \{\ldots, -7, -2, 3, 8, 13, \ldots\}$


Exercises for Section 4.5 


1. Complete the arrow diagram in Figure 4-62 for each of the following relations on the set $A = \{1, 2, 3, 4, 5, 6, 7, 8\}$, and decide if it has any of the reflexive, symmetric, or transitive properties. For each property a relation does not have, illustrate this failure with a specific example.

(a) $R_1 = \{(1,1), (1,2), (1,4), (1,8), (2,1), (2,2), (2,4), (2,8), (3,3), (3,6), (4,1), (4,2), (4,4), (4,8), (5,5), (6,3), (6,6), (7,7), (8,1), (8,2), (8,4), (8,8)\}$

0) Ra = (01, 4), (1, 8). (2,2), 2.4). (2,6), Q, 8). 3.3). 
(3,4), (3. 6). (4 1). (4, 2), C8, 3), 4,4), (4, 5). (4,6), 
(47), (8. 8). (5. 4). (5. 5). (6,2), (6,3), (6.4). (6.6), 
(6.8), (7.4). (7. 7). (8. 1) (8. 2), (8, 4), (8,6), (8, 8)) 








Figure 4-61 Solution for Practice Problem 2.

• $P_4 = \{n \in \mathbb{Z} : n - 4 \text{ is divisible by } 5\} = \{\ldots, -6, -1, 4, 9, 14, \ldots\}$

Similarly, the partition of $\mathbb{Z}$ induced by equivalence relation $T_2$ is determined by remainders on division by 5. Specifically, $T_2$ induces the partition $\{A, B, C\}$ of $\mathbb{Z}$, where

• $A = P_0$ (from above)
• $B = P_1 \cup P_4$ (from above)
• $C = P_2 \cup P_3$ (from above)








49 Ry = (0,0, 0,3) 0,5) (1, 7), (2, 2). (2,4), (2,6), 
(2.8), (3,5), (3,7), (4,4), (4,6), (4,8), (5,7). (6.6). 
(6,8), (8,8)) 

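2. For each of the following relations on $\mathbb{Z}$, decide if the relation is symmetric. If it is not, give a specific example to illustrate this.
(a) $R_1 = \{(a, b) \in \mathbb{Z} \times \mathbb{Z} : a + b \text{ is even}\}$

(b) $R_2 = \{(a, b) \in \mathbb{Z} \times \mathbb{Z} : a + b \text{ is odd}\}$

(c) $R_3 = \{(a, b) \in \mathbb{Z} \times \mathbb{Z} : a + 2b \text{ is even}\}$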

3. Which of the following relations are symmetric, which are antisymmetric, and which are neither? Support any negative conclusions with a specific example.





Figure 4-62 Diagrams for Exercise 1.


(a) The relation $R$ on the set $A = \{2, 3, 5, 7\}$ is defined by the rule $R = \{(x, y) \in A \times A : x + y \text{ is even}\}$.

(b) The relation $R$ on the set $\mathbb{Z}$ is defined by the rule $R = \{(x, y) \in \mathbb{Z} \times \mathbb{Z} : xy + y \text{ is even}\}$.

(c) The relation $R$ on the set $A = \mathcal{P}(\{1, 2, 3, 4\})$ is defined by the rule $R = \{(s, t) \in A \times A : \sigma(s) = \sigma(t)\}$, where $\sigma(X)$ is the sum of the elements in the set $X$.

(d) The relation $R$ on the set $\mathbb{Z}$ is defined by the rule $R = \{(x, y) \in \mathbb{Z} \times \mathbb{Z} : 2x + y \text{ is even}\}$.

4. Which of the following are partitions of the set A = 
11.2, 17,8]? For each that is not, explain why 
not. 

(9) (1.2. (3.4.5), (6.7.81) 

b) ((1.5). 16.7, 2444,3. 5]. 181) 

(©) 41.4]. (6.8, 2). 5]. 7)) 

0) (11.8), 8.3.5]. 7.20] 

5. In the proof of Theorem 1, we showed that $(a, b) \in R$ and $(b, a) \in R$. Fill in the blanks below to complete the proof that $P_a = P_b$.

Proof of Theorem 1 Continued. To show that $P_a = P_b$, we must show that each set is a subset of the other.

Claim. $P_a \subseteq P_b$.

Proof. Let $z \in P_a$ be given. From the definition of $P_a$, we know that ______ $\in R$, so by the ______ property of $R$, it follows that ______ $\in R$ as well. But this means that $z \in P_b$, completing the proof of this claim.

Claim. $P_b \subseteq P_a$.

Proof. Let $z \in P_b$ be given. From the definition of $P_b$, we know that ______ $\in R$, so by the ______ property of $R$, it follows that ______ $\in R$ as well. But this means that $z \in P_a$, completing the proof of this claim.

6. Imagine four people (A, B, C, and D) lined up to enter a restaurant. There are 24 different orders in which they can wait in line:

ABCD ABDC ACBD ACDB ADBC ADCB
BCAD BCDA BDAC BDCA CABD CADB
CDAB CDBA DABC DACB DBAC DBCA
BACD BADC CBAD CBDA DCAB DCBA

When the four people are seated at a round table (assuming they are seated counterclockwise in the order in which they are in line), some of the line arrangements end up looking the same. For example, whether the people are seated in the order ABCD or CDAB, the seating at the table looks like that in Figure 4-63. Note that each person only cares about who is to his or her left and who is to his or her right, not about which particular chair he or she sits in. If we think of the relation $R$ on the set $S$ of line arrangements with the rule $(X, Y) \in R$ if arrangements $X$ and $Y$ are the same when seated at a round table, what is the partition of the set $S$ induced by $R$?

Figure 4-63 Example for Problem 6.

7. Imagine a company that makes charm bracelets, each of which has four charms A, B, C, and D. In the manufacturing process, the charms are lined up to enter a threading machine. There are 24 different orders in which they can enter the machine:

ABCD ABDC ACBD ACDB ADBC ADCB
BCAD BCDA BDAC BDCA CABD CADB
CDAB CDBA DABC DACB DBAC DBCA
BACD BADC CBAD CBDA DCAB DCBA

Once the charms are threaded on a bracelet, many of these arrangements look the same. If we think of the relation $R$ on the set $B$ of line arrangements with the rule $(X, Y) \in R$ if arrangements $X$ and $Y$ look the same when threaded on a bracelet, what is the partition of the set $B$ induced by $R$?
|. 2, 3). For each of the following relations on 
P(S), draw the arrow diagram and decide if the rela- 
tion is reflexive, symmetric, or transitive. If i is all three 
(ie., an equivalence relation), give the corresponding 
partition of P(S). 
VA. B) e P(S) x P(S): A € B) 
KA. B) € P(S) x P(S): A c B) 
KA. B) € P(S) x P(S): A NB =0) 
KA. B) € P(S) x P(S): A NB #0) 
KA. B) € P(S) x P(S) + n(A) = n(B)) 
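9. To complete Example 7, prove that each of the following relations on $\mathbb{Z}$ is transitive:
(a) $T_1 = \{(a, b) \in \mathbb{Z} \times \mathbb{Z} : b - a \text{ is divisible by } 5\}$
(b) $T_2 = \{(a, b) \in \mathbb{Z} \times \mathbb{Z} : a^2 - b^2 \text{ is divisible by } 5\}$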
10. Let $A = \{0, 1, 2, 3, 4, 5, 6\}$ and $R = \{(x, y) \in A \times A : x - y \text{ is divisible by } 3\}$.
(a) Prove that $R$ is reflexive.
(b) Prove that $R$ is symmetric.
(c) Prove that $R$ is transitive.
(d) Describe the partition of $A$ induced by the equivalence relation $R$.

11. Let $n \ge 2$ be given. Repeat the previous exercise for the relation

$R = \{(x, y) \in \mathbb{Z} \times \mathbb{Z} : x - y \text{ is divisible by } n\}$

on the set $A = \mathbb{Z}$.


Let the relation R on the set E = (David, Mary, Doug. 
Rob, Don, Billie, John, Jessica, Andrew, Allen, Jacob, 
Christopher, Saache, Tommy] be given by 


a.b) +a is a son of b] 
For those of you who do not know these people, this 











David, Don), (Doug, Don). (Rob, Don), (David. 
Billie), (Doug, Billie). (Rob, Billie). (Andrew, 
Mary). (Allen, Mary), (Jacob, Mary), (John, Doug), 
(Christopher, Rob), (Tommy, David)] 

What is the meaning of the relation Ro R? Does the 

relation R o R have reflexive, symmetric, or transitive. 

properties? If it has all three, give the partition of the 

set E induced by the relation Ro R“. 

13. Write one sentence about what property shows up in the arrow diagram

(a) Of a reflexive relation.

(b) Of a symmetric relation.

(c) Of a transitive relation.

14. Let $A = \{1, 2, 3\}$. Give an example of a relation $R$ on $A$ that is

(a) Transitive and reflexive but not symmetric.

(b) Symmetric and reflexive but not transitive.

(c) Symmetric and transitive but not reflexive.

15. Prove or give a counterexample to each of the following statements about symmetric relations:

(a) If $R$ is a symmetric relation on $A$, then $R^{-1}$ is a symmetric relation on $A$.

(b) If $R$ is a symmetric relation on $A$, then $R \circ R$ is a symmetric relation on $A$.

(c) If $R$ is any relation on $A$, then $R \circ R^{-1}$ is a symmetric relation on $A$.

(d) If $R$ is any relation on $A$, then $R^{-1} \circ R$ is a symmetric relation on $A$.

16. Since the rule for a relation on $A$ is technically a subset of $A \times A$, we can use the usual set operations to combine relations on the same set. Let

define relations Ry and R: by Ry = ((0.1), (1,2). (0, 

3.4). 4,3)}and Re = (0.0). (1.2). 2.1), 8.4), (4.3). 

6.3). (4.9). 

(2) Whatis Ri U Ri? 

(b) What is Ri n Rz? 

(e) Whatis Ra — Ri? 

43) Find the smallest transitive relation Ry’ such that 
Ry C Ry’. (Sucha relation R' is called the transitive 
closure of R.) 

(e) Find the transitive closure of relation Rz. 

17. Prove or give a counterexample to each of the following statements:

(a) For relations $R_1$ and $R_2$ on $A$, if $R_1$ is symmetric and $R_2 \subseteq R_1$, then $R_2$ is symmetric.

(b) For relations $R_1$ and $R_2$ on $A$, if $R_1$ is reflexive and $R_2 \subseteq R_1$, then $R_2$ is reflexive.

18. Give a counterexample to each of the following claims about the set $A = \{1, 2, 3, 4, 5\}$.

(a) If $R_1$ is a reflexive relation on $A$ and $R_2$ is a symmetric relation on $A$, then the relation $R_1 \cap R_2$ is both reflexive and symmetric.

(b) If $R_1$ is a reflexive relation on $A$ and $R_2$ is a symmetric relation on $A$, then the relation $R_1 \cup R_2$ is both reflexive and symmetric.

(c) If $R_1$ is a transitive relation on $A$ and $R_2$ is a transitive relation on $A$, then the relation $R_1 \cup R_2$ is also transitive.

‘The elation RU R^! canlendinsight into the properties 

of the original relation R. 

),1,2,3} and R = {(0, 1), (0,0). (1.2). 

(0.3). Q.2)). Whatis RU R? 

(6) Is R UR“ always symmetric? 
(e) What is special about RU R^! when the relation R 
is symmetric? 

20. Prove each of the following statements for a given set $A$.
(a) If $R_1$ and $R_2$ are symmetric relations on $A$, then the relation $R_1 \cup R_2$ is symmetric.
(b) If $R_1$ and $R_2$ are symmetric relations on $A$, then the relation $R_1 \cap R_2$ is symmetric.

21. Prove for any relation $R$ on a set $A$, if $R$ is symmetric, then $R = R^{-1}$. (Hint: Show both $R \subseteq R^{-1}$ and $R^{-1} \subseteq R$ to establish the conclusion.)

22. Prove for any relation $R$ on a set $A$, if $R = R^{-1}$, then $R$ is symmetric.

23. Prove Theorem 2. 










4.6 Numerical Functions in Discrete Math 





In this chapter, we have shown that functions are more general than mere manip- 
ulations of numbers, but we cannot deny that certain numerical functions are very 
important in discrete math. We complete the chapter by using some of the more 
important numerical functions for specific applications. 


Exponents and Logarithms 

A question that frequently arises in discrete math concerns the size of a finite struc- 
ture. This often comes up within the context of representing the structure in a com- 
puter (which of course has a finite amount of space), but it can be a central issue in 
mathematics applications as well. The simplest example of this is when the structure 
is simply a large integer. 


Example 1 How many digits are in the number $2^{1000}$?


If $2^{1000} = d_n 10^n + d_{n-1} 10^{n-1} + \cdots + d_1 10 + d_0$ with $d_n \neq 0$, then we know $10^n \le 2^{1000} < 10^{n+1}$. Hence, we need to know what power of 10 is approximately equal to $2^{1000}$. The logarithm function does exactly this.





Definition The base ten logarithm of a positive real number $r$ is the real number $p$ for which $10^p = r$. In this case, we write $\log_{10} r = p$.


Example 2 Note that since $10^0 = 1$, we write $\log_{10} 1 = 0$. Other values require at least a little bit of work.

1. Since $10^3 = 1{,}000$, we write $\log_{10} 1{,}000 = 3$.
2. Since $10^{0.69897} \approx 5$ (try it on your calculator and see!), we write $\log_{10}(5) \approx 0.69897$.
3. Since $10^{0.30103} \approx 2$ (try it on your calculator and see!), we write $\log_{10}(2) \approx 0.30103$.


We now know that we can find the number of digits in $2^{1000}$ by calculating $\log_{10}(2^{1000})$, but this is not entirely helpful if we have to calculate $2^{1000}$ and then find the base ten logarithm of that answer. If we could calculate $2^{1000}$, we would have already answered the question! However, if we think about the familiar rules for exponents in the right way, we can derive some properties of logarithms that will make finding the answer easy!


Theorem 1 The following properties hold for all real numbers $x$ and $y$:
1. $(10^x)^y = 10^{xy}$
2. $10^x \cdot 10^y = 10^{x+y}$
3. $10^x / 10^y = 10^{x-y}$








We will not prove these properties here (although they are simple to derive for integer values of $x$ and $y$ as you will see in the exercises), but we will use them to derive the corresponding properties of the logarithm function.


Theorem 2 The following properties hold for all positive real numbers $a$ and $b$:

1. $\log_{10}(a^b) = b \cdot \log_{10}(a)$
2. $\log_{10}(a \cdot b) = \log_{10}(a) + \log_{10}(b)$
3. $\log_{10}(a/b) = \log_{10}(a) - \log_{10}(b)$


PROOF We will derive only the first two parts, and we will leave the third part as a practice problem.

1. Let $c = \log_{10}(a)$. This means that $a = 10^c$, and so $a^b = (10^c)^b = 10^{bc}$ by rule #1 of Theorem 1. This means that $\log_{10}(a^b) = b \cdot c = b \cdot \log_{10}(a)$, as desired.

2. Let $x = \log_{10}(a)$ and $y = \log_{10}(b)$. Then $a = 10^x$ and $b = 10^y$, which means that

$$a \cdot b = 10^x \cdot 10^y = 10^{x+y} \quad \text{by rule \#2 of Theorem 1}$$

But this means that $\log_{10}(a \cdot b) = x + y = \log_{10}(a) + \log_{10}(b)$, as desired.

3. See Practice Problem 1 below.

We can now solve the problem given in Example 1.


SOLUTION In Example 2 above, we found that $\log_{10}(2) \approx 0.30103$. From rule #1 of Theorem 2,

$$\log_{10}(2^{1000}) = 1{,}000 \cdot \log_{10}(2) \approx 1{,}000 \cdot (0.30103) = 301.03$$

Hence, $10^{301} < 2^{1000} < 10^{302}$, so we conclude that $2^{1000}$ is a 302-digit number.
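The digit count can be double-checked in a language with exact big-integer arithmetic. The sketch below, in Python, compares the logarithm estimate with a direct count of the decimal digits. The variable names are illustrative only.

```python
import math

# Estimate from the logarithm rule: log10(2**1000) = 1000 * log10(2).
log_estimate = 1000 * math.log10(2)
digits_from_log = math.floor(log_estimate) + 1

# Direct count: Python integers are exact, so 2**1000 can be written out in full.
digits_exact = len(str(2**1000))

print(log_estimate, digits_from_log, digits_exact)
```

The `+ 1` reflects the fact that a number between $10^n$ and $10^{n+1}$ has $n + 1$ digits.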


Practice Problem 1 Show how the third part of Theorem 2 can be derived from 
the third part of Theorem 1. 


The logarithm properties in Theorem 2 allow us to answer a number of different questions about the size of large integers.


Example 3 How many digits does 1,000! have?

As before, the real problem here is to compute $\log_{10}(1{,}000!)$ without computing 1,000! itself first. We would like to take advantage of the product structure of 1,000!, so we investigate a simpler, similar problem. Rule #2 in Theorem 2 gives us a clue about how to find the $\log_{10}$ of a number that is expressed as a product. This is all we need to find the answer to the question in Example 3.











SOLUTION Since $1{,}000! = 1{,}000 \cdot 999 \cdot 998 \cdots 4 \cdot 3 \cdot 2 \cdot 1$, property #2 of Theorem 2 can be used to write

$$\log_{10}(1{,}000!) = \log_{10}(1{,}000) + \log_{10}(999) + \log_{10}(998) + \cdots + \log_{10}(2) + \log_{10}(1)$$

The right-hand side of this equation involves summing 1,000 numbers, so it still seems computationally challenging, but this approach alleviates our biggest problem before in that each of the values on the right-hand side is very easy to compute with a calculator. For example, $\log_{10}(1{,}000) = 3$, $\log_{10}(999) \approx 2.9995655$, $\log_{10}(998) \approx 2.99913054$, and so on. To obtain the final answer, it would be easiest to get either a spreadsheet or a calculator involved.* The final answer is $\log_{10}(1{,}000!) \approx 2{,}567.6$, so we conclude that

$$10^{2567} < 1{,}000! < 10^{2568}$$

which means that 1,000! has 2,568 digits.
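The sum of 1,000 logarithms is a natural job for a short program. The sketch below adds the logarithms as in the solution and then compares the result against a direct digit count of the exact factorial. The variable names are illustrative only.

```python
import math

# log10(1000!) = log10(1) + log10(2) + ... + log10(1000), by rule #2 of Theorem 2.
log_sum = math.fsum(math.log10(k) for k in range(1, 1001))
digits_from_log = math.floor(log_sum) + 1

# Exact check using big-integer arithmetic.
digits_exact = len(str(math.factorial(1000)))

print(round(log_sum, 1), digits_from_log, digits_exact)
```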


The only thing special about the base of ten is the connection to the number of digits that is tied to our base ten place value system of writing numerals. Since any positive number $b \neq 1$ can be used as a base for exponentiation, it follows that any such $b$ can also be the base of a logarithm.





Definition For any positive real number $b \neq 1$, we write $\log_b r = p$ to mean that $b^p = r$, and we call $p$ the base $b$ logarithm of $r$.








Because the exponent rules in Theorem 1 work for bases other than ten, the 
same rules that apply to base ten logarithms also apply to logarithms in any base. It 
is arguable whether the most important base for discrete math is base two or base 
ten, but base two logarithms are certainly the most common type of logarithm in 
this particular text. Because computers represent numbers internally as base two 
numbers, then the number of base two digits in a number is typically more of a 
concern than base ten digits. 


Example 4 Suppose your calculator can display at most 10 digits. What is the largest power of 2 that your calculator can display exactly? Explain how you can get the answer just from knowing that $\log_{10}(2) \approx 0.30103$.

SOLUTION The first 11-digit number is $10^{10}$, so we need to know the largest value of $m$ such that

$$2^m < 10^{10}$$

This is the same as saying that $\log_{10}(2^m) < 10$, which by rule #1 of Theorem 2 means the same thing as saying that $m \cdot \log_{10}(2) < 10$. Since we know that $\log_{10}(2) \approx 0.30103$, this last inequality is roughly the same as saying $m \cdot 0.30103 < 10$, which is true of any $m < 33.22$. We conclude that $2^{33}$ is the largest power of 2 that is less than $10^{10}$.

* On a TI-83 calculator, for example, the command sum(seq(log(n),n,1,500)) yields the answer 1134.0864... after a few dramatic seconds.


Example 5 Suppose you can represent positive integers up to $2^{31} - 1$. What is the largest value of $n$ for which you can represent $n^n$ exactly? (Note: $2^{31} - 1$ is the largest number that can be represented by a signed 32-bit integer in a computer, so your answer tells you the largest value of $n$ for which $n^n$ can be represented as such an integer.)


SOLUTION We want to find the largest integer $n$ with $n^n < 2^{31}$, so we want $\log_2(n^n) < 31$. But by the properties of logarithms, we know that $\log_2(n^n) = n \log_2(n)$, so we can fill in Table 4-14 to show values for this expression. This means that 9 is the largest value of $n$ for which $n^n$ can be represented.

n               5      6      7      8      9      10
n log_2(n)      11.6   15.5   19.7   24.0   28.5   33.2

Table 4-14 Table for Example 5
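Both of these answers can be confirmed with a few lines of code. The sketch below recomputes the estimate for Example 4 from the logarithm, recomputes the values of $n \log_2(n)$ for Example 5, and checks each answer against exact integer arithmetic. The variable names and search bounds are illustrative choices.

```python
import math

# Example 4: largest m with 2**m < 10**10.
m_estimate = math.floor(10 / math.log10(2))          # 10 / 0.30103... is about 33.2
m_exact = max(m for m in range(1, 100) if 2**m < 10**10)

# Example 5: largest n with n**n <= 2**31 - 1.
limit = 2**31 - 1
for n in range(5, 11):
    # n * log2(n) is compared against 31 in the solution.
    print(n, round(n * math.log2(n), 1), n**n <= limit)
n_largest = max(n for n in range(1, 50) if n**n <= limit)

print(m_estimate, m_exact, n_largest)
```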





Practice Problem 2 Suppose you can represent positive integers up to $2^{63} - 1$. What is the largest value of $n$ for which you can represent $n!$ exactly? (Note: $2^{63} - 1$ is the largest number that can be represented by a signed 64-bit integer in a computer, so your answer tells you the largest value of $n$ for which $n!$ can be represented as such an integer.)






From Reals to Integers 


Arguably, the most frequently used numerical functions in discrete math are the 
functions that convert fractional numbers to integers. We are taught very early on 
rules for rounding to the nearest whole number, but there are even more basic 
functions than this. 








Definition We define the functions floor and ceiling, each as functions $\mathbb{R} \to \mathbb{Z}$, as follows:

• $\lfloor x \rfloor$ = the greatest integer less than or equal to $x$.
• $\lceil x \rceil$ = the least integer greater than or equal to $x$.








Example 6 Table 4-15 shows how these rules work on a few different input values. From the definitions of the floor and ceiling functions, explain how these two functions are related to each other.

Table 4-15 Table for Example 6

SOLUTION We can describe the relationship in two cases as follows:

$$\lceil x \rceil = \begin{cases} \lfloor x \rfloor & \text{if } x \in \mathbb{Z} \\ \lfloor x \rfloor + 1 & \text{if } x \in \mathbb{R} - \mathbb{Z} \end{cases}$$

A more clever relationship is the following, which one can check is equivalent to the one above:

For all $x \in \mathbb{R}$, $\lceil x \rceil = -\lfloor -x \rfloor$
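Both relationships between floor and ceiling can be spot-checked numerically. The sketch below uses exact fractions (quarters between -3 and 3, an arbitrary sample chosen for illustration) so that no floating-point rounding interferes. A check like this supports the identities but does not prove them for all real numbers.

```python
import math
from fractions import Fraction

samples = [Fraction(n, 4) for n in range(-12, 13)]    # -3, -11/4, ..., 11/4, 3

def two_case_rule(x):
    """Ceiling described by cases: floor(x) for integers, floor(x) + 1 otherwise."""
    return math.floor(x) if x.denominator == 1 else math.floor(x) + 1

agree = all(
    math.ceil(x) == two_case_rule(x) == -math.floor(-x)
    for x in samples
)
print(agree)
```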


One of the common uses of the floor function is to find the number of multiples of a given value in a range of integers. The floor function naturally arises in this context.


Example 7 How many multiples of 6 are in the set $\{1, 2, 3, \ldots, 1{,}000\}$?

SOLUTION The set of multiples of 6 can be written as

$$\{1 \cdot 6, 2 \cdot 6, 3 \cdot 6, 4 \cdot 6, \ldots\}$$

so the question is simply about the largest integer $m$ such that $m \cdot 6 \le 1{,}000$. We can find this by dividing $1{,}000/6 = 166.666\ldots$, so we know that $166 \cdot 6 = 996 \le 1{,}000$ but $167 \cdot 6 = 1{,}002 > 1{,}000$. Hence, there are $\lfloor 1{,}000/6 \rfloor = 166$ multiples of 6 in the set $\{1, 2, 3, \ldots, 1{,}000\}$.











This example can be easily generalized to give the following basic fact about the 
floor function: 


Proposition 3 Let $k$ and $n$ be positive integers. The number of multiples of $k$ in the set of integers $\{1, 2, 3, \ldots, n\}$ is given by $\lfloor n/k \rfloor$.


Practice Problem 3 Use the floor function to express a formula for the number of multiples of $k$ in the set $\{m, m + 1, m + 2, \ldots, n\}$. (Hint: How can you use the result of Proposition 3 directly?)





Example 8 Use the floor function to find the number of numbers in the set
$S = \{1, 2, 3, \ldots, 1{,}000\}$ that are evenly divisible by 3 or 7.

SOLUTION Let $A = \{n \in S : n \text{ is divisible by } 3\}$ and $B = \{n \in S : n \text{ is divisible by } 7\}$. According to the inclusion-exclusion principle from Section 3.1,

$$n(A \cup B) = n(A) + n(B) - n(A \cap B)$$

But Proposition 3 tells us that $n(A) = \lfloor 1{,}000/3 \rfloor$ is simply the number of multiples of 3 in S, $n(B) = \lfloor 1{,}000/7 \rfloor$ is simply the number of multiples of 7 in S, and $n(A \cap B) = \lfloor 1{,}000/21 \rfloor$ is simply the number of multiples of 21 in S. Hence,

$$n(A \cup B) = n(A) + n(B) - n(A \cap B) = \left\lfloor \tfrac{1{,}000}{3} \right\rfloor + \left\lfloor \tfrac{1{,}000}{7} \right\rfloor - \left\lfloor \tfrac{1{,}000}{21} \right\rfloor = 333 + 142 - 47 = 428$$

□
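The inclusion-exclusion count in Example 8 can be sketched in a few lines of Python. The function name `count_divisible_by_either` is a hypothetical label, and using the least common multiple (rather than the plain product) for the overlap is an assumption made so that the sketch also covers divisors that share a factor; for 3 and 7 the least common multiple is 21, as in the example.

```python
from math import gcd


def count_divisible_by_either(n: int, a: int, b: int) -> int:
    """Count the integers in {1, ..., n} divisible by a or by b."""
    lcm = a * b // gcd(a, b)            # numbers divisible by both a and b
    return n // a + n // b - n // lcm   # n(A) + n(B) - n(A intersect B)


print(count_divisible_by_either(1000, 3, 7))  # 333 + 142 - 47 = 428
```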


We end this brief discussion with an application of the floor function to another 
question about large integers. 


Example 9 How many consecutive 0's does the (base ten) number 72! have
on its right end?


SOLUTION If we imagine the large number

$$72 \cdot 71 \cdot 70 \cdots 3 \cdot 2 \cdot 1$$

written as a product of its prime factors,* it will look something like this:

$$2^{p_1} \cdot 3^{p_2} \cdot 5^{p_3} \cdot 7^{p_4} \cdots$$

From this point of view, each consecutive 0 on the right end of this large number comes from one of the 2's and one of the 5's in this prime factorization. It seems sensible (and turns out to be true) that there are always more factors of 2 than factors of 5 in n!, so the number of 0's at the right end is exactly the same as the value of $p_3$ in the prime factorization above. Hence the question becomes, “What is the power of 5 in the prime factorization of 72!?”

Consider the numbers being multiplied to form 72!:

$$1 \cdot 2 \cdot 3 \cdot 4 \cdot \underline{5} \cdot 6 \cdot 7 \cdot 8 \cdot 9 \cdot \underline{10} \cdots 64 \cdot \underline{65} \cdot 66 \cdot 67 \cdot 68 \cdot 69 \cdot \underline{70} \cdot 71 \cdot 72$$

Every fifth number is underlined in the product above, since each of these numbers (and no others) contributes a factor of 5 to the prime factorization of 72!. We cannot write them all in the space above, but we know there would be $\lfloor 72/5 \rfloor = 14$ underlined numbers. Those numbers in the list that are multiples of 25 each contribute an additional factor of 5 to the prime factorization. There are only two such numbers, 25 and 50, in this particular list. There are no multiples of 125 or higher powers of 5 in the list above, so the total number of factors of 5 in 72! is

$$\left\lfloor \tfrac{72}{5} \right\rfloor + \left\lfloor \tfrac{72}{25} \right\rfloor = 14 + 2 = 16$$

Therefore, there are 16 zeroes on the right end of 72!. If you have a computer algebra system, you can compute the 104-digit number

72! = 61234458376886086861524 ··· 689274204160000000000000000

to check that this is correct. □

* The fact that such a representation exists was proven in Section 2.4.


Practice Problem 4 How many consecutive 0's are on the right end of 1,000!?


The process we used for the previous example answers a more general question
about the prime factorization of factorial numbers, which we state as a proposition 
below. This is an important idea for some interesting number theoretic results con- 
cerning the distribution of prime numbers. 


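Proposition 4 For a positive integer n and a prime number p, the power of p in
the prime factorization of n! is given by

$$\left\lfloor \frac{n}{p} \right\rfloor + \left\lfloor \frac{n}{p^2} \right\rfloor + \left\lfloor \frac{n}{p^3} \right\rfloor + \left\lfloor \frac{n}{p^4} \right\rfloor + \cdots$$

PROOF See Exercise 26 at the end of this section. □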
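The sum in Proposition 4 has only finitely many nonzero terms, because $\lfloor n/p^j \rfloor = 0$ as soon as $p^j > n$. A minimal Python sketch of the computation follows; the names `prime_power_in_factorial` and `trailing_zeros` are illustrative assumptions, and the comparison against `math.factorial` is only a sanity check for Example 9.

```python
import math


def prime_power_in_factorial(n: int, p: int) -> int:
    """Exponent of the prime p in n!, as floor(n/p) + floor(n/p^2) + ..."""
    total = 0
    power = p
    while power <= n:          # later terms are all zero
        total += n // power
        power *= p
    return total


def trailing_zeros(n: int) -> int:
    """Number of 0's at the right end of n! in base ten (the power of 5)."""
    return prime_power_in_factorial(n, 5)


print(trailing_zeros(72))    # 14 + 2 = 16, as in Example 9
digits = str(math.factorial(72))
print(len(digits) - len(digits.rstrip("0")))  # 16 again, counted directly
```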


Composition of Numerical Functions 

There are many reasons for studying the composition of functions. Perhaps the most
practical reason comes from the way we solve numerical problems in real life. We
always have some fixed set of tools like our knowledge of specific function properties,
our calculator's fixed set of function keys, or a programming language with a given set
of mathematical functions. In all these cases, we must be able to combine these basic 
tools as necessary to represent new functions or to carry out complex operations 
based on the simple ones. 


Example 10 A useful mathematical application of logarithms and integer functions is the calculation of the length of the base b representation of a number. Find an easy function to make this calculation, using the basic functions we have studied in this section.

SOLUTION The base b representation of a number n comes from the unique representation

$$n = d_k b^k + d_{k-1} b^{k-1} + \cdots + d_1 b + d_0$$

where each $d_i$ is from $\{0, 1, 2, \ldots, b-1\}$. (These are the base b “digits” of n.) The length of this representation is k + 1, but how is k related to n? Since $b^k \le n < b^{k+1}$, it follows that $k \le \log_b n < k + 1$, and so $k = \lfloor \log_b n \rfloor$. We state our simple formula as the general proposition below.





Proposition 5 For any integer $b \ge 2$, the number of digits in the base b representation of a positive integer n is given by the function

$$\lfloor \log_b n \rfloor + 1$$

which is built out of the $\log_b$ function, the floor function, and addition.
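Proposition 5 can be checked with a short Python sketch. The function name `num_digits` is an illustrative assumption. The sketch finds $k = \lfloor \log_b n \rfloor$ by repeated integer division instead of calling a floating-point logarithm, since a rounded logarithm can land just below an integer when n is an exact power of b; this is a design choice of the sketch, not something the proposition requires.

```python
import math


def num_digits(n: int, b: int = 10) -> int:
    """Length of the base-b representation of n, i.e. floor(log_b(n)) + 1."""
    if n < 1 or b < 2:
        raise ValueError("need n >= 1 and b >= 2")
    k = 0
    while n >= b:      # each division by b strips one base-b digit
        n //= b
        k += 1
    return k + 1       # at this point k equals floor(log_b(original n))


print(num_digits(1000))                 # 4
print(num_digits(255, 2))               # 8
print(num_digits(math.factorial(72)))   # 104, matching Example 9
```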













Example 11 To determine if a positive integer n is prime, we can simply try dividing the number by prime numbers less than n. However, you never need to check a prime number p greater than $\sqrt{n}$ since if $p > \sqrt{n}$ and $p \cdot m = n$, then $m < \sqrt{n}$. Find a function $f : \mathbb{N} \to \mathbb{N}$ built from the mathematical functions in this section such that f(n) is the largest integer that is less than or equal to $\sqrt{n}$.

SOLUTION In general, only those $p \le \sqrt{n}$ need to be checked, so the largest value that might ever need to be checked is $\lfloor \sqrt{n} \rfloor$. So we can use the function with the rule $f(n) = \lfloor \sqrt{n} \rfloor$. □
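The bound from Example 11 can be sketched as a trial-division test in Python. The name `is_prime` is an illustrative assumption, and for simplicity the sketch tries every integer divisor from 2 up to $\lfloor \sqrt{n} \rfloor$ rather than only the primes in that range, which gives the same answer with some redundant checks. `math.isqrt` returns $\lfloor \sqrt{n} \rfloor$ exactly for nonnegative integers.

```python
import math


def is_prime(n: int) -> bool:
    """Trial division by every d with 2 <= d <= floor(sqrt(n))."""
    if n < 2:
        return False
    for d in range(2, math.isqrt(n) + 1):  # isqrt(n) is floor(sqrt(n))
        if n % d == 0:
            return False
    return True


print(math.isqrt(1361))   # 36, so no divisor above 36 needs checking
print(is_prime(1361))     # True
```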


Arithmetic Operations as Functions 
There are many mathematical processes that can be viewed as functions, and this
commonality often lends insight into properties shared by many processes. Ordinary
arithmetic provides the most common examples.





Example 12 The function Sum : $\mathbb{Z} \times \mathbb{Z} \to \mathbb{Z}$ defined by the rule

$$\text{Sum}(i, j) = i + j$$

is the usual addition operation on integers. The function Prod : $\mathbb{Z} \times \mathbb{Z} \to \mathbb{Z}$ defined by the rule

$$\text{Prod}(i, j) = i \cdot j$$

is the usual multiplication operation on integers. Write the distributive property of multiplication over addition

$$a \cdot (b + c) = (a \cdot b) + (a \cdot c)$$

in terms of these functions.

SOLUTION For all $a, b, c \in \mathbb{Z}$,

$$\text{Prod}(a, \text{Sum}(b, c)) = \text{Sum}(\text{Prod}(a, b), \text{Prod}(a, c))$$

□


Notice that since the inputs are ordered pairs, the technically correct notation 
for an output of the function Sum : $\mathbb{Z} \times \mathbb{Z} \to \mathbb{Z}$ is Sum((a, b)) rather than Sum(a, b),
but we will consistently use the latter for simplicity. 





Practice Problem 5 Give a careful definition of the “difference” operation as a 
function from Z x Z to Z, and express the following property in terms of this function: 
For all integers a, b, and c, 


$$(a - b) - c = (a - c) - b$$










Solutions to Practice Problems 


1. Proof. Let $x = \log_{10}(a)$ and $y = \log_{10}(b)$. Then $a = 10^x$ and $b = 10^y$, which means that

$$a/b = 10^x / 10^y = 10^{x-y} \quad \text{by rule \#3 of Theorem 1}$$

But this means that $\log_{10}(a/b) = x - y = \log_{10}(a) - \log_{10}(b)$, as desired. □

2. We will have to use trial and error to see when $\log_2(n!)$ first reaches 63. Table 4-16 shows values of $\log_2(n!)$ computed (using technology) as the sum

$$\sum_{i=1}^{n} \log_2(i) = \log_2(1) + \log_2(2) + \log_2(3) + \cdots + \log_2(n)$$

From this we see that $2^{61} < 20! < 2^{64} < 21!$, so 20! is the largest factorial that can be represented using a 64-digit binary number.

n | … | 20 | 21 | 22
$\log_2(n!)$ | … | 61.1 | 65.5 | 69.9

Table 4-16 Solution to Practice Problem 2

3. The number of multiples of k in the set $\{m, m+1, \ldots, n\}$ is the number of multiples of k in the set $\{1, 2, \ldots, n\}$ less the number of multiples of k in the set $\{1, 2, \ldots, m-1\}$. According to Proposition 3, this is

$$\lfloor n/k \rfloor - \lfloor (m-1)/k \rfloor$$

4. To account for every factor of 5 in the list of numbers from 1 to 1,000, we add the number of multiples of 5, which is $\lfloor 1{,}000/5 \rfloor = 200$; the number of multiples of 25, which is $\lfloor 1{,}000/25 \rfloor = 40$; the number of multiples of 125, which is $\lfloor 1{,}000/125 \rfloor = 8$; and the number of multiples of 625, which is $\lfloor 1{,}000/625 \rfloor = 1$. Therefore, the exponent of 5 in the prime factorization of 1,000! is

$$\lfloor 1{,}000/5 \rfloor + \lfloor 1{,}000/25 \rfloor + \lfloor 1{,}000/125 \rfloor + \lfloor 1{,}000/625 \rfloor = 249$$

Hence, 1,000! ends with 249 zeroes.

5. We can define

$$\text{dif} : \mathbb{Z} \times \mathbb{Z} \to \mathbb{Z}$$

with the rule $\text{dif}(x, y) = x - y$. The given property then is stated as follows: for all $a, b, c \in \mathbb{Z}$,

$$\text{dif}(\text{dif}(a, b), c) = \text{dif}(\text{dif}(a, c), b)$$

Exercises for Section 4.6

1. Fill in Table 4-17 to support the relationship given in Example 6.

Table 4-17 Table for Exercise 1

2. For each of the following, indicate whether you believe the statement is true for all real numbers x and y, and give a counterexample if not.

(a) $\lfloor x \rfloor + \lfloor y \rfloor = \lfloor x + y \rfloor$

(b) 20 2204

(c) ix] [xl e 20)

(d) = ix) =f-

(e) If $x > 0$ then $\lfloor x + 0.5 \rfloor = \lceil x - 0.5 \rceil$.





3. Suppose you know that x is a positive real number, and you would like to build a function that returns the nearest integer to x. Call this function round(x), and show how it is related to the floor function. (If x is equally near two integers, your function should return the higher number.)

4. How many digits does the number 3"? have?

5. How many digits does the number 3)" have?

6. How many digits does the number 9!" have?

7. Fermat numbers are of the form $2^{2^n} + 1$. How many digits (in base ten) does the fifth Fermat number have? The tenth?

8. Let ay = 1, and ay = 2- for all n = 1. How many digits does as have?

9. Explain why the third part of Theorem 2 follows from the first two parts.

10. Investigate the relationship between the number of digits of 2" and the number of digits of 3". Prove the relationship using the properties in Theorem 2.














11. Use the definition of logarithms to explain why the following equation is true:


1 


Yide m 
ogo? = igi 


12. Investigate the relationship between the value of logy,
and log, 10. Prove the relationship using the properties
in Theorem 2.

13. How many positive integers less than or equal to 2,000 are divisible by
(a) 3?
(b) 5?
(c) 7?

14. How many positive odd numbers less than 2,000 are divisible by 3?

15. How many positive integers less than or equal to 2,000 are divisible by
(a) 3 or 5?
(b) 3 or 7?
(c) 2 or 5?
(d) 2 or 7?
(e) 2, 3, or …?

16. How many positive odd numbers less than 2,000 are divisible by 3 or 5?

17. Let's call an integer greater than 1 a faux prime if it is not divisible by 2, 3, 5, or 7.
(a) How many positive integers less than or equal to 2,000 are divisible by 2, 3, 5, or 7?
(b) What percentage of all positive numbers less than or equal to 2,000 are faux primes?
(c) Use a computer program or a table of primes to find the number of primes less than or equal to 2,000.
(d) What percentage of the faux primes are really primes?
(e) Discuss what the previous answer implies about checking numbers less than or equal to 2,000 for being prime.

18. Explain why 100! has two more digits than 99!.

19. How many base ten digits does 100! have?

20. How many base ten digits does 500! have?

21. What is the smallest value of n for which n! exceeds 10,000 digits in length? (Investigate for several values and try to discern a pattern.)

22. How many consecutive 0's occur at the right-hand side of the number 100!?

23. How many consecutive 0's occur at the right-hand side of the number 1,985!?

24. If $3^p$ divides evenly into 1,992!, what is the largest value that p can have?




25. If $2^p$ divides evenly into 1,992!, what is the largest value that p can have?

26. Give a proof of Proposition 4 for the case p = 2. (You may use the fact that if 2 divides the product $a \cdot b$ then 2 must divide at least one of the numbers a or b.)

27. The following exercises use ideas from Section 2.6.
(a) If 500! is written using its binary representation, how many consecutive 0's occur on the right-hand side?
(b) If 500! is written using its hexadecimal representation, how many consecutive 0's occur on the right-hand side?
(c) If 500! is written using its base six representation, how many consecutive 0's occur on the right-hand side?

28. The following exercises use ideas from Section 2.6.
(a) If 500! is written using its binary representation, how many digits will be needed?
(b) If 500! is written using its hexadecimal representation, how many digits will be needed?
(c) If 500! is written using its base six representation, how many digits will be needed?

29. According to the solution for Example 11, to determine if the number 1,361 is prime, we only have to check 1,361 for divisibility by all prime numbers less than $\lfloor \sqrt{1361} \rfloor$. List all the numbers that need to be checked.

30. Repeat the previous exercise using 2,003 instead of 1,361.

31. Let Pow : $\mathbb{R}^+ \times \mathbb{R}^+ \to \mathbb{R}$ be the function with the rule $\text{Pow}(x, y) = x^y$. Using this along with the functions defined in Example 12, which of the following are true? For each that is false, provide a specific counterexample.
(a) For all $x, y, z \in \mathbb{R}^+$, Pow(Prod(x, y), z) = Prod(Pow(x, z), Pow(y, z)).
(b) For all $x, y, z \in \mathbb{R}^+$, Prod(Pow(x, y), z) = Pow(Prod(x, z), Prod(y, z)).
(c) For all $x, y, z \in \mathbb{R}^+$, Pow(Pow(x, y), z) = Pow(x, Prod(y, z)).
(d) For all $x \in \mathbb{R}^+$, Prod(2, Pow(2, x)) = Pow(2, Sum(x, 1)).











































32. Using the functions from the previous exercise and Practice Problem 5, write each of the following facts using the functions Prod, Sum, Dif, and/or Pow:

(a) For all $x, y, z \in \mathbb{R}^+$, $x^{y+z} = x^y \cdot x^z$

(b) Forallz, y, z€ R' (E) 
(©) For all x, y, z € R^, if x < y, then 





zer 





(bez) 







33. Let Quo : $\mathbb{Z} \times \mathbb{Z}^+ \to \mathbb{Z}$ be the function with the rule $\text{Quo}(x, y) = \lfloor x/y \rfloor$, and let Rem : $\mathbb{Z} \times \mathbb{Z}^+ \to \mathbb{Z}$ be the function with the rule $\text{Rem}(x, y) = x \bmod y$. Which of the following (which also use the functions from Example 12) are true statements? For each that is false, provide a specific counterexample.

(a) For all $x, y \in \mathbb{Z}^+$, x = Prod(Quo(x, y), y).

(b) For all $x, y, z \in \mathbb{Z}^+$, Quo(Sum(x, y), z) = Sum(Quo(x, z), Quo(y, z)).

(c) For all $x, y, z \in \mathbb{Z}^+$, Rem(Sum(x, y), z) = Sum(Rem(x, z), Rem(y, z)).

(d) For all $x, y \in \mathbb{Z}^+$, x = Sum(Prod(Quo(x, y), y), Rem(x, y)).





4.7 Excursion: Iterated Functions and Chaos





Magic Tricks 


Functions play a central role in the relatively new mathematical fields of fractals
and chaos theory. Believe it or not, the concept of composition of functions is even
present in some magic tricks.


Example 1 A magician gives a “volunteer” the following instructions:

1. Think of any positive integer.
2. Count how many letters it takes to write your number in English. The number of letters is your new number.
3. Repeat step 2 until you get the same number twice in a row.

At this point the magician announces that the volunteer's current number is 4. Why does this trick work?


SOLUTION Step 2 of the instructions gives rise to the following function definition: $f : \mathbb{N} \to \mathbb{N}$ with the rule

f(n) = the number of letters in the English expression for n

If we call the original chosen number $a_0$, then we are forming a sequence of natural numbers

$$a_0, a_1, a_2, a_3, \ldots$$

with $a_1 = f(a_0)$, $a_2 = f(a_1)$, $a_3 = f(a_2)$, and so on. The entire diagram for the function f cannot be shown since the domain is the infinite set $\mathbb{N}$; however, part of the diagram is shown in Figure 4-64. The “loop” near the dot for 4 indicates an arrow from 4 to itself. It certainly seems apparent that for values of n greater than 14 (i.e., those not shown), f(n) < n. This is, in fact, a true statement, but it is based more on properties of the English language than on mathematics. This along with the diagram is enough to explain why the only value of n for which f(n) = n is n = 4, but is this enough to explain why the trick always works? □
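One way to experiment with the trick is to program the function f and iterate it. The Python sketch below is only an illustration under stated assumptions: the helper names `english`, `f`, and `run_trick` are hypothetical, the spelling follows the American style without “and” (for example, “three hundred seventy-three”), hyphens and spaces are not counted as letters, and only starting values from 1 through 999 are handled.

```python
ONES = ["", "one", "two", "three", "four", "five", "six", "seven", "eight",
        "nine", "ten", "eleven", "twelve", "thirteen", "fourteen", "fifteen",
        "sixteen", "seventeen", "eighteen", "nineteen"]
TENS = ["", "", "twenty", "thirty", "forty", "fifty", "sixty", "seventy",
        "eighty", "ninety"]


def english(n: int) -> str:
    """English words for 1 <= n <= 999, e.g. 87 -> 'eighty-seven'."""
    if not 1 <= n <= 999:
        raise ValueError("this sketch only handles 1 through 999")
    parts = []
    if n >= 100:
        parts.append(ONES[n // 100] + " hundred")
        n %= 100
    if n >= 20:
        word = TENS[n // 10]
        if n % 10:
            word += "-" + ONES[n % 10]
        parts.append(word)
    elif n > 0:
        parts.append(ONES[n])
    return " ".join(parts)


def f(n: int) -> int:
    """Number of letters (hyphens and spaces ignored) in the words for n."""
    return sum(ch.isalpha() for ch in english(n))


def run_trick(a0: int) -> list:
    """Iterate f from a0 until the same number appears twice in a row."""
    seq = [a0]
    while True:
        seq.append(f(seq[-1]))
        if seq[-1] == seq[-2]:
            return seq


print(run_trick(87))  # [87, 11, 6, 3, 5, 4, 4]
```

Running `run_trick` over every starting value in the supported range is a finite check, not a proof, but it does confirm that each of those sequences stops at 4.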


The next example shows that not every trick like this works out so well. 








Figure 4-64 Partial arrow diagram for Example 1

Letters | A, E, I, O, U, L, N, S, T, R | D, G | B, C, M, P | F, H, V, W, Y | K | J, X | Q, Z
Score | 1 | 2 | 3 | 4 | 5 | 8 | 10

Table 4-18 Table for Example 2


Example 2 According to the official rules of Scrabble®,* scores are assigned to letters as shown in Table 4-18. Suppose the trick in Example 1 is revised so that instead of getting the new number from the length of the English word, the letters in the words for each number are scored according to the rules in Scrabble® (using the face values of the tiles and without worrying whether there are enough standard tiles to spell each word), and this score is the new number.

For example, if the original number is 87, then

• “Eighty-seven” has a score of 21.
• “Twenty-one” has a score of 15.
• “Fifteen” has a score of 13.
• “Thirteen” has a score of 11.
• “Eleven” has a score of 9.
• “Nine” has a score of 4.
• “Four” has a score of 7.
• “Seven” has a score of 8.
• “Eight” has a score of 9.

At this point the values will cycle 9, 4, 7, 8, 9, 4, 7, ... and hence never settle on a fixed value. What does the diagram for the function $g : \mathbb{N} \to \mathbb{N}$ with the rule

g(n) = the Scrabble® score for the English expression for n

look like?

* Scrabble® ©2005 Hasbro, Inc. Used with permission.










Figure 4-65 Partial arrow diagram for 
Example 2. 


SOLUTION In the partial diagram shown in Figure 4-65, we can see the
pattern 9, 4, 7, 8, 9, 4, 7, ... reflected in the “cycle” of arrows between these
numbers. So although there is a number n (namely 12) for which g(n) = n,
sometimes we never get to it in our sequence. For this reason, using Scrabble®
scores makes for a pretty poor magic trick. □








The idea of repeating the application of a function to form a sequence arises in
a variety of otherwise unrelated mathematical problems. We call such a sequence an
iterated function sequence,* and it depends only on the choice of the first term $a_0$ and
the function itself.


Definition Given a function f on a set A and an initial term $a_0 \in A$, the iterated function sequence for f starting at $a_0$ is

$$a_0, a_1, a_2, \ldots$$

where $a_{n+1} = f(a_n)$. In the event that the sequence starts to repeat (that is, $a_{n+k} = a_n$ for some smallest possible k), we say that $a_n, a_{n+1}, \ldots, a_{n+k-1}$ is a cycle with period k.
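For functions on a finite set of exact values, the definition can be turned into a small cycle finder. The Python sketch below is an assumption-laden illustration: the name `find_cycle` and the sample function `h` are hypothetical, the terms must be hashable so they can be stored in a dictionary, and the search gives up after `max_terms` terms. Because the search stops at the first repeated value, the period it reports is the smallest possible k.

```python
def find_cycle(f, a0, max_terms=100_000):
    """Return (terms before the first repeat, index where the cycle starts,
    period) for the iterated function sequence of f starting at a0."""
    first_seen = {}            # value -> index of its first appearance
    seq = []
    x = a0
    while len(seq) < max_terms:
        if x in first_seen:    # x = a_n has appeared before: a cycle closed
            start = first_seen[x]
            return seq, start, len(seq) - start
        first_seen[x] = len(seq)
        seq.append(x)
        x = f(x)
    raise RuntimeError("no repeat found within max_terms terms")


def h(n):
    """A small sample function on {0, 1, ..., 9}, chosen only for illustration."""
    return (n * n + 1) % 10


seq, start, period = find_cycle(h, 3)
print(seq, start, period)      # [3, 0, 1, 2, 5, 6, 7] 1 6
print(seq[start:])             # the cycle itself: [0, 1, 2, 5, 6, 7]
```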








The preceding examples use functions on numbers, but in a decidedly unmathematical way. We would like to see how we might use mathematics to settle some
questions about these iterated function sequences.


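Example 3 Let $f : \mathbb{N} \to \mathbb{N}$ be defined by

$$f(n) = \begin{cases} n/2 & \text{if } n \text{ is even} \\ n + 3 & \text{if } n \text{ is odd} \end{cases}$$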





* In the mathematics literature, this is often called the orbit of $a_0$ under f, but we prefer the language
here since it emphasizes the role of function iteration in the sequence.







Table 4-19 Solution to Example 3 


Show that every iterated function sequence for f with $a_0 \in \mathbb{Z}^+$ either ends with the cycle 3, 6, 3, 6, ... or the cycle 1, 4, 2, 1, 4, 2, ....

SOLUTION Let's first explore the problem a bit, and then we will write a proof. Table 4-19 shows the sequences that arise from various choices of $a_0$. Notice that as we form each new sequence, it eventually looks like one that we formed earlier. This should remind you of the principle of mathematical induction. Let's see how to prove properties of these sequences with this tool.

Let P(n) be the statement “The iterated function sequence with the first term $a_0 = n$ eventually ends with the cycle 3, 6, 3, 6, ... or the cycle 1, 4, 2, 1, 4, 2, ....” The table above verifies statements P(1), P(2), ..., P(9). Now let $k \ge 9$ be given, assume that statements P(1), P(2), ..., P(k) have all been verified, and consider statement P(k + 1). That is, we are considering the iterated function sequence with the first term $a_0 = k + 1$. There are two possibilities based on the definition of f:

• If $a_0 = k + 1$ is even, then the iterated function sequence looks like $k + 1, \frac{k+1}{2}, \ldots$. Since $\frac{k+1}{2} < k + 1$, statement $P\left(\frac{k+1}{2}\right)$ has already been verified, so we know that the iterated function sequence beginning with $\frac{k+1}{2}$ eventually ends with the cycle 3, 6, 3, 6, ... or the cycle 1, 4, 2, 1, 4, 2, .... This means that the iterated function sequence beginning with k + 1 eventually ends with the cycle 3, 6, 3, 6, ... or the cycle 1, 4, 2, 1, 4, 2, ....

• If $a_0 = k + 1$ is odd, then $a_1 = a_0 + 3 = k + 4$ is even, so the iterated function sequence looks like $k + 1, k + 4, \frac{k+4}{2}, \ldots$. Since k > 3, it follows that $\frac{k+4}{2} < k + 1$, so statement $P\left(\frac{k+4}{2}\right)$ has already been verified. Hence, we know that the iterated function sequence beginning with $\frac{k+4}{2}$ eventually ends with the cycle 3, 6, 3, 6, ... or the cycle 1, 4, 2, 1, 4, 2, .... This means that the iterated function sequence beginning with k + 1 eventually ends with the cycle 3, 6, 3, 6, ... or the cycle 1, 4, 2, 1, 4, 2, ....

This completes the induction. □
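The exploration step of Example 3 can also be automated. The following Python sketch assumes the rule reconstructed from the proof (halve even numbers, add 3 to odd numbers); the name `final_cycle` is hypothetical. Checking many starting values this way supports the claim but does not replace the induction proof.

```python
def f(n: int) -> int:
    """The function of Example 3: n/2 if n is even, n + 3 if n is odd."""
    return n // 2 if n % 2 == 0 else n + 3


def final_cycle(a0: int) -> frozenset:
    """Set of values on the cycle that the sequence starting at a0 ends in."""
    seen = set()
    x = a0
    while x not in seen:       # walk until some value repeats
        seen.add(x)
        x = f(x)
    cycle = {x}                # the first repeated value lies on the cycle
    y = f(x)
    while y != x:              # go around the cycle once to collect it
        cycle.add(y)
        y = f(y)
    return frozenset(cycle)


print(sorted(final_cycle(9)))   # [3, 6]
print(sorted(final_cycle(7)))   # [1, 2, 4]
```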


Practice Problem 1 Let $f : \mathbb{N} \to \mathbb{N}$ be defined by

$$f(n) = \begin{cases} n/2 & \text{if } n \text{ is even} \\ n + 1 & \text{if } n \text{ is odd} \end{cases}$$

Show that every iterated function sequence for f with $a_0 \in \mathbb{Z}^+$ ends with the cycle 1, 2, 1, 2, ....

Figure 4-66 Partial arrow diagram for Example 4.


Not every function is as easy to analyze as the two we have seen so far. The
following example is called the Collatz problem, the Syracuse problem, or Ulam's
problem depending on whom you ask. It is one of the best-known unsolved problems
in mathematics.


Example 4 Let $g : \mathbb{N} \to \mathbb{N}$ be defined by

$$g(n) = \begin{cases} n/2 & \text{if } n \text{ is even} \\ 3n + 1 & \text{if } n \text{ is odd} \end{cases}$$

Does the iterated function sequence for g always end at the cycle 4, 2, 1, 4, 2, 1, ...?

SOLUTION Surprisingly, no one knows. A partial diagram for this function is shown in Figure 4-66. Even though many people have extended this picture to huge numbers of terms, the question remains open. □
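Extending the picture by computer is straightforward. Here is a minimal Python sketch; the name `steps_to_one` and the safety limit `max_steps` are illustrative assumptions. Since the question is open, the sketch can only report what happens for the starting values actually tried.

```python
def g(n: int) -> int:
    """The function of Example 4: n/2 if n is even, 3n + 1 if n is odd."""
    return n // 2 if n % 2 == 0 else 3 * n + 1


def steps_to_one(n: int, max_steps: int = 10_000) -> int:
    """Number of applications of g needed to reach 1 from n."""
    steps = 0
    while n != 1:
        n = g(n)
        steps += 1
        if steps > max_steps:
            raise RuntimeError("did not reach 1 within max_steps steps")
    return steps


print(steps_to_one(12))   # 9: 12, 6, 3, 10, 5, 16, 8, 4, 2, 1
print(steps_to_one(27))   # 111
```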





Chaos in Dynamical Systems 

There are a variety of definitions of the term “chaos” in mathematics and physics,
but one thing that they all imply is that chaos lies on the boundary of order and 
randomness. We will investigate this broad claim in the remainder of this section. 





Example 5 Let $f : \mathbb{R} \to \mathbb{R}$ be defined by $f(x) = x^2 - 0.8$, and consider the iterated function sequence of f with the first term $a_0 = 0.5$ as shown in Table 4-20. Explain this pattern using simple algebra.

SOLUTION It looks like the terms $a_n$ with index n even are settling in on one value near −0.7236, while the terms $a_n$ with odd index n are settling in on a value near −0.2764. The sequence is defined by repeatedly applying the function f, so $a_{n+2} = f(a_{n+1}) = f(f(a_n))$. This is the relationship between successive even-indexed terms as well as between successive odd terms. If these really are settling on some value v, then it must be the case that

$$\begin{aligned} v &= f(f(v)) \\ &= f(v^2 - 0.8) \\ &= (v^2 - 0.8)^2 - 0.8 \\ &= v^4 - 1.6v^2 - 0.16 \end{aligned}$$











n | 0 | 1 | 2 | 3 | 4 | 5 | 6
$a_n$ | 0.5 | −0.55 | −0.4975 | −0.5525 | −0.4948 | −0.5552 | −0.4917

n | 7 | 8 | 9 | 10 | 11 | 12
$a_n$ | −0.5582 | −0.4884 | −0.5615 | −0.4848 | −0.5650 | −0.4808

n | 494 | 495 | 496 | 497 | 498 | 499
$a_n$ | −0.7236 | −0.2764 | −0.7236 | −0.2764 | −0.7236 | −0.2764

Table 4-20 Table for Example 5


The equation $v = v^4 - 1.6v^2 - 0.16$ is the same (by factoring) as

$$\left(v^2 - v - \tfrac{4}{5}\right)\left(v^2 + v + \tfrac{1}{5}\right) = 0$$

We can now use the quadratic formula (twice) to conclude that either $v = \frac{1}{2} - \frac{1}{10}\sqrt{105}$, $v = \frac{1}{2} + \frac{1}{10}\sqrt{105}$, $v = -\frac{1}{2} + \frac{1}{10}\sqrt{5}$, or $v = -\frac{1}{2} - \frac{1}{10}\sqrt{5}$. These four possible values of v are approximately v = −0.5246951, v = 1.524695, v = −0.2763932, and v = −0.7236068. The last two are those that showed up in our numerical investigation above. This leads to a natural question about why the other values of v did not show up. The answer is discovered by noticing that the above $v^2 - v - \frac{4}{5}$ factor being equal to 0 is precisely the same thing as $v^2 - \frac{4}{5} = v$. In other words, $v = \frac{1}{2} - \frac{1}{10}\sqrt{105}$ and $v = \frac{1}{2} + \frac{1}{10}\sqrt{105}$ are the values that satisfy f(v) = v. So if we take as the first term $a_0 = \frac{1}{2} - \frac{1}{10}\sqrt{105}$, the iterated function sequence will be just that constant value repeated, and likewise for the first term $a_0 = \frac{1}{2} + \frac{1}{10}\sqrt{105}$.

The three cycles for this function with period 2 or 1 are

−0.7236068, −0.2763932, −0.7236068, −0.2763932, ...,
−0.5246951, −0.5246951, −0.5246951, ..., and
1.524695, 1.524695, 1.524695, ....

To see that this function produces no cycles of period 3, we can simply look for the roots of $y = f(f(f(x))) - x$ on its graph shown in Figure 4-67.

We can see that the only two roots are at approximately −0.5246951 and 1.524695, but these initial terms produced our cycles of period 1. Since there are no other roots, there are no cycles of period 3. □
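The algebra in Example 5 can be double-checked numerically. The Python sketch below is an illustration only; the helper name `iterate` and the choice of 2,000 iterations are assumptions. It confirms that the two roots involving $\sqrt{105}$ are fixed points of f, that the two roots involving $\sqrt{5}$ are swapped by f, and that the sequence starting at 0.5 settles onto that pair.

```python
import math


def f(x: float) -> float:
    """The function of Example 5."""
    return x * x - 0.8


def iterate(func, a0: float, n: int) -> float:
    """Return a_n, the result of applying func to a0 a total of n times."""
    x = a0
    for _ in range(n):
        x = func(x)
    return x


fixed_points = [0.5 - math.sqrt(105) / 10, 0.5 + math.sqrt(105) / 10]
two_cycle = [-0.5 - math.sqrt(5) / 10, -0.5 + math.sqrt(5) / 10]

for v in fixed_points:
    print(v, f(v) - v)            # differences are zero up to rounding

a = iterate(f, 0.5, 2000)
print(sorted([a, f(a)]))          # close to the two values in two_cycle
print(two_cycle)
```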


Practice Problem 2 Find all cycles with period 1 or 2 for the iterated function
sequence of $g(x) = 3x - \frac{1}{x}$.





This seems to use just a little computation and algebra, so the preceding comment
about chaotic behavior lying between order and randomness no doubt seems strange.
One way this manifests itself in this example lies within the computations themselves.
The word “compute” refers to numerical computation devices such as computers or
calculators. To see the problem with these devices for chaotic systems, you will need
one of them for some experimentation.



















Figure 4-67 Graph of $y = f(f(f(x))) - x$ in Example 5.


Example 6 Let $f : \mathbb{R} \to \mathbb{R}$ be the same function as in the previous example, and consider the iterated function sequence for f with the first term $a_0 = 1.5246$ (i.e., a little less than the positive value that gave us a cycle of period 1) in Table 4-21. So we see that even though we start the sequence very close to the value where we will get a cycle of period 1, the sequence is, in fact, attracted to the cycle of period 2.

Now consider the iterated function sequence for the same function f but this time with the first term $a_0 = 1.5247$, shown in Table 4-22. This is a little more than the previous initial value, and a very little bit more than the positive value that gave us a cycle of period 1. As you can see, the terms of this sequence quickly exceed the bounds of a calculator. The point is that this is very different from the previous sequence, even though the initial terms for the two sequences were almost identical.
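The two sequences of Example 6 are easy to regenerate. In the Python sketch below the helper name `orbit` and the numbers of terms are illustrative assumptions. The second sequence is stopped after 18 steps because its terms grow so quickly that a few more squarings would exceed the range of floating-point numbers.

```python
def f(x: float) -> float:
    """The function of Examples 5 and 6."""
    return x * x - 0.8


def orbit(a0: float, n: int) -> list:
    """Return [a_0, a_1, ..., a_n] for the iterated function sequence of f."""
    terms = [a0]
    for _ in range(n):
        terms.append(f(terms[-1]))
    return terms


below = orbit(1.5246, 2000)   # starts just below the fixed point 1.524695...
above = orbit(1.5247, 18)     # starts just above it

print([round(x, 4) for x in below[:7]])   # compare with Table 4-21
print(below[-2:])                         # the two values of the period-2 cycle
print(above[-1])                          # already astronomically large
```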





n | 0 | 1 | 2 | 3 | 4 | 5 | 6
$a_n$ | 1.5246 | 1.5244 | 1.5238 | 1.5220 | 1.5165 | 1.4997 | 1.4492

n | 94 | 95 | 96 | 97 | 98 | 99
$a_n$ | … | … | … | … | … | …

Table 4-21 First Table for Example 6

n | 0 | 1 | 2 | 3 | 4 | 5 | 6
$a_n$ | 1.5247 | 1.5247 | 1.5247 | 1.5248 | 1.5251 | 1.5260 | 1.5287

n | 7 | 8 | 9 | 10 | 11 | 12 | 13
$a_n$ | … | … | … | … | … | … | …

n | 14 | 15 | 16 | 17 | 18
$a_n$ | … | … | … | … | …

Table 4-22 Second Table for Example 6

The phenomenon illustrated in the previous example is called “sensitivity to initial conditions,” and it is an important property in all studies of chaos theory. It basically means that there can be very different long-term behavior in two systems whose initial terms are almost identical.

The most famous adage about chaos in physical systems is attributed to Edward Lorenz (1917-) who wrote, “One meteorologist remarked that if the theory were correct, one flap of a seagull's wings would be enough to alter the course of the weather forever.” Over time Lorenz's seagull became a butterfly, and sensitivity to initial conditions became known as the butterfly effect.

n | 0 | 1 | 2 | 3 | 4 | 5 | 6
$a_n$ | 0.50 | −1.75 | 1.06 | −0.87 | −1.24 | −0.46 | −1.79

n | 7 | 8 | 9 | 10 | 11 | 12 | 13
$a_n$ | 1.20 | −0.56 | −1.69 | 0.85 | −1.28 | −0.35 | −1.88

Table 4-23 Solution to Example 7

The implication of sensitivity to initial conditions is very important in computation. Since digital computers cannot represent most numbers exactly, most values
stored in a computer are only approximations to an exact value, For many compu- 
tations, using an approximation that is within, say, 10-2, of the exact value is good 
enough, but for iterated function systems, as we saw in the previous example, even 
this much difference in initial conditions can cause incredibly different results in the 
long term.

Sensitivity to initial conditions is important in the interpretation of chaos in most 
disciplines, but it may seem too dramatic to say this property lies “between order and 
randomness" as we did at the beginning of this section. We will close this section with 
two examples that perhaps better illustrate the meaning of this puzzling statement. 


























Example 7 Examine the iterated function sequences for the function $f : \mathbb{R} \to \mathbb{R}$ defined by $f(x) = x^2 - 2$.

SOLUTION Table 4-23 shows values of the iterated function sequence using $a_0 = 0.5$. These terms not only look “random” for these small values of n, they do not settle down in any sense even when n is quite large, as Table 4-24 illustrates. □
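The early terms of this sequence can be regenerated with a few lines of Python. The helper name `orbit` is an illustrative assumption. Because of the sensitivity to initial conditions discussed earlier, terms far out in the sequence (such as those near n = 1,000) depend on how a particular machine rounds its arithmetic, so the sketch compares only the first few terms against Table 4-23 and otherwise checks just that the terms stay between −2 and 2.

```python
def f(x: float) -> float:
    """The function of Example 7."""
    return x * x - 2


def orbit(a0: float, n: int) -> list:
    """Return [a_0, a_1, ..., a_n] for the iterated function sequence of f."""
    terms = [a0]
    for _ in range(n):
        terms.append(f(terms[-1]))
    return terms


early = orbit(0.5, 13)
print([round(x, 2) for x in early])

late = orbit(0.5, 5000)
print(min(late), max(late))    # the terms never leave the interval [-2, 2]
```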











n j 100 1001 1002 1003 1004 
I^ 1g — 06 — -18 — 057 


Table 4-24 The Sequence for Larger Values of n 













Figure 4-68 The chaos game in Example 8. 


This example shows that it is possible to get “random” behavior out of a perfectly determined iterative process. The next example illustrates the position of chaos
between order and randomness by iterating a perfectly random process only to find a
great deal of order in the result. The “game” is usually attributed to Michael Barnsley.


Example 8 (The Chaos Game) Begin with a triangle with an initial point $p_0$ as shown in Figure 4-68 on the left.* Roll a fair six-sided die, and mark the point $p_1$ as follows:

• If the outcome is 1 or 2, draw $p_1$ halfway along an invisible line segment connecting $p_0$ with the vertex labeled {1, 2}.

• If the outcome is 3 or 4, draw $p_1$ halfway along an invisible line segment connecting $p_0$ with the vertex labeled {3, 4}.

• If the outcome is 5 or 6, draw $p_1$ halfway along an invisible line segment connecting $p_0$ with the vertex labeled {5, 6}.

Repeating this process with each successive point (i.e., forming point $p_{i+1}$ from point $p_i$ with a roll of the die) will give a sequence of points that slowly fill in the triangle. The picture on the right of Figure 4-68 shows the first four points in the simulation of three die tosses resulting in rolls of 2, then 3, and then 1. What will the long-term behavior of this process be, and what will the picture eventually look like?


SOLUTION The three pictures in Figure 4-69 show the result of this process
after 100 die rolls, 1,000 die rolls, and 10,000 die rolls.

This figure is called the Sierpinski triangle, and it is one of the most famous
of a class of images called “fractals” that are closely related to chaos. Notice
the self-similarity of parts of the Sierpinski triangle to the whole picture. This
is one of the defining properties of fractals. □
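The chaos game is simple to simulate. The Python sketch below makes several assumptions that are not in the text: the triangle is taken to be equilateral with corners at (0, 0), (1, 0), and (1/2, √3/2), the starting point (0.3, 0.2) is an arbitrary point inside it, and the names `chaos_game`, `VERTICES`, and `H` are hypothetical. It only generates the points; drawing them with any plotting tool should produce pictures like those described above.

```python
import math
import random

H = math.sqrt(3) / 2   # height of the assumed equilateral triangle
VERTICES = {(1, 2): (0.0, 0.0), (3, 4): (1.0, 0.0), (5, 6): (0.5, H)}


def chaos_game(n_rolls: int, p0=(0.3, 0.2), seed=None) -> list:
    """Return the points p_1, ..., p_n produced by n_rolls rolls of a die."""
    rng = random.Random(seed)
    x, y = p0
    points = []
    for _ in range(n_rolls):
        roll = rng.randint(1, 6)                       # fair six-sided die
        vx, vy = next(v for faces, v in VERTICES.items() if roll in faces)
        x, y = (x + vx) / 2, (y + vy) / 2              # move halfway to the vertex
        points.append((x, y))
    return points


points = chaos_game(10_000, seed=1)
print(len(points), points[:3])
```

One visible sign of the order in the result is that no generated point falls inside the open middle triangle whose corners are the midpoints of the three sides: each step lands in one of the three half-size corner triangles.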





Figure 4-69 More iterations of the chaos game.

These two examples show that there can be randomness from order and order from randomness. This is the essence of chaos. Current research in chaos has had a profound impact on the way we model real events and the techniques we use for numerical computation. On a theoretical level, the study of chaos has found applications ranging from astronomy to financial markets. It is surely one of the most important mathematical developments of the last century.

* The location of the initial point can be proven to be irrelevant to the impending outcome.


Solutions to Practice Problems 


1. After investigating the behavior of the iterated function sequence (a_n) for f, we make the following observation, which we can prove by induction:

Claim. For all n ∈ Z^+, the iterated function sequence for f with the first term a_0 = n eventually ends with the cycle 1, 2, 1, 2, ....

Proof by induction on n. Let P(n) be the statement "The iterated function sequence for f with the first term a_0 = n eventually ends with the cycle 1, 2, 1, 2, ...." First, let's look at the sequences that arise from various choices of a_0 in Table 4-25.

Table 4-25 Solution to Practice Problem 1

Let m > 10 be given such that statements P(1), P(2), ..., P(m − 1) have all been checked to be true. Now consider the iterated function sequence for f with the first term a_0 = m. We know that m is either even or odd, so we argue in two cases.

• Case 1: If m is even, then the sequence looks like m, m/2, .... Since m/2 < m, we have already checked statement P(m/2), so we know that the iterated function sequence for f that begins with m/2 eventually ends with the cycle 1, 2, 1, 2, .... Hence, the iterated function sequence that starts with a_0 = m must eventually end with the cycle 1, 2, 1, 2, ....

• Case 2: If m is odd, then the sequence looks like m, m + 1, (m + 1)/2, .... Since (m + 1)/2 < m for values of m > 10, we have already checked statement P((m + 1)/2), so we know that the iterated function sequence for f that begins with (m + 1)/2 eventually ends with the cycle 1, 2, 1, 2, .... Hence, the iterated function sequence that starts with a_0 = m must eventually end with the cycle 1, 2, 1, 2, ....
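The claim can also be spot-checked numerically. The sketch below assumes, from the two cases of the proof, that f halves even inputs and adds 1 to odd inputs; the names `f` and `reaches_cycle` are illustrative and do not come from the text. A finite check like this is only evidence, and the induction remains the proof.

```python
def f(n: int) -> int:
    """Assumed form of f, inferred from the proof: halve evens, add 1 to odds."""
    return n // 2 if n % 2 == 0 else n + 1


def reaches_cycle(start: int, max_steps: int = 10_000) -> bool:
    """Return True if iterating f from `start` reaches 1 or 2 (the cycle 1, 2, 1, 2, ...)."""
    a = start
    for _ in range(max_steps):
        if a in (1, 2):
            return True
        a = f(a)
    return False


# Spot check of P(1), ..., P(1000).
all_reach = all(reaches_cycle(n) for n in range(1, 1001))
```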
2. To find cycles with period 1, we simply solve the equation g(x) = x, which we can do after rewriting the equation 3x − 1/x = x as 2x^2 − 1 = 0, which has solutions x = ±1/√2. These two values constitute cycles with period 1.

To find cycles with period 2, we solve the equation g(g(x)) = x, which we can do after determining that

(g ∘ g)(x) = (27x^4 − 19x^2 + 3) / (3x^3 − x)

and so the equation g(g(x)) = x is equivalent to the equation

27x^4 − 19x^2 + 3 = 3x^4 − x^2

Moving all terms to one side gives 24x^4 − 18x^2 + 3 = 0, and we can factor 24x^4 − 18x^2 + 3 as 3(2x + 1)(2x − 1)(2x^2 − 1). Hence, g(g(x)) = x when x = ±1/2 or x = ±1/√2. We have seen that these last two values are actually cycles of g with period 1, so only the values ±1/2 constitute cycles with period 2.
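The period-2 answer can be confirmed with exact rational arithmetic. This is a small sketch that assumes g(x) = 3x − 1/x, the form read off from the equation 3x − 1/x = x in the solution; the variable names are illustrative.

```python
from fractions import Fraction


def g(x: Fraction) -> Fraction:
    """Assumed form of g, read off from the equation 3x - 1/x = x in the solution."""
    return 3 * x - 1 / x


half = Fraction(1, 2)
# g swaps 1/2 and -1/2, so each value returns to itself after two steps.
swaps = (g(half) == -half) and (g(-half) == half)
# A period-1 point would satisfy g(x) = x; 1/2 does not.
half_is_fixed = (g(half) == half)
```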







Exercises for Section 4.7 


1. Using the values A = 1, B=2, C =3, and so on, we 
can assign a number to every English word by adding 
the values of the letters in the word. We will call this 
the “ordinal value” of a word. With this in mind, we can 
define the function f : N → N by

f(n) = the ordinal value for the English expression for n


(a) Evaluate f(1) and f(16).

(b) Give a specific example to show that f is not onto.

(c) The function f has only one cycle, 216 → 228 → 288 → 255 → ___ → 216. What number goes in the blank?

(d) Explain why this function would not make much of 
a magic trick. 

2. (From [17]) A deck of 32 cards is numbered top to bottom from 1 to 32. The magician slowly deals the cards into three face-up piles; the spectator silently chooses one card and remembers which pile it falls into. After all cards are dealt (into piles numbering 11, 11, and 10), the spectator indicates into which pile his card fell, and the magician sandwiches this pile between the other two, turning all three face-down again in his hand and taking care to put one of the piles of 11 on top of the stack. The deal is repeated, this time with the spectator merely silently noting which pile his card is in and sharing this information after the deal has been completed. The same reassembly of the deck is performed. This is repeated until, after the fourth deal, the magician reveals that the chosen card is in the 17th position of the deck.

(a) In the magic trick described, let a_0 denote the number (between 1 and 32) of the original card, and a_n the position of the chosen card after n deals. Explain in English why the sequence a_0, a_1, a_2, ... is an iterated function sequence for the function f : {1, 2, 3, ..., 32} → {1, 2, 3, ..., 32} given by

f(x) = 11 + ⌈x/3⌉

(b) Give a complete diagram for the function f, and use your diagram to explain why the trick works.
Find all cycles for the function f : N — N defined by 








gin) 


$ — ifmiseven 
RS otherwise 


Prove your answer is correct by mathematical induc- 
tion. 


4. Find all cycles for the function g : N — N defined by 


$0 ifniseven 
(n) |3  ifnisodd and divisible by3 
nl otherwise 


Prove your answer is correct by mathematical induc- 
tion. 

5. Lothar Collatz (1910-1990) studied many iterated func- 
tions, but there is really no record that he studied the 
problem that bears his name in Example 4. A similar 
problem that does appear in his notebooks [38] con- 
cerns the function 

if nis divisible by 3 

ifn — 1 is divisible by3 


ifm +1 is divisible by 3 


a(x)= 





(a) Find any cycles of g involving only numbers less than 20.
(b) Collatz's original question concerned the iterated function sequence for g with the first term a_0 = 8. Use a calculator or computer to compute the first 20 terms of this sequence.
6. Find all cycles of period 1 for the function f : R → R defined by f(x) = x^2 − 1.
7. Find all cycles of period 1 for the function g : R → R defined by g(x) = 2x
8. Find all cycles of period 2 for the function f : R → R defined by f(x) = x^2 − 1.
3. Find ail ces of period 2 fr the function 
defined by g(x) = 
m Spee E eee [iR 
R defined by f(x) — 3° —05. 
11. Find any cycles of period 1 or 2 for the function f : R → R defined by f(x) = x^2 − 1.5.
12. Use a graph to show that there are no cycles of period 3 for the function f : R → R defined by f(x) = x^2 − 1.5.
13. A well-known procedure for computing square roots dates back to the Babylonians and is a special case of a procedure from calculus called Newton's method:
• Let z be a positive real number whose square root you would like to know.
• Let g : R → R be defined by g(x) = (x + z/x)/2.
• Let a_0 be your best "guess" for √z.
• The iterated function sequence a_0, a_1, a_2, ... forms a sequence of improving approximations for √z.
Find all cycles of period 1 or 2 for the function g.
M. Exercise 13 suggests a general way to approximate the 
zeroes of functions. For example, if we want to find a 


























ie 

















17 








value for x such that x 
value of x such that x = 
cle of the function g(x) 
cach of the equations below, let g(x) be as described, 
Show that a cycle of g with period 1 will be a solution to 
the given equation, and use a spreadsheet or calculator" 
to find an approximation of such a value. 

(a) Tosolve x3 —2 = 0, let g(x) = 
(b) Tosolve e* let g(x) 
(©) Tosolve cosx — x =0, let g(x) 


), we can instead find a 
by looking for a cy- 

















15. Find all cycles of period 2 or less for the function f : [0, 1] → [0, 1] defined by

f(x) = 2x if x ≤ 1/2
f(x) = 2 − 2x otherwise

(Recall that the interval notation [0, 1] here refers to the set of real numbers x with 0 ≤ x ≤ 1.)

16. Using the function f in Exercise 15 and initial values
of the form 2, where n is odd, prove that a cycle always 
occurs. (An interesting research problem is to examine 
the relationship between the value of 2 and the period 
of the resulting cycle.) 

17. One of the reasons that "sensitivity to initial conditions"
is important can be seen when using technology to ex- 
plore iterated function sequences for the function f 
given in Exercise 15, Use a spreadsheet to generate 100 
terms of several such sequences using different initial 
values of the form 2, where n is an odd integer. (In an 
Excel spreadsheet, this can be done by placing: 
cell A1 and then the formula =IF(A1<=0.5,2*A1,2-2*A1) in cell A2 and "filled down" through cell A100.)
Write a paragraph with supporting data describing what
surprising event occurs and how this is a side-effect of 
sensitivity to initial conditions. 



















18. Consider the function g : [0, 1] → [0, 1] with g(x) = 4x − ⌊4x⌋.
(a) Describe in words what g does to an input number.
(b) Explain why the iterated function sequence for g 
with the first term a, = 3 forms a cycle of period 1. 
(c) Use a spreadsheet or calculator to find the first 50 terms of the iterated function sequence for g with the first term a_0 = 2/3. (In an Excel spreadsheet, this can be done by placing =2/3 in cell A1 and then the formula =4*A1-FLOOR(4*A1,1) in cell A2 and "filled down" through cell A50.) Can you explain what is going wrong?
19. The Fibonacci numbers {F_n} are defined recursively by the rule F_1 = F_2 = 1 and F_n = F_{n-1} + F_{n-2} for all n ≥ 3.





We can study the ratio of consecutive Fibonacci num- 
bers by letting r, = 
(a) Prove that re = 1+ z} 
(b) Find a function g : R → R such that (r_n) is an iterated function sequence for g.

(c) Find all cycles of g with period 1. Write an English sentence relating one of these numbers directly to the Fibonacci numbers.

20. (The following solitaire game is called Bulgarian solitaire and is analyzed in [1].) Take a deck of 45 cards, and divide it into as many face-down piles as you wish. Take one card from each pile, and combine these to form a new pile. Repeat this operation until you start seeing the same configuration more than once. This is a tricky function to make explicit, but hopefully the idea of "cycle" can be understood. Find the unique cycle of period 1.





for all n> 





4.8 Excursion: Growth of Functions





In this section we will see how formal reasoning and mathematical induction can be used in the formal analysis of the complexity of algorithms. While this topic is primarily of interest in the field of computer science, at its heart lies a deeper examination of the relationship between recursive descriptions and closed-form descriptions of sequences. Hence, even those with no interest in computer science will still benefit from studying these ideas.


* Issues about when an iterated function sequence "converges" to a value that constitutes a cycle with
period 1 are too advanced for this course. It is enough for now that the problems given here all behave




Complexity of Algorithms 


Understanding algorithms is particularly important in the field of computer science. 
In particular, when presented with more than one way to algorithmically solve a 
problem, we would like to be able to judge which method is most efficient.


Example 1 Consider the problem of finding which page in a dictionary a given word is on. There are two obvious algorithmic solutions to the problem:

1. Start on page one of the dictionary. If the given word is between (using alphabetical ordering) the first and last words on the page, then find it on the page; otherwise, turn the page and repeat the process.

2. Open the dictionary to the middle. If the given word comes before the first word on this page, the word must be in the first half of the dictionary; otherwise, the word must be in the second half of the dictionary. Repeating this process will find first which quarter of the book contains the word, then which eighth of the book contains the word, and so on. Eventually, the fraction of the book you have narrowed the word down to will consist of a single page.

Which of these is more efficient?

SOLUTION This is fairly obvious since most people (other than small children) use the latter method. If the dictionary contains 1,000 pages, then the first method involves comparing the given word to as many as 1,000 words, while the latter method will find the correct page after comparing the word to only 10 words. □
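The two counts in the solution can be reproduced with a short sketch. The function names are illustrative, and the second function only models the repeated halving of the page range, not an actual dictionary lookup.

```python
def linear_lookups(pages: int) -> int:
    """Worst case for method 1: every page is checked in turn."""
    return pages


def halving_lookups(pages: int) -> int:
    """Method 2: count how many halvings reduce `pages` to a single page."""
    count = 0
    while pages > 1:
        pages = (pages + 1) // 2  # keep the larger half, to be pessimistic
        count += 1
    return count
```

For a 1,000-page dictionary, `linear_lookups(1000)` gives 1000 and `halving_lookups(1000)` gives 10, matching the figures quoted in the solution.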


We will measure "efficiency" (or "complexity") by finding a relationship between the size of the input to the algorithm and the amount of time† the algorithm requires to run. Since "run time" is dependent on a particular machine, we measure time complexity by designating a particular time-consuming operation (like multiplying numbers or comparing values), and counting how many times this operation is performed. In this way, we think of the complexity of an algorithm as a function from N, the possible sizes of inputs, to N, the possible number of designated operations required. For example, if f(n) and g(n) represent the maximum number of word comparisons required by the two respective algorithms in Example 1 operating on a dictionary with n pages, then our solution can be written as f(1,000) = 1,000 and g(1,000) = 10.

In mathematics, to be able to compare the relative merits of different solutions to problems, we often need tools that allow us to see what is important in the midst of the things that are not. The analysis and comparison of algorithms are good examples of this idea. We have already alluded to the idea of measuring efficiency by concentrating on how many times a particular time-consuming operation is performed. For example, suppose you have the choice of two algorithms whose time complexities are measured
as f(n) = ⌈0.1n^3⌉ + 10n and g(n) = 10n^2 − n. Which should you choose? The first algorithm is better for small values of n; for example, f(10) = 101 while g(10) = 990. However, if the input size is large, the following example shows that the second algorithm is preferable.

† Sometimes efficiency is measured in terms of memory space required, but we will not address that in this text.

Example 2 Prove that for all n > 100, f(n) > g(n).

SOLUTION Let n be given satisfying n > 100. Using algebraic operations, we obtain:

n > 100
n^3 > 100n^2 (multiply by positive number n^2)
0.1n^3 > 10n^2 (multiply by positive number 0.1)

Since f(n) > 0.1n^3 and g(n) < 10n^2, this establishes the conclusion. □


For large values of n, the presence of n^3 in the first expression causes it to be larger than the second expression, where the highest power of n is n^2. Speaking loosely, we say that the first algorithm has time complexity "roughly n^3," while the time complexity for the second is "roughly n^2." All other details about f(n) and g(n) are insignificant as the size of the input gets larger. These ideas will be developed in more detail as we progress through this section.
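The crossover between the cubic and quadratic terms can be located directly. The sketch below compares only the dominant terms 0.1n^3 and 10n^2 from the proof of Example 2, written with integers to avoid rounding; the name `cubic_term_wins` is illustrative.

```python
def cubic_term_wins(n: int) -> bool:
    """True when 0.1*n^3 > 10*n^2, written with integers as n^3 > 100*n^2."""
    return n**3 > 100 * n**2


# Smallest n at which the cubic term overtakes the quadratic term.
first_n = next(n for n in range(1, 1000) if cubic_term_wins(n))
```

Under these assumptions `first_n` is 101, which agrees with the condition n > 100 in Example 2.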





A few words about functions and sequences 


We have studied number sequences throughout the first two chapters of this book, but now we see that sequences are simply a particular type of function. Specifically, given a function f : N → R, we can define a sequence (a_k) with the rule a_k = f(k) for all k ∈ N. Similarly, any sequence with the first term a_0 can be associated with a function from N to R. In this section, we will interchange these two concepts freely in order to keep notation as straightforward as possible.





Induction and Inequalities 


For some algorithms it is reasonably easy to develop a closed formula for the com- 
plexity, but for many algorithms this is a difficult if not impossible task. Fortunately, 
in order to determine when one algorithm is much better than another, we do not 
necessarily need a closed-form description of each algorithm's complexity. 

In Example 3, we imagine an algorithm that requires a_n operations when given an input of size n. Suppose the structure of the algorithm leads us to conclude that a recurrence relation like

a_n = 2a_{n-1} + n^3

must be satisfied. We have seen in Chapters 1 and 2 that finding a closed formula for such a sequence can be difficult, so it should come as some relief to learn that we do not need to in this section.

Example 3 For the recursively defined sequence a_n = 2a_{n-1} + n^3 with a_0 = 0, prove that a_n > 2^n for every natural number n ≥ 2.

n   | 1 | 2  | 3  | 4   | 5   | 6     | 7     | 8     | 9      | 10
a_n | 1 | 10 | 47 | 158 | 441 | 1,098 | 2,539 | 5,590 | 11,909 | 24,818
2^n | 2 | 4  | 8  | 16  | 32  | 64    | 128   | 256   | 512    | 1,024

Table 4-26 Table for Example 3

SOLUTION Perhaps our first instinct is to compare the two expressions for several sample values of n as shown in Table 4-26, and then use mathematical induction to prove the given inequality.

Proof by induction. Let P(n) be the statement "a_n > 2^n." From the table, we can see that statements P(2), P(3), ..., P(10) are all true. Now let m ≥ 11 be given such that P(m) is the first statement not yet checked to be true. In particular, we know that statement P(m − 1) is true. That is, we know that a_{m-1} > 2^{m-1}. Now

a_m = 2 · a_{m-1} + m^3
    > 2 · (2^{m-1}) + m^3    by statement P(m − 1)
    = 2^m + m^3
    > 2^m                    since m ≥ 1

This string of relationships implies that a_m > 2^m, verifying that statement P(m) is true. □
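The table values can be regenerated from the recurrence. This sketch assumes the recurrence a_n = 2a_{n-1} + n^3 with a_0 = 0, which is the form consistent with the surviving table entries; `a_sequence` is an illustrative name.

```python
def a_sequence(limit: int) -> list[int]:
    """Return [a_0, ..., a_limit] for a_n = 2*a_(n-1) + n^3 with a_0 = 0."""
    terms = [0]
    for n in range(1, limit + 1):
        terms.append(2 * terms[-1] + n**3)
    return terms


a = a_sequence(10)
# Check P(2), ..., P(10): a_n > 2^n.
checked = all(a[n] > 2**n for n in range(2, 11))
```

The check covers only the base cases; the induction step is what extends the inequality to every n ≥ 2.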


Practice Problem 1 For the recursively defined sequence ba = 3by-1 + 2n with 
b, = 1, prove that by > 3" for every natural number n > 3. 


You might have already guessed that the second "dictionary search" algorithm given in Example 1 really is the best way to find a value in a sorted list. It might not have been as obvious that the number of required comparisons is related to the number of pages in the dictionary using the logarithm and ceiling functions discussed in Section 4.6.

We will talk later about why logarithms are unavoidable when studying certain types of algorithms, but for now, we want to try to understand how they affect the growth rate of recursively defined sequences. In Example 4, c_n represents the maximum number of comparisons required to sort a list of n numbers from smallest to largest using a particular algorithm.

Example 4 For the recursively defined sequence c_n = c_{n-1} + ⌈log_2(n)⌉ with c_1 = 0, prove that c_n ≤ n · ⌈log_2(n)⌉ for every natural number n ≥ 1.

SOLUTION We use Table 4-27 to compare the two expressions for several values of n, and then use mathematical induction to prove the given inequality.

n               | 1 | 2 | 3 | 4 | 5  | 6
c_n             | 0 | 1 | 3 | 5 | 8  | 11
n · ⌈log_2(n)⌉  | 0 | 2 | 6 | 8 | 15 | 18

Table 4-27 Table for Example 4

Proof by induction. Let P(n) be the statement "c_n ≤ n · ⌈log_2(n)⌉." From the table, we can see that statements P(1), P(2), ..., P(6) are all true. Now let m ≥ 7 be given such that P(m) is the first statement not yet checked to be true. In particular, we know that statement P(m − 1) is true. That is, we know that c_{m-1} ≤ (m − 1) · ⌈log_2(m − 1)⌉. Now

c_m = c_{m-1} + ⌈log_2(m)⌉
    ≤ (m − 1) · ⌈log_2(m − 1)⌉ + ⌈log_2(m)⌉    by statement P(m − 1)
    ≤ (m − 1) · ⌈log_2(m)⌉ + ⌈log_2(m)⌉        since ⌈log_2(m − 1)⌉ ≤ ⌈log_2(m)⌉
    = m · ⌈log_2(m)⌉

This string of relationships implies that c_m ≤ m · ⌈log_2(m)⌉, verifying that statement P(m) is true. □
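The same comparison can be run for many more values of n than fit in a table. The sketch below assumes the recurrence c_n = c_{n-1} + ⌈log_2(n)⌉ with c_1 = 0 and computes the ceiling with integer arithmetic; the helper names are illustrative.

```python
def ceil_log2(n: int) -> int:
    """Exact ceiling of log_2(n) for a positive integer n, using integer arithmetic."""
    return (n - 1).bit_length()


def c_sequence(limit: int) -> dict[int, int]:
    """Return {n: c_n} for c_n = c_(n-1) + ceil(log_2(n)) with c_1 = 0."""
    c = {1: 0}
    for n in range(2, limit + 1):
        c[n] = c[n - 1] + ceil_log2(n)
    return c


c = c_sequence(1000)
bound_holds = all(c[n] <= n * ceil_log2(n) for n in c)
```

Using `bit_length` avoids the floating-point rounding that `math.log2` can introduce near exact powers of two.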





Note that the preceding proof required knowing a particular fact about logarithms: log_2(x − 1) < log_2(x) for x ≥ 2. In terms of functions, we say that the log function is increasing, a common property among functions that measure complexity. This type of algebraic relationship is typical of the details found in induction proofs of inequalities. Since these details are sometimes tricky, people who deal with growth rates of sequences and functions have developed terms and tools that allow us to work with these ideas at a higher level.





The Language of Function Growth: O, Θ, and Ω


The first step in simplifying our work is to characterize functions based on their "basic growth rates." With practice we will then be able to analyze the growth rate of more complicated recursively defined sequences that use these functions as building blocks. These two ideas together will allow us to understand the complexity of many basic algorithms in computer science.

We begin with some definitions that allow us to classify and compare functions by growth rate.








Definition For a fixed function g : N → R^+, we define the following sets of functions:

1. The set O(g(n)) is the set of all functions f : N → R^+ such that for all n ≥ N,

f(n) ≤ K · g(n)

for some positive real number K and natural number N. Informally, we can think of f(n) ∈ O(g(n)) as meaning that "f(n) grows no faster than g(n)."

2. The set Ω(g(n)) is the set of all functions f : N → R^+ such that for all n ≥ N,

f(n) ≥ L · g(n)

for some positive real number L and natural number N. Informally, we can think of f(n) ∈ Ω(g(n)) as meaning that "f(n) grows at least as fast as g(n)."

3. Θ(g(n)) = O(g(n)) ∩ Ω(g(n)). Informally, we can think of f(n) ∈ Θ(g(n)) as meaning that "f(n) and g(n) grow at the same rate." We often state this by saying, "f(n) is of order g(n)."





For example, in Example 3 we showed that a_n ∈ Ω(2^n), in Practice Problem 1 we showed that b_n ∈ Ω(3^n), and in Example 4 we showed that c_n ∈ O(n log_2(n)). Before making general observations about these sets of functions, we will look at an example of Θ notation, since that is most significant in our subsequent discussion.


Example 5 SELECTION SORT is a simple algorithm for sorting a list of numbers, and it can be shown that if a_n is the number of comparisons required to sort a list of n numbers, then a_n can be described recursively by

a_n = a_{n-1} + (n − 1) with a_1 = 0

We showed in Section 2.3 that this sequence has the closed form a_n = (n^2 − n)/2. Verify that a_n ∈ Θ(n^2).

SOLUTION We can equivalently write a_n = (1/2)n^2 − (1/2)n. For large values of n, the (1/2)n term is very small compared to the (1/2)n^2 term. For example, when n = 1,000, the (1/2)n^2 term is 500,000 and the (1/2)n term is only 500. For these large values, we reason that a_n is "roughly" (1/2)n^2. To put this conjecture in terms compatible with our Θ notation, we prove the following proposition. □
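The comparison count can be observed by instrumenting the algorithm. The text does not list the steps of Selection Sort, so the sketch below assumes the standard version (repeatedly select the smallest remaining element); the function name is illustrative.

```python
def selection_sort_comparisons(values: list[int]) -> tuple[list[int], int]:
    """Sort a copy of `values` by selection sort and count element comparisons."""
    items = list(values)
    comparisons = 0
    for i in range(len(items) - 1):
        smallest = i
        for j in range(i + 1, len(items)):
            comparisons += 1  # one comparison of two list entries
            if items[j] < items[smallest]:
                smallest = j
        items[i], items[smallest] = items[smallest], items[i]
    return items, comparisons


sorted_items, count = selection_sort_comparisons([5, 2, 9, 1, 7, 3])
```

For this list of n = 6 numbers the count is 15, which equals (n^2 − n)/2.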











Proposition 1 For the sequence given by a_n = (n^2 − n)/2, (1/4)n^2 ≤ a_n ≤ n^2 for all n ≥ 2. Hence, a_n ∈ Θ(n^2).

PROOF Let n ≥ 2 be given. Since n is positive, (n^2 − n)/2 < n^2/2, and clearly n^2/2 < n^2. This shows that a_n < n^2. To show that a_n ≥ (1/4)n^2, it is much more natural to first rewrite the inequality so that it involves a comparison of a polynomial to 0. In this case the inequality (n^2 − n)/2 ≥ (1/4)n^2 is the same, after multiplying by 4 and subtracting n^2 from each side, as the inequality n^2 − 2n ≥ 0. We can now easily argue that if n ≥ 2, then n is positive and n − 2 is nonnegative, in which case we know that the product n(n − 2) is nonnegative. That is, n^2 − 2n ≥ 0, as desired. □
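A quick numerical check of both bounds is sketched below. It uses the closed form a_n = (n^2 − n)/2 and clears the fraction 1/4 so that only integers are compared; the name `a` mirrors the sequence and the range checked is an arbitrary choice.

```python
def a(n: int) -> int:
    """Closed form a_n = (n^2 - n) / 2; always an integer."""
    return (n * n - n) // 2


# (1/4) n^2 <= a_n <= n^2, written without fractions as n^2 <= 4*a_n <= 4*n^2.
bounds_hold = all(n * n <= 4 * a(n) <= 4 * n * n for n in range(2, 10_001))
# The lower bound is tight at n = 2 and fails at n = 1.
tight_at_two = (4 * a(2) == 2 * 2)
fails_at_one = not (1 * 1 <= 4 * a(1))
```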


Through a more general argument, we could show that Θ(n^2) includes all quadratic polynomials with a positive leading term. Also significant is the fact that other sets like Θ(n) do not include quadratic polynomials. The following example illustrates a proof along these lines.


Example 6 Show that for the sequence given by a_n = (n^2 − n)/2, a_n ∉ Θ(n).

SOLUTION We show that a_n ∉ O(n) (and hence a_n ∉ Θ(n)) using a proof by contradiction. Suppose that a_n ∈ O(n), and let N and K be the values in the definition of O so that (n^2 − n)/2 ≤ K · n for all n ≥ N. This is the same, after multiplying through by 2 and adding n to each side, as saying that n^2 ≤ (2K + 1)n for all n ≥ N. Since n is positive, this implies that n ≤ 2K + 1 for all n ≥ N. This final statement is absurd regardless of the particular values of N and K, so we have a contradiction. □
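The contradiction can be made concrete: for any proposed constants K and N, there is an explicit n that violates a_n ≤ K · n. The sketch below restricts K and N to positive integers for simplicity, and `witness` is an illustrative name.

```python
def a(n: int) -> int:
    """Closed form a_n = (n^2 - n) / 2."""
    return (n * n - n) // 2


def witness(K: int, N: int) -> int:
    """Return some n >= N with a_n > K * n, contradicting a_n <= K * n for all n >= N."""
    # a_n > K*n is equivalent to n > 2K + 1, so any n >= 2K + 2 works.
    return max(N, 2 * K + 2)
```

For example, with K = 50 and N = 10 the witness is n = 102, where a_n = 5151 exceeds K · n = 5100.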







In Examples 5 and 6, we see that the growth rate of a_n = (1/2)n^2 − (1/2)n is determined by the highest-degree term in the expression. This is true of any polynomial expression, as the following theorem indicates. Although we do not prove this theorem here, the previous examples give some indication of how such a proof might be structured, and Exercise 10 outlines the proof of an easy special case.


Theorem 2 Let f(n) be a polynomial of degree p with positive leading coefficient. That is, f(n) = c_p n^p + c_{p-1} n^{p-1} + ··· + c_1 n + c_0, where p ∈ N, c_p > 0, and each c_i ∈ R. Then f(n) ∈ Θ(n^p). Moreover, for any q ≠ p, f(n) ∉ Θ(n^q).


In Example 5, we had to investigate a bit to decide that the sequence satisfied the property (1/4)n^2 ≤ a_n ≤ n^2. Since we knew a closed formula for a_n, that was not too hard. For a sequence defined recursively, however, the task may be more difficult. A valuable tool in this sort of investigation is a standard spreadsheet program, since it models recursion very nicely. To simplify this process, we give an alternate characterization of O, Ω, and Θ that is easier to use in practice.





Proposition 3 Let functions f : N → R^+ and g : N → R^+ be given.

1. f(n) ∈ O(g(n)) if and only if there are positive numbers K and N such that f(n)/g(n) ≤ K for all n ≥ N.

2. f(n) ∈ Ω(g(n)) if and only if there are positive numbers L and N such that f(n)/g(n) ≥ L for all n ≥ N.

3. f(n) ∈ Θ(g(n)) if and only if there are positive numbers K, L, and N such that L ≤ f(n)/g(n) ≤ K for all n ≥ N.

Example 7 For the sequence a_n satisfying a_n = a_{n-1} + (3n − 1) and a_1 = 1, verify that a_n ∈ Θ(n^2).

SOLUTION We can first examine this sequence empirically using a spreadsheet as follows:

1. Fill Column A with the values of n: 1, 2, 3, ..., 100. (This can be done using "Edit Fill Series.")

2. Put 1 (a value for a_1) in cell B1.

3. Put the formula =B1+3*A2-1 into cell B2.

4. Fill Down the formula to cells B3 through B100.

5. Put the formula =A1^2 into cell C1.

6. Fill Down this formula to cells C2 through C100.

7. Put the formula =B1/C1 into cell D1.

8. Fill Down this formula to cells D2 through D100.

Table 4-28 shows some of the values generated by this spreadsheet. □

n   | a_n   | n^2   | a_n / n^2
20  | 609   | 400   | 1.523
40  | 2419  | 1600  | 1.512
60  | 5429  | 3600  | 1.508
80  | 9639  | 6400  | 1.506
100 | 15049 | 10000 | 1.505

Table 4-28 Sequence from Example 7
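The spreadsheet experiment translates directly into a few lines of code. The sketch below assumes the recurrence a_n = a_{n-1} + (3n − 1) with a_1 = 1, as in the formula placed in column B; `example7_rows` is an illustrative name, and each row mirrors columns A through D.

```python
def example7_rows(limit: int) -> list[tuple[int, int, int, float]]:
    """Rows (n, a_n, n^2, a_n / n^2) for a_n = a_(n-1) + (3n - 1) with a_1 = 1."""
    rows = []
    a_n = 1
    for n in range(1, limit + 1):
        if n > 1:
            a_n = a_n + 3 * n - 1  # same rule as the formula filled down column B
        rows.append((n, a_n, n * n, a_n / (n * n)))
    return rows


rows = example7_rows(100)
last_n, last_a, last_sq, last_ratio = rows[-1]
```

The ratio in the last column stays between 1 and 2 for every row, which is the kind of evidence Proposition 3 asks for before attempting a proof.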


This evidence leads to the following proposition, part of which is proven below and the rest of which is left for Exercise 16.

Proposition 4 For the sequence satisfying a_n = a_{n-1} + (3n − 1) and a_1 = 1, for all n ≥ 1, n^2 ≤ a_n ≤ 2n^2. Hence, a_n ∈ Θ(n^2).

PROOF (By induction) We will prove only the statement "a_n ≤ 2n^2" here. Since a_1 = 1, the first statement "a_1 ≤ 2 · 1^2" is true. Let m ≥ 2 be given such that the statement "a_m ≤ 2m^2" is the first one not yet checked to be true. In particular, the statement "a_{m-1} ≤ 2(m − 1)^2" has already been checked to be true. Now

a_m = a_{m-1} + 3m − 1            from the recurrence relation
    ≤ 2(m − 1)^2 + (3m − 1)       by the statement for m − 1
    = 2m^2 − m + 1

Since m ≥ 2, it follows that 2m^2 − m + 1 < 2m^2. Hence,

a_m ≤ 2m^2

which is precisely the next statement to be checked. □





The function growth notation is useful to express the notion that one growth rate is "better" than another. Since we are predominantly concerned with functions that measure the number of operations required by an algorithm, it makes sense that the better function is the one with smaller outputs reflecting a more efficient algorithm. The significance of this can be seen when making a direct connection between the number of operations and the time an algorithm requires to run. Table 4-29 makes this connection for some specific functions assuming the algorithm can perform 10^6 (1 million) operations per second. If we have a choice of implementing an easy algorithm with complexity of order n^2 or doing a bit more work to implement an algorithm with complexity of order n log n, we can see from the table that it can really make a difference, especially if the input is large or if the algorithm will be used repeatedly.

Now using the function growth notation that we have developed, we can state the first of two theorems that tell us the growth rate of a recursively defined sequence based on the form of its recurrence relation. Once again we will not prove this theorem here, but we will opt instead to investigate it further in the exercises.








Theorem 5 Let p and C be positive real numbers, and suppose that g(n) ∈ Θ(n^p). The sequence with recurrence relation a_n = C · a_{n-1} + g(n) has one of three possible growth rates:

1. If C < 1, then a_n ∈ Θ(n^p).
2. If C = 1, then a_n ∈ Θ(n^{p+1}).
3. If C > 1, then a_n ∈ Θ(C^n).

Table 4-29 Approximate Run Time for Various Algorithm Complexities





Using this theorem, we can say that in Example 7 (where a_n = a_{n-1} + (3n − 1)), since C = 1 and (3n − 1) ∈ Θ(n), it follows that a_n ∈ Θ(n^2). Similarly, we can say that in Example 3, the sequence with recurrence relation a_n = 2a_{n-1} + n^3 must have a_n ∈ Θ(2^n) since in this case C = 2 > 1 and g(n) ∈ Θ(n^3).
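The prediction for Example 3 can be examined numerically by watching the ratio a_n / 2^n. The sketch below assumes the recurrence a_n = 2a_{n-1} + n^3 with a_0 = 0; the function name and the sample values of n are illustrative choices. A ratio that stays between two fixed positive bounds is what membership in Θ(2^n) requires, although a few sample values are only evidence.

```python
def ratio_to_exponential(n_max: int) -> float:
    """Return a_n / 2^n at n = n_max for a_n = 2*a_(n-1) + n^3 with a_0 = 0."""
    a_n = 0
    for n in range(1, n_max + 1):
        a_n = 2 * a_n + n**3
    return a_n / 2**n_max


r20, r40, r60 = (ratio_to_exponential(n) for n in (20, 40, 60))
```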





Practice Problem 2 Use Theorem 5 to predict the growth rate of a sequence a_n satisfying each of the following recurrence relations:


(a) y= Jani (n 1) 
an-ı +? +20 3) 
i n 


Logarithms and Algorithms 

Before proceeding to the second theorem that predicts the growth rate of a sequence based on the form of its recursive description, we will pause here to see how logarithms arise naturally in the study of algorithms. We actually need look no further than the "better" algorithm that we used in Example 1 to find the page of a dictionary containing a given word. Following that same process, we can find any given value in any sorted list as follows.

Example 8 The following algorithm determines whether a sorted list L (sorted lowest to highest) contains a number x. How many comparisons are necessary when a list L with n entries is used as input?

BINARY SEARCH

[1] Compare‡ x and the middle-most element of L to determine which half of the list x is in.

[2] Compare x with the middle-most element of that half to determine which quarter of the list x is in.

[3] Compare x with the middle-most element of that quarter to determine which eighth of the list x is in.

[4] And so on.

[5] Once the portion of the list you have narrowed it down to has only one element in it, you can give a "yes" or "no" answer with a single additional comparison.

‡ We assume that in a single comparison of two numbers a and b, we can determine if a = b, a < b, or a > b.

SOLUTION The longest this process can take is when the given item is not actually in the list, so that is the case we will use to discuss complexity. For example, in a list of 15 elements, the first comparison will split the list into two "halves," each with seven elements. The second comparison will split these into two "quarters," each with three elements. In this way, two more comparisons will rule out the given value from being in the list.

In general, the number of comparisons is the number of times one must repeat the splitting process to be left with a list of length 1, and then one additional comparison to check if the value is in that very small list. The number of times a number n must be halved before the result is no more than 1 can be found algebraically by finding the smallest integer k such that

n · (1/2)^k ≤ 1

which is the same thing (using logarithms) as

k ≥ log_2(n)

The exact way in which these values are rounded (using floor or ceiling functions) requires a little more thought, so this is left for Exercise 1 at the end of this section. It turns out that ⌊log_2(n)⌋ + 1 comparisons are required to verify that the given number is not in the sorted list of length n. □
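The comparison count can be observed by instrumenting a binary search. The sketch below is one common way to code the idea, not a transcription of steps [1] through [5]: it assumes a three-way comparison that also discards the middle element at each step, and the name `binary_search` and its return format are illustrative.

```python
def binary_search(sorted_list: list[int], x: int) -> tuple[bool, int]:
    """Return (found, comparisons) when searching `sorted_list` for x."""
    lo, hi = 0, len(sorted_list) - 1
    comparisons = 0
    while lo <= hi:
        mid = (lo + hi) // 2
        comparisons += 1  # one three-way comparison of x with sorted_list[mid]
        if x == sorted_list[mid]:
            return True, comparisons
        if x < sorted_list[mid]:
            hi = mid - 1  # continue in the lower part
        else:
            lo = mid + 1  # continue in the upper part
    return False, comparisons
```

Searching a 15-element list for a value larger than every entry uses 4 comparisons, matching the count described in the solution, and a 1,000-element list uses 10.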








Closely related to this "searching algorithm" is a class of puzzles that involves finding a counterfeit coin among many real ones. These are popular in recreational mathematics, and the idea behind them is also fundamentally tied to logarithms.





Example 9 I have a bag with 16 coins that appear identical, but I know one of them is counterfeit. The real coins weigh 1 oz each and the counterfeit one weighs some different amount. I have a spring scale that tells me the exact weight of whatever I put on it.

SOLUTION Use the following algorithm:

COUNTERFEIT DETECTOR I

[1] Weigh eight coins. If they do not weigh 8 oz, then the counterfeit coin is among them; otherwise, the counterfeit coin is among the other eight. Keep the eight that include the counterfeit coin.

[2] Weigh four coins. If they do not weigh 4 oz, then the counterfeit coin is among them; otherwise, the counterfeit coin is among the other four. Keep the four that include the counterfeit coin.

[3] Weigh two coins. If they do not weigh 2 oz, then the counterfeit coin is among them; otherwise, the counterfeit coin is among the other two. Keep the two that include the counterfeit coin.

[4] Weigh one of these coins to determine whether it or the other is counterfeit.

This algorithm requires four weighings to find the counterfeit among 16 coins. We can use mathematical induction to verify that it will require k weighings to find a counterfeit among 2^k coins. This can be extended to show that the number of weighings for n coins is ⌈log_2 n⌉. □
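The weighing procedure can be simulated to confirm the count. The sketch below assumes the number of coins is a power of 2 and uses integer weights in arbitrary units (so that sums are exact); `find_counterfeit` and its parameters are illustrative names.

```python
def find_counterfeit(weights: list[int], real_weight: int) -> tuple[int, int]:
    """Locate the one coin whose weight differs from `real_weight`.

    `weights` must have length 2^k. Returns (index of counterfeit, number of weighings).
    """
    lo, hi = 0, len(weights)  # the counterfeit lies in weights[lo:hi]
    weighings = 0
    while hi - lo > 1:
        mid = (lo + hi) // 2
        weighings += 1  # one reading of the spring scale for the first half
        if sum(weights[lo:mid]) != real_weight * (mid - lo):
            hi = mid  # the weighed half contains the counterfeit
        else:
            lo = mid  # otherwise it is in the unweighed half
    return lo, weighings


coins = [10] * 16
coins[11] = 9  # hypothetical counterfeit, slightly lighter than a real coin
result = find_counterfeit(coins, 10)
```

With 16 coins the simulation always uses four weighings, wherever the counterfeit is placed.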





Proposition 6 Algorithm COUNTERFEIT DETECTOR I requires k weighings to find a counterfeit among 2^k coins.

PROOF See Exercise 25. □








Logarithms might seem complicated to work with, but from the point of view of 
functional growth, they can be simplified somewhat by the fact that the base of the 
logarithm does not affect the growth rate. We will first get a glimpse of why this is 
true, and then we will discuss further why it is beneficial.









BE Example 10 show shar ogyn € eg n) 


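SOLUTION Let $x = \log_3 n$. That is, $x$ is the power of 3 that yields $n$, or $3^x = n$.
Since $3 = 2^{\log_2 3}$, it follows that

$$3^x = \left(2^{\log_2 3}\right)^x = 2^{x \cdot \log_2 3}$$

Hence, the power of 2 that yields $n$ is $x \cdot \log_2 3$; that is, $\log_2 n = x \cdot \log_2 3$. From
this, it follows that

$$\frac{\log_2 n}{\log_3 n} = \frac{x \cdot \log_2 3}{x} = \log_2 3 \approx 1.5849625$$

This means that $\frac{\log_2 n}{\log_3 n}$ is a constant (i.e., with no dependence on the value of $n$),
so for all $n > 1$,

$$\log_3 n \le \log_2 n \le 2 \log_3 n$$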





If you are unfamiliar with logarithms, this might be a bit of a surprise, so you
will want to experiment on your calculator to be sure you really believe it. (See
Exercise 3.)
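
The calculator experiment can also be run as a short script. The sketch below (the function name `log_ratio` is just a label chosen here) prints the ratio of the base-2 logarithm to the base-3 logarithm for a few values of n; the ratio should not change with n.

```python
import math


def log_ratio(n):
    """Ratio of log base 2 of n to log base 3 of n (n > 1)."""
    return math.log2(n) / math.log(n, 3)


for n in (2, 3, 4, 9, 10, 1000):
    print(n, round(log_ratio(n), 7))
```

Every line of output shows the same constant, the base-2 logarithm of 3, up to floating-point rounding.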





The same reasoning can be applied to show that any two logarithms with bases
greater than one have the same growth rate. Therefore, we can pick a standard base
that we understand, and use a logarithm in that base to represent all logarithmic
growth rates. In calculus, for example, we use the “natural” logarithmic base $e$ ($e$ is
an irrational number approximately equal to 2.718), and we even denote $\log_e x$ as
$\ln x$ instead. From our experience, using a standard base of two makes more sense in
discrete math.














Proposition 7 For any base $b > 1$, $\log_b n \in \Theta(\log_2 n)$.


This proposition justifies why we often just use the phrase “order of $\log n$,” with
no specific base, to describe logarithmic growth.

The other important thing to know about logarithmic growth is that it is very
slow, which for functions measuring complexity is a very good thing. In particular, 
logarithmic functions grow more slowly than any positive power of n, a fact that can 
be easily proven using the tools of calculus. 


Proposition 8 If $p$ is any positive real number, then there is a positive number
$N$ such that

$$\log_2 x < x^p$$

for all $x > N$.

PROOF Ask a calculus teacher and make her day!
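
A numerical spot check is no substitute for the calculus proof, but it can make the statement concrete. The sketch below takes the particular case p = 1/2 (an arbitrary choice made for illustration) and checks integer values of x; the helper name `log_below_power` is hypothetical.

```python
import math


def log_below_power(x, p):
    """True when log base 2 of x is strictly less than x raised to the p."""
    return math.log2(x) < x ** p


# For p = 1/2 the two sides meet at x = 16 (both equal 4).
print(log_below_power(8, 0.5))                                  # the logarithm is still ahead here
print(all(log_below_power(x, 0.5) for x in range(17, 100000)))  # the power stays ahead afterwards
```

For p = 1/2 the value N = 16 appears to work. Smaller values of p need a much larger N, which is why the proposition only promises that some N exists.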


The "big-oh" notation gives a natural hierarchy of elementary functions based 
on their growth rates: 


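$$\cdots \subseteq O(\log_2 n) \subseteq \cdots \subseteq O(n^{1/3}) \subseteq O(n^{1/2}) \subseteq O(n) \subseteq O(n^2) \subseteq \cdots \subseteq O(2^n)$$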





We can think of each of these subset relations in its “if, then” form. For example, the
relationship $O(n) \subseteq O(n^2)$ can be thought of as “If $f(n) \in O(n)$, then $f(n) \in O(n^2)$.”
In other words, “If $f(n)$ grows no faster than $n$, then $f(n)$ grows no faster than $n^2$.”
The hierarchy above says that of all the growth rates we have seen, logarithmic
growth rate is the slowest and exponential growth rate is the fastest. Although
other growth rates are possible, these (and combinations of them) are sufficient for 
understanding our main result about the growth of recursively defined sequences. 





Divide-and-Conquer Recurrences 


A common type of recurrence relation in computer science courses arises from an- 
alyzing the complexity of recursive algorithms. Although these are among the most 
important recurrence relations for a computer scientist to understand, they are also 
among the most difficult to find closed formulas for. Fortunately, our function growth 
notation can often be used to understand the growth rate of these sequences without 
needing an exact closed formula. 








Definition A divide-and-conquer recurrence relation is one of the form

$$a_n = C \cdot a_{n/k} + f(n)$$

where $C$ and $k$ are positive constants (with $k > 1$) and $f$ is any function on $\mathbb{N}$.








A typical application is the analysis of the sorting algorithm in Example 12. 
Before tackling this more difficult real-world example, however, we will first look at 
a few “abstract” examples to gain some experience with sequences like this. 







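Example 11 The first few terms of the sequence given by $a_n = 3 \cdot a_{\lfloor n/2 \rfloor} + n$
for $n \ge 2$, with $a_1 = 1$, look like this:

1, 5, 6, 19, 20, 24, 25, ...

It is difficult to find an exact solution to this kind of recurrence, but we can still analyze
the long-term behavior of the sequence. Prove that for all $n \ge 5$, $a_n \le n^2 + n$.


SOLUTION Since $a_5 = 20 \le 5^2 + 5$, the first statement is true. (The terms listed
above show that the inequality also holds for $a_1$ through $a_4$, which the step below
needs whenever $\lfloor m/2 \rfloor < 5$.) Now let $m \ge 6$ be given such that the inequality
has been checked for $a_5, \ldots, a_{m-1}$. When we consider the next number in the
sequence, we have

$$\begin{aligned}
a_m &= 3 \cdot a_{\lfloor m/2 \rfloor} + m \\
&\le 3 \cdot \left(\lfloor m/2 \rfloor^2 + \lfloor m/2 \rfloor\right) + m \\
&\le 3 \cdot \left((m/2)^2 + m/2\right) + m \\
&= \tfrac{3}{4} m^2 + \tfrac{5}{2} m \\
&= (m^2 + m) - \left(\tfrac{1}{4} m^2 - \tfrac{3}{2} m\right) \\
&\le m^2 + m
\end{aligned}$$

since $\tfrac{1}{4} m^2 - \tfrac{3}{2} m \ge 0$ when $m \ge 6$. This last inequality is somewhat mysterious,
but it can be verified by looking at a graph of the parabola $y = \tfrac{1}{4} x^2 - \tfrac{3}{2} x$ or
proved directly by induction.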
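
To experiment with this sequence, the recurrence can be typed in directly. The sketch below is a minimal illustration (the function name `a` simply mirrors the sequence name); it prints the first seven terms and checks the bound $a_n \le n^2 + n$ over a finite range, which supports the induction proof but does not replace it.

```python
from functools import lru_cache


@lru_cache(maxsize=None)
def a(n):
    """Term n of the sequence a_1 = 1, a_n = 3*a_(n//2) + n for n >= 2."""
    return 1 if n == 1 else 3 * a(n // 2) + n


print([a(n) for n in range(1, 8)])                   # [1, 5, 6, 19, 20, 24, 25]
print(all(a(n) <= n * n + n for n in range(1, 2000)))
```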


As we have seen, there is an important interplay between the components of 
the recurrence relation. In terms of the definition, the relative sizes of C and k, as 
well as the growth rate of f(n), play important parts in the overall growth rate of a 
divide-and-conquer recurrence relation. 


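Practice Problem 3 List the first 10 terms of each of the following recurrence
relations and compare them to the one in Example 11:

1. $b_1 = 1$ and $b_n = b_{\lfloor n/2 \rfloor} + n$
2. [initial value illegible] and $c_n = 3 \cdot c_{\lfloor n/2 \rfloor} + 2$
3. [initial value illegible] and $d_n = 3 \cdot d_{\lfloor n/3 \rfloor} + n$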








Exercise 32 explains how to use a standard spreadsheet program to generate
many terms for these sequences. Before studying the growth of these sequences, let's
see how this kind of recurrence relation arises from a recursive algorithm.





Example 12 MERGE SORT, a common recursive algorithm for sorting a list of
values, performs the following steps:

1. If the given list has only one number, then it already is sorted with no additional
required work.

2. If the given list has more than one number, then do the following three steps:
(a) Take the first half of the list and ask this program to sort it.
(b) Take the second half of the list and ask this program to sort it. (If the list has
odd length, take one more number in the first part than in the second part.)
(c) Merge the sorted first half with the sorted second half, and the entire list will be
sorted.

Find a recurrence relation for the number of comparisons required by MERGE SORT to
sort a list of length $n$.


SOLUTION Let $s_n$ denote the number of comparisons required to sort a
list of $n$ numbers using this algorithm. The first condition above tells us that
$s_1 = 0$ since a list with only one number does not require any comparison of
numbers to become sorted; it is already sorted. If the list has $n \ge 2$ elements,
then it is broken into two parts, the first containing $\lceil n/2 \rceil$ numbers and the second
containing $\lfloor n/2 \rfloor$ numbers. By the definition of $s$, it will take $s_{\lceil n/2 \rceil}$ comparisons
to sort the first part and $s_{\lfloor n/2 \rfloor}$ comparisons to sort the second part. With a little
thought, we can see that it requires $n - 1$ comparisons to merge two sorted lists
(containing $n$ total numbers) together, so we have established the recurrence
relation

$$s_n = s_{\lceil n/2 \rceil} + s_{\lfloor n/2 \rfloor} + (n - 1)$$

Table 4-30 illustrates the first few values of this sequence.

n   | 1 | 2 | 3 | 4 | 5 | 6  | 7  | 8
s_n | 0 | 1 | 3 | 5 | 8 | 11 | 14 | 17
Table 4-30 MERGE SORT Complexity for Small Values of n
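
The algorithm and its recurrence can be compared side by side in code. The following is a sketch, not the book's program: `merge_sort` returns the sorted list together with the number of comparisons it actually made, and `s` evaluates the recurrence. One assumption worth stating is that a merge can finish early when one part runs out, so the measured count can be smaller than $s_n$; the recurrence counts the worst case.

```python
def merge_sort(values):
    """Return (sorted copy of values, number of comparisons performed)."""
    if len(values) <= 1:
        return list(values), 0
    half = (len(values) + 1) // 2            # first part gets the extra item
    left, count_left = merge_sort(values[:half])
    right, count_right = merge_sort(values[half:])
    merged, count = [], count_left + count_right
    i = j = 0
    while i < len(left) and j < len(right):
        count += 1                           # one comparison per loop pass
        if left[i] <= right[j]:
            merged.append(left[i])
            i += 1
        else:
            merged.append(right[j])
            j += 1
    merged.extend(left[i:])                  # at most one of these is non-empty
    merged.extend(right[j:])
    return merged, count


def s(n):
    """Worst-case comparison count from the recurrence, with s(1) = 0."""
    return 0 if n == 1 else s((n + 1) // 2) + s(n // 2) + (n - 1)


print([s(n) for n in range(1, 11)])          # [0, 1, 3, 5, 8, 11, 14, 17, 21, 25]
print(merge_sort([5, 2, 4, 7, 1, 3, 2, 6]))
```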
The Master Theorem 


The final result of this section characterizes the growth rate of a sequence based on
the form of its divide-and-conquer recursive description. The complexity of many 
recursive algorithms can be determined by this theorem. 


Theorem 9 (Simplified Master Theorem) Let positive constants $L$ and
$K$ be given, and let $q = \log_K(L)$. The sequence $\{a_n\}$ with recurrence relation

$$a_n = L \cdot a_{n/K} + g(n)$$

has one of three possible growth rates:

1. If $g(n) \in O(n^p)$ for some positive $p < q$, then $a_n \in \Theta(n^q)$.
2. If $g(n) \in \Theta(n^q)$, then $a_n \in \Theta(n^q \cdot \log n)$.
3. If $g(n) \in \Omega(n^r)$ for some $r > q$, then $a_n \in \Theta(g(n))$.


We should note that the fraction $n/K$ in the recurrence relation is really shorthand
for $\lceil n/K \rceil$ or $\lfloor n/K \rfloor$, since the result is true regardless of which of these is
intended. Before we discuss why the theorem is true, let's see how to apply it to the
divide-and-conquer recurrence relations we encountered earlier in this section.
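
Choosing among the three cases is mechanical once $g$ has a known polynomial order. The sketch below makes a simplifying assumption that is not part of the theorem: it takes $g(n) \in \Theta(n^d)$ for a single exponent $d \ge 0$ and assumes $K > 1$. The function name `master_case` and its return format are inventions of this example.

```python
import math


def master_case(L, K, d):
    """Return (case number, q) for a_n = L*a_(n/K) + g(n).

    Assumes K > 1 and that g(n) is in Theta(n**d) for some d >= 0.
    Case 1: Theta(n**q); case 2: Theta(n**q * log n); case 3: Theta(n**d).
    """
    q = math.log(L, K)
    if math.isclose(d, q):
        return 2, q
    return (1, q) if d < q else (3, q)


print(master_case(3, 2, 1))   # a_n = 3*a_(n/2) + n: case 1, q about 1.585
print(master_case(1, 2, 1))   # b_n = b_(n/2) + n:   case 3, q = 0
print(master_case(2, 2, 1))   # s_n = 2*s_(n/2) + (n - 1): case 2, q = 1
```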








Example 13 Use the master theorem to draw a conclusion about the growth
rate of sequences $a_n$ and $b_n$ from Example 11 and Practice Problem 3.

SOLUTION Since $a_n = 3 \cdot a_{n/2} + n$, we can apply the master theorem with
$K = 2$, $L = 3$, and $g(n) = n$. In the theorem, $q = \log_2 3 \approx 1.585$. Since
$g(n) \in O(n^1)$, the first case of the master theorem holds, so we can conclude that
$a_n \in \Theta(n^{\log_2 3})$.
Since $b_n = b_{n/2} + n$, we can apply the master theorem with $K = 2$, $L = 1$,
and $g(n) = n$. In the theorem, $q = \log_2 1 = 0$. Since $g(n) \in \Omega(n^1)$, the third case
of the master theorem holds, and we can conclude that $b_n \in \Theta(n)$.








Practice Problem 4 Use the master theorem to draw a conclusion about the growth
rate of sequences $c_n$ and $d_n$ from Practice Problem 3.


We will not prove the master theorem, but we will verify the critical second case
with an example. The process that we will use is the same as the more general process
that is used in the proof of the theorem.





Example 14 Verify that the master theorem is correct for the sequence $s_n$ that
measures the complexity of the MERGE SORT algorithm in Example 12, for values of $n$
of the form $2^k$.

SOLUTION Without paying attention to floors and ceilings, the recurrence
relation for $s_n$ has the form

$$s_n = 2 \cdot s_{n/2} + (n - 1)$$

Using $K = L = 2$ and $g(n) = n - 1$ in the master theorem, we see that $q =
\log_2 2 = 1$, so since $g(n) \in \Theta(n^1)$, the theorem says that $s_n \in \Theta(n \log n)$. To
verify this independently, we will look at values of $s_n$, where $n$ has the form $2^k$,
since in this case, the recurrence is easy to implement.

$$\begin{aligned}
s_{16} &= 2 \cdot s_8 + g(16) \\
&= 2 \cdot (2 \cdot s_4 + g(8)) + g(16) \\
&= 2 \cdot (2 \cdot (2 \cdot s_2 + g(4)) + g(8)) + g(16) \\
&= 2 \cdot (2 \cdot (2 \cdot (2 \cdot s_1 + g(2)) + g(4)) + g(8)) + g(16) \\
&= 2^4 \cdot s_1 + 2^3 \cdot g(2) + 2^2 \cdot g(4) + 2^1 \cdot g(8) + g(16) \\
&= 2^4 \cdot 0 + 2^3 \cdot g(2^1) + 2^2 \cdot g(2^2) + 2^1 \cdot g(2^3) + g(2^4)
\end{aligned}$$

Since $g(n) = n - 1$, each $g(2^k)$ is roughly equal to $2^k$ and so this final sum is
roughly $2^4$ added to itself four times, or roughly $4 \cdot 2^4$. Notice that this is exactly
the value of the formula $n \cdot \log_2 n$ for the value $n = 16$ we started with.
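
For readers who want the "roughly" made exact, the same unwinding can be carried out for a general power of two. The derivation below is a sketch under the same simplification used in the example (no floors or ceilings, so $n = 2^k$ and $s_1 = 0$); the index $j$ is introduced here only to write the sum.

```latex
\begin{aligned}
s_{2^k} &= \sum_{j=1}^{k} 2^{\,k-j}\, g(2^j)
         = \sum_{j=1}^{k} 2^{\,k-j}\,(2^j - 1) \\
        &= \sum_{j=1}^{k} 2^{k} \;-\; \sum_{j=1}^{k} 2^{\,k-j}
         = k \cdot 2^k - (2^k - 1) \\
        &= n \log_2 n - n + 1 \qquad \text{where } n = 2^k .
\end{aligned}
```

For $n = 16$ this gives $64 - 16 + 1 = 49$, which agrees with computing $s_{16} = 2 \cdot s_8 + 15 = 2 \cdot 17 + 15$ from the recurrence, and the leading term $n \log_2 n$ is the one the master theorem predicts.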








In the solution to the previous problem, we saw that “unwinding” the divide-
and-conquer recurrence relation leads to a sum of roughly $\log_K n$ terms, each of
which is the product of a power of K and g evaluated at a power of K. The master
theorem simply pits the two terms in each product against each other: If g does not
have a high enough growth rate, then the powers of K dominate; if g has too high a
growth rate, then the powers of K become irrelevant. The second case of the master
theorem shows what happens when neither of the terms dominates the other. Only
in this case is the number ($\log n$) of summands relevant. To understand the details
requires only this idea and some familiarity with logarithm properties. Some of these
details are addressed further in the exercises.


[Table 4-31 Table for Practice Problem 3]


Solutions to Practice Problems


1 Proof by induction. The first statement to be checked is
“$b_3 \ge 3^3$.” Since $b_1 = 1$, it follows from the recurrence
relation that $b_2 = 3 \cdot 1 + 4 = 7$ and $b_3 = 3 \cdot 7 + 6 = 27$.
Hence, the first statement is true. Now let $m \ge 4$ be given
such that all statements up to and including “$b_{m-1} \ge 3^{m-1}$”
have been checked. We now reason as follows
about the next statement:

$b_m = 3 \cdot b_{m-1} + 2m$    by the recurrence relation
$\ge 3 \cdot 3^{m-1} + 2m$    by the previously checked statement
$= 3^m + 2m$
$\ge 3^m$    since $2m > 0$

2 (a) In this case, $C < 1$ and $g(n) = (3n - 1) \in \Theta(n)$, so
$a_n \in \Theta(n)$.

(b) In this case, $C = \frac{3}{2} > 1$, so $a_n \in \Theta((3/2)^n)$.

(c) In this case, $C = 1$ and $g(n) = \sqrt{n} \in \Theta(n^{1/2})$, so $a_n \in \Theta(n^{3/2})$.

3 Table 4-31 compares values of each sequence to the
corresponding values of the sequence $\{a_n\}$.

4 Since $c_n = 3 \cdot c_{n/2} + 2$, we can apply the master theorem
with $K = 2$, $L = 3$, and $g(n) = 2$. In the theorem,
$q = \log_2 3 \approx 1.585$. Since $g(n) \in O(n^1)$, the first case of
the master theorem holds, and we can conclude that
$c_n \in \Theta(n^{\log_2 3})$.

Since $d_n = 3 \cdot d_{n/3} + n$, we can apply the master theorem
with $K = 3$, $L = 3$, and $g(n) = n$. In the theorem,
$q = \log_3 3 = 1$. Since $g(n) \in \Theta(n^1)$, the second case of
the master theorem holds, and we can conclude that
$d_n \in \Theta(n \log n)$.


Exercises for Section 4.8


1. Fill out Table 4-32 to find the number of comparisons
required to verify that a given value is not among the
entries in a sorted list of length $n$. In your analysis, assume
the worst case at each step; namely, at any step
in which the list is split unevenly, assume that the value
is judged to be in the larger “half.” (Be sure to use
recursive thinking!)

2. For each of the following rules for the function $f : \mathbb{N} \to \mathbb{N}$,
make a table to investigate the ratio $f(2n)/f(n)$ for
increasing values of $n$. For each definition of $f$, write a
sentence of the form “When the input value is doubled,
the output is ___.”
[rules for $f$ illegible]

3. To verify the fact derived in Example 10, use your
calculator to evaluate $f(n) = \log_2 n$, $g(n) = \log_3 n$,
and $h(n) = f(n)/g(n)$, for values of $n$ from the set
{2, 3, 4, 9, 10}.

In Exercises 4-6, use a computer or calculator spreadsheet
as in the following example: To explain why
$\frac{1}{2} n^2 - n \in \Theta(n^2)$, we first place the values 1 through 50 in cells
A1 through A50; next the formula =1/2 * A1^2 - A1
is placed in cell B1 and filled down through cell




























































Length of list (n)                    | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10
Number of comparisons                 | 1 | 2 | 2 | 3 |   |   |   |   |   |
Value of $\lfloor \log_2 n \rfloor$   |   |   |   |   |   |   |   |   |   |
Table 4-32 Table for Exercise 1
B50; and finally the formula =B1/A1^2 is placed in cell
C1 and filled down through cell C50. The fact that after
cell C4, the values in the C column stay between
0.25 and 0.5 supports the claim that $\frac{1}{2} n^2 - n \in \Theta(n^2)$.
In the spirit of the exercises below, we would write
$0.25 \cdot n^2 \le \frac{1}{2} n^2 - n \le 0.5 \cdot n^2$ for all $n \ge 4$.

4. Fill in the blanks to make a true statement out of each
of the following, using spreadsheet data to support your
answers:
(a) $2n + 1 \in \Theta(n)$ because ___ $\cdot\, n \le 2n + 1 \le$ ___ $\cdot\, n$ for all $n \ge$ ___.
(b) [coefficient illegible] $n^2 - 10n + 3 \in \Theta(n^2)$ because ___ $\cdot\, n^2 \le$ [coefficient illegible] $n^2 - 10n + 3 \le$ ___ $\cdot\, n^2$ for all $n \ge$ ___.
(c) $n + \sqrt{n} \in \Theta(n)$ because ___ $\cdot\, n \le n + \sqrt{n} \le$ ___ $\cdot\, n$ for all $n \ge$ ___.
(d) [illegible]

5. Fill in the blanks to make a true statement out of each
of the following, using spreadsheet data to support your
answers:
(a) $3n - 1 \in O(n)$ because $3n - 1 \le$ ___ $\cdot\, n$ for all $n \ge$ ___.
(b) $10n + 7 \in O(n^2)$ because $10n + 7 \le$ ___ $\cdot\, n^2$ for all $n \ge$ ___.
(c) $\lfloor \log_2 n \rfloor \in O(\sqrt{n})$ because $\lfloor \log_2 n \rfloor \le$ ___ $\cdot \sqrt{n}$ for all $n \ge$ ___.

6. Fill in the blanks to make a true statement out of each
of the following, using spreadsheet data to support your
answers:
(a) $3n - 7 \in \Omega(n)$ because $3n - 7 \ge$ ___ $\cdot\, n$ for all $n \ge$ ___.
(b) $\frac{1}{4} n^2 \in \Omega(n)$ because $\frac{1}{4} n^2 \ge$ ___ $\cdot\, n$ for all $n \ge$ ___.
(c) $\frac{1}{4} n^2 - 5n \in \Omega(n^2)$ because $\frac{1}{4} n^2 - 5n \ge$ ___ $\cdot\, n^2$ for all $n \ge$ ___.

7. In Proposition 1 we used the definition of $\Theta$ to show that
the sequence given by $a_n =$ [illegible] satisfies $a_n \in \Theta(n^2)$. Do
the same thing, but using the alternative characterization
given in Proposition 3 instead. Specifically, find positive
numbers $K$ and $L$ and prove that $a_n/n^2$
stays between $K$ and $L$ for all $n \ge 2$.

8. Modify your solution to the previous exercise to prove
that $a_n =$ [illegible] satisfies $a_n \in \Theta(n^2)$.

9. Let $a_n = 5n^3 + 4n^2 + 6n + 7$. Show that $a_n \in \Theta(n^3)$ as
follows: Taking the ratio $a_n/n^3$ and simplifying yields
$5 + \frac{4}{n} + \frac{6}{n^2} + \frac{7}{n^3}$. Since $n \ge 1$, what is the largest value
that $\frac{4}{n} + \frac{6}{n^2} + \frac{7}{n^3}$ can have? How does this (along with
Proposition 3) show that $a_n \in \Theta(n^3)$?

10. We can generalize the previous two exercises to prove a
special case of Theorem 2: Let $f(n)$ be a polynomial of
degree $p$ with positive leading coefficient and no negative
coefficients. That is, $f(n) = c_p n^p + c_{p-1} n^{p-1} + \cdots +
c_1 n + c_0$ where $p \in \mathbb{N}$, $c_p > 0$, and each $c_i \in \mathbb{R}^{\ge 0}$. Prove
that $f(n) \in \Theta(n^p)$.

11. The following three problems give an indication for how
Theorem 2 can be proven in general:
(a) Let $a_n = 5n^3 - 4n^2 - 6n - 7$. Show that $a_n \in \Theta(n^3)$.
(HINT: Taking the ratio $a_n/n^3$ and simplifying yields
$5 - \frac{4}{n} - \frac{6}{n^2} - \frac{7}{n^3} = 5 - \left(\frac{4}{n} + \frac{6}{n^2} + \frac{7}{n^3}\right)$.)
(b) Let $a_n = 5n^3 + 4n^2 - 6n + 7$. Show that $a_n \in \Theta(n^3)$.
(HINT: Observe that $5n^3 - 4n^2 - 6n - 7 \le a_n \le
5n^3 + 4n^2 + 6n + 7$ for all $n \ge 0$.)
(c) Let $a_n = 5n^3 - 4n^2 - 6n + 7$. Show that $a_n \in \Theta(n^3)$.

12. Give a general proof of Theorem 2.


13, To show that a, ¢ O(r), we must show that for any 
value of K, there is an integer n such that $ > K. For 
cach of the following sequences, find such an integer n 
for each of the values of K in (10, 20, 50, 100]; 





14. To show that $a_n \notin O(g(n))$, we must show that for any
values of $K$ and $N$, there is an integer $n \ge N$ such that
$\frac{a_n}{g(n)} > K$. Use this fact to prove each of the following
statements:
(9 Hay = rf, show that ay ¢ O(n). 

b) Ifa, = 2n +3, show that ay ¢ OUÈ). 
(e) Ia, = 3n, show that ay ¢ OC YA). 

15. For each of the following statements, use a spreadsheet
to explore the given sequence (as is done in Example 7)
and fill in the blanks to create a true statement based on
your evidence. You do not need to prove the statements
are true.

















(a) Given that a, = ay- + (0? + 2n) and ay = 1, it fol 
lows that 
DETE on for all 
a 


(b) Given that ay = dy.) (t 17) anda, = 10,itfol- 
lows that 





^n for all 
ne 


(©) Given that a, 
that 


a, ffi and aj = 3, it follows 


mnia s 


nn 








foralln > 





16. For each of the following statements, use a spreadsheet
to explore the given sequence (as is done in Example 7)
and fill in the blanks to create a true statement based on
your evidence. You do not need to prove the statements
are true.


(a) Given that a, = 22,., +7? and a = 1, it follows 
that 
eas +2 for all 
az 


(b) Given that a, 
follows that 





m-i + (3n +17) and ay = 12, it 


n tor all 


(©) Given that a, = 3a,., — n° and ay 
that 





nisa = 


nd 


foralln > 


17. In the proof of Proposition 4 we proved that a sequence
$a_n$ satisfying $a_n = a_{n-1} + (n - 1)$ must have $a_n \in O(n^2)$.
Prove that such a sequence must also have $a_n \in \Omega(n^2)$,
thus completing the proof that $a_n \in \Theta(n^2)$.

18. Use induction to prove each of the following inequali- 
ties: 

(a) Given that a, = an- 3n and a, = 1, prove that 
a, = 37? for all n> 1, 

(b) Given that by = be-i + ("= 2) and bj = 1, prove 
that by <1 forall n > 1 

(©) Given that c, = cn- v? + ni) and cj = 1, prove 
that c, = m forall n= 1. 

(a) Given that dy = chy + [VF] and dy = 
d, > nforalln > 1 

duction to prove each of the following inequali- 











prove that 


19. Use 
ties: 
(9) Given that a, = 22,., +1 and a) = 1, prove that 
a, $2" = Vorallnz 1. 
(b) Given that by = 4by-1 n and by = 1, prove that 
b, Sn for alin > 1. 
(©) Given that c, = 3c,.,-en and cy = 1, prove that 
c, 2 3" forall n> 1 
(d) Given that d, = 24,., + n° and di = 1, prove that 
d, 2 2 for all n > 2. 
20. Prove each of the following properties for all functions
$f$, $g$, and $h$ on $\mathbb{N}$:
(a) For every function $f$ on $\mathbb{N}$, $f(n) \in \Theta(f(n))$.
(b) For all functions $f$ and $g$ on $\mathbb{N}$, if $f(n) \in \Theta(g(n))$,
then $g(n) \in \Theta(f(n))$.
(c) For all functions $f$, $g$, and $h$ on $\mathbb{N}$, if $f(n) \in \Theta(g(n))$
and $g(n) \in \Theta(h(n))$, then $f(n) \in \Theta(h(n))$.
21. Prove each of the following for all functions $f$, $g$, and $h$
on $\mathbb{N}$:
(a) If $f(n) \in O(h(n))$ and $C \ge 1$, then $C \cdot f(n) \in
O(h(n))$.
(b) If $f(n) \in O(h(n))$ and $g(n) \in O(h(n))$, then $f(n) +
g(n) \in O(h(n))$.
(c) If $f(n) \in O(g(n))$, then $g(n) \in \Omega(f(n))$.
22. Prove each of the following for all functions $f$ and $g$ on
$\mathbb{N}$:
(a) If $f(n) \in O(g(n))$ and $g(n) \in O(f(n))$, then $f(n) \in
\Theta(g(n))$.
4b) 1t f(n) € Ol?) and gin) e OW), then. fn) + 
a(n) e O(n"), 






















Number of coins (n)                      | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10
Number of weighings                      |   |   |   |   |   |   |   |   |   |
Value of $\lceil \log_2(n) \rceil$       |   |   |   |   |   |   |   |   |   |
Table 4-33 Table for Exercise 26


(© IE f(n) € OQ) and g(n) € O(n’), then f(n) + 
gin) e eu). 

23. Each of the following statements is false. In each case,
provide a specific counterexample that shows the statement
is false.
(a) For all $a > 1$, $O(a^n) = O(2^n)$.
(b) For all functions $f$ and $g$ on $\mathbb{N}$, if $f(n) \in O(g(n))$,
then $g(n) \in O(f(n))$.
(c) For all functions $f$ and $g$ on $\mathbb{N}$, if $f(n) \in O(n^2)$ and
$g(n) \in O(n^2)$, then $f(n) - g(n) = 0$.
(d) If $f(n) \in O(h(n))$ and $g(n) \in O(h(n))$, then $f(n)$
[operator illegible] $g(n) \in$ [symbol illegible]$(h(n))$.

24. What property of logarithms explains why
$O(\log_2(n^p)) = O(\log_2 n)$ for all $p > 0$?

25. Prove by induction that the algorithm COUNTERFEIT DETECTOR I
of Example 9 will require $k$ weighings to find a
counterfeit among $2^k$ coins.

26. Describe modifications to the COUNTERFEIT DETECTOR I
algorithm that will make it find a counterfeit coin from
among $n$ coins given the conditions in Example 9. Complete
Table 4-33 to support the claim that the number
of weighings required (in the worst case) is $\lceil \log_2(n) \rceil$.
(Be sure to use recursive thinking!)

27. I have a bag with 27 coins that appear identical, but I
know one of them is counterfeit. The real coins weigh
1 oz each and the counterfeit one weighs more. I have
a balance scale that tells me which of two loads is heavier.
Using the COUNTERFEIT DETECTOR I algorithm, we can
find the counterfeit in five weighings, but the following
algorithm does better:

COUNTERFEIT DETECTOR II
(i) Weigh nine coins against nine other coins. If
they do not balance, then the counterfeit coin is
among the heavier nine; otherwise, the counterfeit
coin is among the nine that were not weighed.
Keep the nine that include the counterfeit coin.
(ii) Weigh three coins against three other coins. If
they do not balance, then the counterfeit coin is
among the heavier three; otherwise, the counterfeit
coin is among the three that were not
weighed. Keep the three that include the counterfeit
coin.
(iii) Weigh one coin against another one. If they do
not balance, then the counterfeit coin is the heavier
one; otherwise, the counterfeit coin is the one
that was not weighed.

This algorithm requires only three weighings to find the
counterfeit among 27 coins. Prove by induction that this
algorithm will require $k$ weighings to find a counterfeit
among $3^k$ coins.

28. Think of a number in $\{1, 2, \ldots, n\}$, and I will try to
guess it. When I guess wrong, you tell me if your number
is higher or lower than my guess. What is the smallest
number $k$ of guesses I can make to guarantee that I am
right on or before the $k$th guess?

29. Match each of the recursive descriptions on the left with
the function on the right that best characterizes the
sequence's growth rate.
[recursive descriptions (a)-(c) and functions (A)-(C) illegible]

30. In Exercise 29, the expression given in (B) describes a
familiar linear function. The other two are called sublinear
and superlinear. Can you tell from their graphs
which is which? For positive constants $a$, $b$, and $c$ (with
$b > 1$), fill in the following blanks regarding the growth
rate of the recurrence relation $s_n = a \cdot s_{n/b} + c$:
(a) If $a < b$, then the closed formula for $s_n$ is a ___ function.
(b) If $a = b$, then the closed formula for $s_n$ is a ___ function.
(c) If $a > b$, then the closed formula for $s_n$ is a ___ function.

31. Prove directly by induction (i.e., do not just cite the master
theorem) that the sequence $s_n$ in Example 12, which
counts the number of comparisons in the MERGE SORT
algorithm, satisfies the inequality $s_n \le n^2$ for all $n \ge 1$.

























32. A divide-and-conquer recurrence relation can be examined
with a spreadsheet* for special values of $n$. For
example, given the sequence with recursive description
$c_n = 2 \cdot c_{n/2} + 3n$ where $c_1 = 1$, we can easily generate
values of this sequence for values of $n$ of the form $2^k$.
Note that the master theorem predicts that $c_n$ is of order
$n \log_2(n)$.
• Fill column A with the values of k: 0, 1, 2, 3, ..., 20.
(This can be done using “Edit Fill Series.”)
• Put the formula =2^A1 into cell B1, and Fill Down
the formula to cells B2 through B21.
• Put 1 (a value for $c_1$) in cell C1.
• Put the formula =2*C1+3*B2 into cell C2, and Fill
Down the formula to cells C3 through C21.
• Put the formula =B1*LOG(B1, 2) into cell D1, and
Fill Down this formula to cells D2 through D21.
• Put the formula =C1/D1 into cell E1, and Fill Down
this formula to cells E2 through E21.

The spreadsheet now has select values of $n$ in column
B, terms of the given sequence in column C, values of
the predicted growth rate $n \log_2(n)$ in column D, and
the ratio of the sequence to its predicted growth rate in
column E. A sampling of values from this spreadsheet is
shown in Table 4-34. Since the ratios in the spreadsheet
seem to be settling in around the value of 3, it appears
that the values of $c_n$ are trapped between $2n \log_2(n)$ and
$4n \log_2(n)$, corroborating the prediction of the master
theorem.

[Table 4-34 Table for Exercise 32]

Use this same approach to verify the prediction of the
master theorem for each of the following sequences:
[sequences (a) through (e) illegible]

33. Use induction to prove the following inequalities about
the sequences from Practice Problem 3:
(a) For all $n \ge 1$, $b_n \le 2n$.
(b) For all $n \ge 1$, $c_n \le$ [illegible].
(c) For all $n \ge 1$, $d_n \le$ [illegible].


Chapter 4 Summary







4.1 Definitions, Diagrams and Inverses
Terms and concepts


• You should understand the definition of a function and
the role of domain, codomain and rule in this definition.

• You should understand the definition of a binary relation
and the role of domain, codomain and rule in this
definition.

• You should understand the definition of the inverse of a
function or relation.


* To see how to use a spreadsheet to examine all values of a divide-and-conquer recurrence relation, use
the help files in your program. For example, the LOOKUP command in Microsoft Excel can be used
for this purpose.





Working with functions and relations


• You should be able to use and convert between the different
representations of a function's rule, including tables,
algebraic expressions and arrow diagrams.

• You should be able to use and convert between the different
representations of a relation's rule, including tables,
algebraic expressions, English descriptions, one-set arrow
diagrams, and two-set arrow diagrams.

• You should be able to determine if a binary relation is a
function.

• You should be able to determine the inverse of a binary
relation regardless of how the rule is represented.

• You should be able to determine whether or not a given
function has an inverse function.

• When a function does have an inverse, you should be
able to determine the inverse function regardless of how
the function’s rule is represented.

• You should be able to prove that two given functions are
inverses of each other.





4.2 The Composition Operation
Terms and concepts


• You should know what we mean by the composition of
functions $f$ and $g$, written as $f \circ g$, and be able to distinguish
between $f \circ g$ and $g \circ f$.

• You should know what the identity function is.

• You should know what we mean by the composition of
relations $R$ and $S$, written as $R \circ S$, and be able to distinguish
between $R \circ S$ and $S \circ R$.








Working with compositions


• You should be able to find the composition of two
functions when the rules are given as arrow diagrams,
sets of ordered pairs, algebraic equations, or English
descriptions. In the latter case, you should understand
the English-language meaning of the composite
function.

• You should be able to find the composition of two relations
when the rules are given as arrow diagrams,
sets of ordered pairs, algebraic equations, or English
descriptions. In the latter case, you should understand
the English-language meaning of the composite
relation.

• You should understand the meaning of inverse functions
in terms of composition and the identity function, and be
able to prove that two given functions are inverse functions
using this idea.




4.3 Properties of Functions
and Set Cardinality

Terms and concepts

• You should be able to identify when a function is not
invertible.

• You should be familiar with the terms one-to-one and
onto.

• Given a function $f : A \to B$ on finite sets $A$ and $B$, you
should know how the properties of one-to-one and onto
correspond to the relationship between the sizes of the
domain and codomain. If you studied the pigeonhole
principle in Section 2.5, you should understand how these
relationships are equivalent to the basic version of that
principle.

• You should be familiar with the term cardinality and the
formal definition of when two sets have the same cardinality.

• You should know the terms countable and uncountable
as they relate to the cardinalities of infinite sets.


Working with function properties and set cardinality


• You should be able to use the terms one-to-one and onto
in the context of functions whose rules are given by arrow
diagrams, sets of ordered pairs, algebraic equations,
or English descriptions.

• You should be able to produce a specific counterexample
to illustrate that a given function is not one-to-one.

• You should be able to produce a specific counterexample
to illustrate that a given function is not onto.

• Given any (finite) set $A$, you should be able to explicitly
find the set Cantor constructed in his famous proof of
Theorem 9.


Proofs


• You should be able to write a well-structured mathematical
proof that a function is one-to-one.

• You should be able to write a well-structured mathematical
proof that a function is onto.

• You should be able to prove when simple infinite sets like
$\mathbb{N}$ or $\mathbb{Z}$ have the same infinite cardinality, and you should
be familiar with the proof that two infinite sets like $\mathbb{N}$ and
$\mathcal{P}(\mathbb{N})$ do not have the same infinite cardinality.





4.4 Properties of Relations
Terms and concepts


• You should know the meaning of the terms reflexive, antisymmetric,
and transitive, and the use of the term partial
order to describe relations with all three properties.

• You should realize that a Hasse diagram is a simplified
arrow diagram, and you should be able to draw the Hasse
diagram for a relation that is a partial order.

• You should understand the notion of a strict partial order,
and the corresponding irreflexive property.

• You should be able to distinguish between a partial order
and a total order by looking for incomparable pairs.


Investigating and proving properties

• For a given relation, you should be able to determine
whether or not it satisfies the various properties we have
discussed: reflexive, irreflexive, antisymmetric, transitive.

• You should be able to use the result of this analysis to
determine if the relation is a partial order, a strict partial
order, or neither.

• You should be able to prove that particular relations satisfy
the reflexive property.

• You should be able to prove that particular relations satisfy
the transitive property.

• By thinking of the antisymmetric property in a slightly
different (but logically equivalent) manner, you should
be able to prove that particular relations satisfy the
antisymmetric property.


4.5 Equivalence Relations


Terms and concepts


• You should understand what we mean by a partition of
a set. Partitions were introduced briefly in the preceding
chapter.

• You should know the connection between an equivalence
relation and the corresponding partition. For a given
equivalence relation, you should be able to describe the
corresponding partition.

• You should know that an equivalence relation is characterized
by three properties: reflexive, symmetric, and
transitive. The reflexive and transitive properties are the
same properties discussed in the preceding section for
partial orders.


Investigating and proving properties

• By exploiting the connection between an equivalence relation
and the corresponding partition, you should be
able to determine if a given relation is an equivalence
relation.

• For a given relation, you should be able to determine
whether or not it satisfies the various properties of an
equivalence relation: reflexive, symmetric, transitive.

• You should be able to prove that particular relations satisfy
the reflexive property, the symmetric property, and
the transitive property.


4.6 Numerical Functions in Discrete 





[...] positive real number r (written $p = \log_{10} r$), and the
  connection to the expression $10^p = r$.
• You should realize that the same type of function can be
  defined for other bases. For example, $p = \log_2 r$ means
  $2^p = r$.
• You should know what we mean by the floor and ceiling
  functions, written as $\lfloor x \rfloor$ and $\lceil x \rceil$, respectively.
• You should realize that numerical operations (addition,
  etc.) can be viewed as functions, and be able to interpret
  and use the corresponding notation Sum(i, j), Prod(i, j),
  and so on.
• You should be thoroughly familiar with these properties
  of logarithms, and realize that similar properties hold for
  other bases.
  — $\log_{10}(a^b) = b \cdot \log_{10}(a)$
  — $\log_{10}(a \cdot b) = \log_{10}(a) + \log_{10}(b)$
  — $\log_{10}(a/b) = \log_{10}(a) - \log_{10}(b)$








Solving problems. 


© You should be able to use the definition and properties 
of logarithms to solve various problems. For example: 
What is the largest value of n for which you can represent 
1 exactly in a 32-bit integer in a computer? 

• You should be able to use the floor and ceiling functions
  to solve various problems. For example: How many multiples
  of k are in the set {m, m+1, m+2, ..., n}?

• You should be able to use compositions of these functions,
  for example $\lceil \log_2 n \rceil$, to solve problems.
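
The multiples question above can be answered with one floor and one ceiling. The following is a minimal sketch, assuming k is a positive integer and m ≤ n; the function names are illustrative and do not come from the text.

```python
def count_multiples(k, m, n):
    """Count the multiples of k in {m, m+1, ..., n}, assuming k >= 1."""
    first_index = -(-m // k)  # ceiling(m / k), using integer arithmetic only
    last_index = n // k       # floor(n / k)
    return max(0, last_index - first_index + 1)


def count_multiples_brute(k, m, n):
    """Same count by direct inspection, used only as a cross-check."""
    return sum(1 for x in range(m, n + 1) if x % k == 0)


print(count_multiples(3, 100, 300), count_multiples_brute(3, 100, 300))
```

The closed form counts the integers j with ⌈m/k⌉ ≤ j ≤ ⌊n/k⌋, since each such j gives exactly one multiple jk in the set.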





4.7 Excursion: Iterated Functions 
and Chaos 


Terms and concepts


• You should know what is meant by the iterated function
  sequence for f starting at $a_0$, and the associated concept
  of a cycle of period k.

• You should realize that some iterated function sequences
  demonstrate sensitivity to initial conditions, and be able
  to use this to explain the so-called “butterfly effect.”

• You should realize that sensitivity to initial conditions is
  an important property in the mathematical systems studied
  by chaos theory.

• You should be familiar with the chaos game and the resulting
  Sierpinski triangle.


Calculations and proofs


• You should be able to calculate values for iterated function
  sequences for various choices of $a_0$, giving complete
  or partial arrow diagrams, and locating cycles when they
  exist.

• Using algebraic methods, you should be able to locate all
  cycles of length 1 and length 2 for given functions.

• Using a calculating device such as a spreadsheet, you
  should be able to demonstrate that certain functions are
  sensitive to their initial conditions.

• You should be able to prove that a given sequence eventually
  reaches one of several cycles, using mathematical
  induction.
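
As an illustration of the calculations listed above, here is a small sketch that builds an iterated function sequence and locates a cycle. The helper names and the sample function f(x) = (x² + 1) mod 10 are assumptions chosen for the example, not taken from the text; the exact-equality test for repeated values suits integer-valued functions only.

```python
def iterate(f, a0, steps):
    """Return [a0, f(a0), f(f(a0)), ...] with `steps` applications of f."""
    seq = [a0]
    for _ in range(steps):
        seq.append(f(seq[-1]))
    return seq


def find_cycle(f, a0, max_steps=10_000):
    """Return (index where the cycle starts, period), or None if none is found."""
    seen = {}  # value -> index of its first appearance
    a = a0
    for i in range(max_steps):
        if a in seen:
            return seen[a], i - seen[a]
        seen[a] = i
        a = f(a)
    return None


def f(x):
    # Hypothetical example function on the digits 0..9.
    return (x * x + 1) % 10


print(iterate(f, 3, 8))
print(find_cycle(f, 3))
```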


4.8 Excursion: Growth of Functions 





• You should recognize the use of the word complexity,
  or time complexity, to indicate the time efficiency of an
  algorithm.

• You should realize that a sequence of real numbers starting
  with $a_1$ is equivalent to a function $f : \mathbb{N} \to \mathbb{R}$.

• You should understand the language of function growth.
  Specifically, you should understand formal statements of
  this type:
  — $f(n) \in O(g)$
  — $f(n) \in \Theta(g)$
  — $f(n) \in \Omega(g)$
  You should also recognize the informal use of phrases
  such as “order $n^2$” to describe function growth.

• You should know, and understand the importance of, the
  hierarchy of complexities, part of which is given here:

  $O(\log n) \subset \cdots \subset O(n^{1/2}) \subset O(n) \subset O(n^2) \subset O(n^3) \subset \cdots \subset O(2^n)$







• You should recognize the form of divide and conquer
  recurrences.


Determining growth rates 

• You should understand that we use functions to measure
  the complexity of algorithms, and that the main emphasis
  of complexity analysis involves comparing the behavior
  of functions as the size of the input increases.

• You should be able to recognize and make correct statements
  using O, Θ, and Ω, for example, $\frac{1}{2}n^2 \in O(n^2)$
  and $\frac{1}{2}n^2 \notin O(n)$.

• You should be able to examine a sequence empirically
  using a spreadsheet, in order to investigate its complexity
  properties.

• You should be able to apply Theorem 5 to establish the
  growth rate of recursively defined functions.

• You should be able to apply Theorem 9 (the Simplified
  Master Theorem) to establish the growth rate of divide
  and conquer recurrences.
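
The empirical investigation mentioned above can also be done in a few lines of code instead of a spreadsheet. This is only a sketch: the functions f and g below are hypothetical examples, and a table of ratios gives numerical evidence about growth, not a proof.

```python
def ratio_table(f, g, sizes):
    """Return (n, f(n)/g(n)) pairs; a ratio that stays bounded suggests f is in O(g)."""
    return [(n, f(n) / g(n)) for n in sizes]


def f(n):
    # Hypothetical example: a quadratic with a lower-order term.
    return 0.5 * n * n + 3 * n


sizes = [10, 100, 1_000, 10_000]
against_quadratic = ratio_table(f, lambda n: n * n, sizes)
against_linear = ratio_table(f, lambda n: n, sizes)

for row in against_quadratic:
    print(row)
```

In this example the ratios against n² settle near 1/2, while the ratios against n keep growing, which is consistent with f being of order n² but not of order n.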





Proofs 

• You should be able to use algebraic methods or mathematical
  induction to prove that one function is eventually
  larger than another, in a variety of contexts. For example:
  — The functions may be given in closed form.
  — The functions may be given recursively.
  — The functions may involve the use of the log, floor,
    and ceiling functions.

• For particular functions, you should be able to prove
  statements involving O, Θ, and Ω.
















Combinatorics 


This chapter introduces the basic ideas of combinatorics, one of the major areas
of discrete mathematics. Many people refer to combinatorics questions as
“counting problems.” Although this is an oversimplification, it is true enough
of the topics that we will study in this chapter. We have of course seen counting
problems in earlier chapters as we studied the sizes of various sets to better 
understand the nature of operations on sets and as we discussed the con- 
nection between properties of a function (like one-to-one and onto) and the 
relative sizes of the sets that comprised the function's domain and codomain. 
Hence, combinatorics is very much part of the other topics we have studied up 
to this point, and we will soon see that it is essential to the study of probability 
in the following chapter as well. 

We begin the present chapter with some traditional techniques for an- 
swering questions of the form "How many...?" Along the way, we will en- 
counter some less traditional techniques such as recursive modeling applied 
to more difficult problems. These techniques can, in turn, be applied to other 
topics such as methods for finding closed formulas for recurrence relations. 
We end the chapter with methods for finding closed formulas for recursively 
defined sequences of the type that we encountered in the first two chapters 
of the book. 

In all this, an important ingredient in the study of combinatorics is the 
representation of new objects or situations in terms of simpler objects. A large 
part of this is the ability to recognize when two problems are actually the same. 
This will become one of the most valuable skills you can take away from your 
study of combinatorics. 










5.1 Introduction 





If you ever browse through a standard pocket dictionary, you will notice that the 
word “combinatorics” does not appear within its pages. You may conclude that this 
is a word mathematicians made up on a slow day at the office. The fact is that the field
of combinatorics encompasses many subjects of various flavors. Perhaps the best we 
can do is to examine some common properties that these share to develop a sense of 
what it is all about. 

In combinatorics, we usually deal with finite “structures” (which we discuss 
below) and the properties of the counting numbers. Combinatorics traditionally ad- 
dresses three types of questions: existence, enumeration, and optimization. That is, 
one could ask when a certain prescribed configuration exists, such as a solution to a 
puzzle. Satisfied that one exists, one can ask how many different solutions there are 
or the related question of how likely you are to find it. Finally, there may be some 
measure by which there is a best or worst solution (a solution that takes, e.g., more
than 100 years to find would probably be considered "bad"), so one could ask to find 
a particular one. 

For example, a salesperson wishes to visit 10 particular cities, but he has a lim- 
ited budget. Equipped with an airline schedule, he first sets out to determine if 
he can find a sequence of flights that will take him to all 10 cities on his budget.
Happily, he finds out that he can, so he then greedily tries to determine which
airline is the least expensive. Discouraged by the enormous number of ways to
schedule his trip and the intractability of the problem of finding the very best
deal, he quits his job, becomes a mathematician, and is suddenly more popular at 
parties. 

Enumeration questions especially abound in science and elsewhere: “How many
different tickets are there for the Pennsylvania lottery?” “How many samples from
a box of spare parts should be tested to reasonably conclude that all parts in the box
are acceptable?” “Which of two computer algorithms will run in the shorter amount
of time?”

The point is that all three types of combinatorics problems can commonly arise.
We will mostly be concerned with enumeration and existence questions in this book.
Optimization questions are the subject of broad areas of mathematics such as operations
research and calculus of variations, which generate a lot of interest in their
own right. We will see some examples of optimization questions, although time will
restrain us from discussing the solutions in much detail.

















Finite Structures 


In combinatorics, we work with the positive integers and the most basic of finite
structures, the finite set (e.g., {1, 2, 3} or {John, George}). Note that in a set, the
order in which the elements are listed is irrelevant and each element needs to be
listed only once; that is, the only thing that distinguishes between two sets is which
objects are present and which are not. For example, {3, 1, 2, 3} and {George, John}
describe the same two sets as above. Enumerative problems in combinatorics are
usually stated as questions of the form “How many ways are there to ...?” That is,
a finite set of objects or events will be described, and we will be asked how many
elements the set contains.
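
Python's built-in set type behaves the same way, so it gives a quick check of the two claims above (order is irrelevant, and repeated listings collapse). This is only an illustrative sketch; the variable names are made up.

```python
numbers = {1, 2, 3}
same_numbers = {3, 1, 2, 3}       # listed in another order, with 3 repeated

names = {"John", "George"}
same_names = {"George", "John"}

print(numbers == same_numbers, names == same_names)
print(len(same_numbers))          # counts distinct elements only
```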







Example 1 The question “How many U.S. states have names that begin
with the letter ‘A’?” is the same as the question “How many elements are in the set
{Alabama, Alaska, Arizona, Arkansas}?”


In just a couple of lines, we have already seen a set of numbers, a set of people, 
and a set of states. In each case, there is a larger "universe" from which the set is 
taken. The universes in the examples above might be the set of all living people, the
set of all U.S. states, and the set of all natural numbers. In this sense, the sets we
described are more properly referred to as subsets of the larger sets—that is, sets 
whose elements all derive from the larger set. 





Example 2 The question “How many ways can two winners be chosen for
prizes from a class consisting of just four people (Andrew, Bob, Carly, Diane)?” is the
same as the question “How many elements are in the set


{{A, B}, {A, C}, {A, D}, {B, C}, {B, D}, {C, D}}?”


where we use only the first letter of each person's name for brevity. 


The previous example made an assumption about the way we should represent
“choosing two people for prizes.” Using a set of two people to represent those two
people being chosen implicitly means that (1) there is no difference between the two
prizes, and (2) it is impossible for the same person to win both prizes. Without these
assumptions, we would have chosen different representations:


Example 3 Consider once again the question “How many ways can two winners
be chosen for prizes from a class consisting of just four people (Andrew, Bob,
Carly, Diane)?”


1. Assuming that there is a first prize and a second prize, how can this question be
written as “How many elements are in the set ...?”

2. Assuming that these are two different door prizes (so it matters who gets which
prize) and that the same person could win both, how can this question be written
as “How many elements are in the set ...?”

3. Assuming that the prizes are identical door prizes (so it doesn't matter who gets
which prize) and that the same person could win both, how can this question be
written as “How many elements are in the set ...?”








SOLUTION 


1. We will simply list the winners so that the first person listed gets first prize 
and the second person listed gets second prize. Then the question is “How 
many elements are in the set 


{AB, BA, AC, CA, AD, DA, BC, CB, BD, DB, CD, DC}?”


2. We will simply list the winners so that the first person listed gets one door
prize and the second person listed gets the other door prize. Then the
question is “How many elements are in the set


{AB, BA, AC, CA, AD, DA, BC, CB, BD, DB, CD, DC, AA, BB, CC, DD}?”


3. Since the prizes are identical, we will simply list the winners in alphabetical 
order, with a person listed twice if he or she wins both prizes. Then the 
question is “How many elements are in the set 


{AA, AB, AC, AD, BB, BC, BD, CC, CD, DD}?”





This means that our original innocent question in Example 2 has four possible
different answers, depending on what assumptions are made. We summarize the
answers in Table 5-1, showing the number of ways the prizes can be given for each
of the four interpretations of the problem. The fact that our very first question had
four different answers might understandably cause some concern, but we can avoid
these difficulties by being very explicit in the assumptions for our problems. A key
ingredient for accomplishing this is to agree on names and definitions for the four
different representations used in the problem above. Since all the problems in this
chapter can be represented using one of these basic structures in some shape or form,
this clarification will go a long way toward avoiding confusion. For this same reason,
we should talk a bit more about each of these structures.

In the discussion of the question above, we used two questions about the prizes 
to determine a precise interpretation of the problem. In terms of the representation 
of the problem, the first question “Are the prizes different?” really asks the question 
“Does the order of A and B matter in our representation?” If the prizes are different, 
then we need, for example, both AB and BA on our list—the order in which we list 
the winners makes a difference in what they win.

An ordered list is a finite sequence of objects in which order is important (i.e.,
there is a first object, a second object, etc.). For example, in abcb we are concerned
not only with what objects are there, but also with how many times and in what order
they occur. So bbca and abc are ordered lists, each different from abcb and each other
even though all three lists just use the letters a, b, and c. Note that these particular
examples are ordered lists of letters although the objects in the list can theoretically
be anything.

A natural example of an ordered list is a numeral. Every number can be represented
as a list of digits, each one from {0, ..., 9}, and clearly the order of the digits
is important to the value of the number. For example, 1,132 and 1,321 are each lists
of the digits 1, 1, 2, 3, but they represent different numbers because the digits are in
different orders.

An unordered list is a finite sequence of objects in which we are concerned with
how many of each object occur, not with the order in which they are given. For
example, 1, 2, 3, 1 and 1, 1, 2, 3 are considered to be the same unordered list of
numbers, while 1, 2, 3 is a different one since here the number 1 occurs only once. It
might be more natural to think of the numbers as being “types” of things, so 1, 1, 2, 3
means that we are listing two things of type 1 and one thing of types 2 and 3. We will
give some natural circumstances in which this is a desirable type of structure below.


                                    | Prizes different: Yes | Prizes different: No
Can a person win both prizes? Yes   | 16                    | 10
Can a person win both prizes? No    | 12                    | 6

Table 5-1 Four Interpretations of Example 3

A bridge hand* is a nice example of an unordered list: certainly the order of
the cards in a hand means nothing to a bridge player. The hand is determined only by 
which cards are present. Another example is a bag of groceries. The fact that a bag 
contains two oranges and a gallon of milk is enough to determine its contents—the 
order of the items within the bag is irrelevant. This second example is used by many 
authors to capture the essence of the unordered list structure to the extent that they 
use the word bag for an unordered list. 

We now return our attention to the questions in the table, focusing on the second 
question “Can a person win both prizes?" If the answer is yes, then entries such as AA 
are allowed in our list of winners. We can see that the question could be phrased more 
generally as “Are repetitions allowed?” That is, when we make the list of winners, 
are we allowed to repeat a person? 

The observant reader may have already noticed a redundancy in the fact that a 
set is really an unordered list with the restriction that we can take no more than one 
of each type of available object. Similarly, an ordered list with this same restriction 
is usually called a permutation. We use the special words “permutation” and "set" 
only because they refer to commonly encountered objects and we want to be effi- 
cient in talking and writing about them. Because we have this special terminology for 
the situation where repetition is not allowed, we will use the general terms ordered 
list and unordered list to indicate the more general situation where repetition is al- 
lowed. Table 5-2 summarizes the terminology to be used. Permutations and sets are
so important that we have particular notation for these two special cases. The number
of r-element subsets (sometimes called r-combinations) of the set {1, 2, ..., n}
is denoted by C(n, r), and the number of permutations of length r using elements
of the set {1, 2, ..., n} is denoted by P(n, r). In the next several sections we will develop
tools that will allow us to numerically evaluate C(n, r) and P(n, r), which in turn
allows us to answer questions about the more general ordered and unordered lists.
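
The four structures can be listed by brute force for the prize question, which reproduces the four counts discussed above. The sketch below uses Python's standard itertools module; the variable names are illustrative, and the letters A, B, C, D stand for the four people.

```python
from itertools import (
    combinations,
    combinations_with_replacement,
    permutations,
    product,
)

people = "ABCD"

ordered_lists = list(product(people, repeat=2))                   # order matters, repetition allowed
perms = list(permutations(people, 2))                             # order matters, no repetition
unordered_lists = list(combinations_with_replacement(people, 2))  # order ignored, repetition allowed
sets_of_two = list(combinations(people, 2))                       # order ignored, no repetition

print(len(ordered_lists), len(perms), len(unordered_lists), len(sets_of_two))
```

Here len(perms) is the value P(4, 2) and len(sets_of_two) is the value C(4, 2) in the notation just introduced.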








Practice Problem 1 Decide which of the four structure types (set, unordered list, 
permutation, or ordered list) best characterizes the objects in each of the following 
situations. Can any of the following have more than one answer? List the additional 
assumptions you must make for your representation to be correct. 














                              | Does order matter? Yes | Does order matter? No
Are repetitions allowed? Yes  | Ordered list           | Unordered list (bag)
Are repetitions allowed? No   | Permutation            | Set














Table 5-2 Terminology for Four Basic Structures 


* A bridge hand is played in much the same way as a hand in the game of hearts, if that is more familiar.
† This may be written differently in other books or on your calculator. The number of permutations of
length r can also be expressed as $_nP_r$ or $P^n_r$. Similarly, the number of subsets of size r can be expressed
as $_nC_r$ or $\binom{n}{r}$. We use our notation simply because it is easier to type.




(a) Dealing a five-card (draw) poker hand.
(b) Dealing a two-card blackjack hand.

(c) Creating a game schedule for your favorite sports team.

(d) Filling your orange plastic jack-o'-lantern for trick or treat.


Organization in Counting 


There is a famous story about the prolific mathematician Carl Friedrich Gauss (1777-1855).
As a child, Gauss had a teacher who set his students to doing meaningless
sums to occupy their time. The teacher was not very creative even at coming up with
meaningless sums, so the students were told to add the numbers from 1 to 100. While
the other students started adding 1 to 2, and then this sum to 3, and so on, Gauss
noticed that the sum could be written (1 + 100) + (2 + 99) + (3 + 98) + ··· + (50 + 51);
in other words, 101 added to itself 50 times, resulting in 5,050. So Gauss finished
well before everyone else and was beaten up at recess for being such a geek.
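
Gauss's pairing can be checked directly. The short sketch below (with illustrative variable names) builds the 50 pairs, confirms that each sums to 101, and compares the result with the term-by-term total.

```python
# Pair the i-th smallest term with the i-th largest: (1, 100), (2, 99), ..., (50, 51).
pairs = [(i, 101 - i) for i in range(1, 51)]

pair_sums = {a + b for a, b in pairs}   # every pair should sum to 101
organized_total = 101 * len(pairs)      # 101 added to itself 50 times
direct_total = sum(range(1, 101))       # adding 1 + 2 + ... + 100 one term at a time

print(pair_sums, organized_total, direct_total)
```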

The moral of the story is that some seemingly hard problems can often be done
in a simple way if you give some thought to organization before you begin. In our
problems, we could try to write down everything we are supposed to be counting
and then point to each one and say “one,” “two,” ..., but we would rather be like
Gauss. In fact, one could argue that the main emphasis of this chapter is simply
organization. Along this line, we will try throughout this chapter to uncover and
understand particularly elegant answers to questions and try not to “get beaten up”
in the process.


Example 4 How many permutations of the letters in MATH are there?


SOLUTION If we write them down until we cannot think of any more, we 
might get LIST 1 below. The drawback to this is that we might never be sure if 
the list is complete. Another drawback is that once the list is made, we will have 
to then count the length of the list, and since it is too long (for most people) to
count on their fingers and toes, we may not be sure of the answer we get. 


LIST 1: MATH, AMTH, AMHT, THAM, AHMT, HAMT, HMAT, MHAT, 
THMA, MHTA, HMTA, HATM, AHTM, MAHT, TMAH, MTHA, HTMA, 
TMHA, ATMH, TAHM, ATHM, TAMH, MTAH, and HTAM 





On the other hand, we might organize our answer as LIST 2 in Table 5-3, 
where the permutations are sorted so that each column contains permutations 
starting with the same letter. (Within each column, the entries are arranged 
based on the second letter.) It is clear that it is easier to be sure of the size of
LIST 2 than LIST 1, but we also notice that it is easier to check for completeness.
Thus, by writing the list down in a sensible way, it is much easier to find its length
and be sure of it.
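
An organized list like LIST 2 can also be generated mechanically. The sketch below groups the permutations of MATH into columns by first letter; it is an illustration only, and it sorts alphabetically rather than following the exact column order used in the table.

```python
from itertools import permutations

# All arrangements of the four letters, sorted so equal first letters are adjacent.
words = sorted("".join(p) for p in permutations("MATH"))

# One "column" per starting letter, as in the organized list.
columns = {}
for word in words:
    columns.setdefault(word[0], []).append(word)

for first_letter, column in columns.items():
    print(first_letter, column)
```

Each of the four columns has the same length, so the total is 4 times the length of one column.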





Example 5 Explain how to organize the outcomes to answer the question
“How many ways can two winners be chosen for prizes from a class consisting of just
four people (Andrew, Bob, Carly, Diane)?” from the beginning of this section. (In this
example, one person is allowed to win both prizes.)








MATH | AMTH | TMAH | HMAT
MAHT | AMHT | TMHA | HMTA
MTAH | ATMH | TAMH | HAMT
MTHA | ATHM | TAHM | HATM
MHAT | AHMT | THMA | HTMA
MHTA | AHTM | THAM | HTAM

Table 5-3 LIST 2


SOLUTION As we have seen, the fact that one person is allowed to win both
prizes means that repetition is allowed in the representations of the outcomes.
Since we are not told if the prizes are different (i.e., if order matters in the
representations), we will answer the question under both possible circumstances.
If the prizes are different, then our representations are ordered lists since
order matters and repetition is allowed. We will organize the data in Table
5-4 with four columns corresponding to the possible winners of the first prize.
Within each column we arrange the rows in alphabetical order by the second
prize winner. From this table, it is easy to see that there are 16 possible outcomes.
If the prizes are not different, then our representations are unordered
lists since order does not matter and repetition is allowed. This time we can
organize the outcomes in Table 5-5 with four columns corresponding to the
possible winners of the first prize. To avoid listing the same outcome twice, we
now adopt the convention that within each column we only list those outcomes
whose second prize winner does not come alphabetically before the first prize
winner. This avoids, for example, having both AB and BA (which represent
the same outcome) in our table. In this case, the table has columns of varying
lengths, but we can easily see that the total is 4 + 3 + 2 + 1 = 10.














Practice Problem 2 Consider the question “How many ways can two winners 
be chosen for prizes from a class consisting of just four people (Andrew, Bob, Carly, 
Diane)?” from the beginning of this section, but assume that we know the same person 
is not allowed to win both prizes. 




















Andrew | Bob | Carly | Diane
AA     | BA  | CA    | DA
AB     | BB  | CB    | DB
AC     | BC  | CC    | DC
AD     | BD  | CD    | DD

Table 5-4 Prizes Are Different in Example 5














[Figure 5-1: Tree of outcomes in Example 7, with leaves HHH, HHT, HTH, HTT, THH, THT, TTH, TTT]























Andrew | Bob | Carly | Diane
AA     | BB  | CC    | DD
AB     | BC  | CD    |
AC     | BD  |       |
AD     |     |       |

Table 5-5 Prizes Are the Same in Example 5


(a) Make a table for the list of outcomes for this question, assuming that the prizes
are different. In this case, the outcomes are represented as permutations.

(b) Make a table for the list of outcomes for this question, assuming that the prizes
are the same. In this case, the outcomes are represented as sets.


Example 6 Represent the list of all outcomes of rolling a red six-sided die and
a green six-sided die in an organized way.


SOLUTION We use Table 5-6 with the result of the green die labeling the
columns and the result of the red die labeling the rows.


Example 7 Represent the list of all possible results of tossing a penny, a nickel,
and a dime together.


SOLUTION The list of all possible results of tossing a penny, a nickel, and
a dime together can be simplified by using three letters, for example, HTH, to
reflect the result, where we agree ahead of time to the convention that the first
letter describes how the penny lands (in this example, H stands for “heads”), the
second letter describes how the nickel lands (in this example, T for “tails”), and
the last letter describes how the dime lands. We can illustrate these outcomes
using a “tree” structure, where the first branch denotes the result of the penny,
the second branch the result of the nickel, and the last branch the result of the
dime (see Figure 5-1). One must only imagine walking along the tree from left
to right, and think of tossing coins when one chooses a branch. This turns
out to be a useful alternative to imagining tables in many circumstances [...]

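      | Green 1 | Green 2 | Green 3 | Green 4 | Green 5 | Green 6
Red 1 | (1,1)   | (1,2)   | (1,3)   | (1,4)   | (1,5)   | (1,6)
Red 2 | (2,1)   | (2,2)   | (2,3)   | (2,4)   | (2,5)   | (2,6)
Red 3 | (3,1)   | (3,2)   | (3,3)   | (3,4)   | (3,5)   | (3,6)
Red 4 | (4,1)   | (4,2)   | (4,3)   | (4,4)   | (4,5)   | (4,6)
Red 5 | (5,1)   | (5,2)   | (5,3)   | (5,4)   | (5,5)   | (5,6)
Red 6 | (6,1)   | (6,2)   | (6,3)   | (6,4)   | (6,5)   | (6,6)

Table 5-6 Solution to Example 6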
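
Both the table for the two dice and the tree for the three coins list every combination of independent choices, so either can be generated with nested loops. The following sketch uses illustrative names and the standard itertools.product function.

```python
from itertools import product

# Rows are the red die, columns the green die, matching the table layout.
dice_table = [[(red, green) for green in range(1, 7)] for red in range(1, 7)]

# Reading the tree left to right: penny, then nickel, then dime.
coin_outcomes = ["".join(t) for t in product("HT", repeat=3)]

for row in dice_table:
    print(row)
print(coin_outcomes)
```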







Practice Problem 3 Explain an orderly way to organize each of the following 
lists so that the number in each list is easy to see: 


(a) The list of all permutations of the three letters in the word HAT 

(b) The list of all (different-looking) permutations of the four letters in the word 
BOOK 

(c) The list of all three-element sets using letters from the word GAMES


Combinatorial Equivalence 


There is a concept in discrete mathematics that seems to be simultaneously the least
understood by beginning students and the most ignored by textbooks. On the likely
chance that there is a correlation between these two situations, a word on this subject
seems appropriate. Formally, it is the idea of a “one-to-one correspondence” between
finite sets, but it is perhaps more easily thought of as recognizing when two problems
have the same answer. There are several levels of difficulty among applications of
this idea. The following examples are meant to illustrate these levels. In them we will
be concerned with seeing when two problems are equivalent; we will not actually
answer any of the questions now, but you should keep them in mind.

The easiest example of this phenomenon is illustrated by the following pair of 
questions: 





(i) How many even multiples of 3 are there between 1 and 100, inclusive? 


(ii) How many multiples of 6 are there between 2 and 96, inclusive? 





Notice that if we had explicitly listed the numbers that we were trying to count,
in both cases we would have written 6, 12, 18, ..., 96. So (i) and (ii) are equivalent
because they are asking you to count exactly the same set of numbers; consequently,
you can choose to answer (i) or (ii) (whichever you find easier to understand) and
you know that the other one has the same answer. Unfortunately, this example is so
simple as to be of essentially no use to us later on.

Perhaps more relevant is the next application, which is still easy but not as silly
as the previous example.





Example 8 Without answering either question, explain why (i) and (ii) have
the same answer.


(i) How many multiples of 3 are there between 100 and 300, inclusive?
(ii) How many integers are there between 34 and 100, inclusive?


SOLUTION For each number x in the list counted by (ii), the number 3x is
in the list counted by (i). This can be illustrated as follows:


List (i):    102   105   108   ...   297   300
              ↕     ↕     ↕           ↕     ↕
List (ii):    34    35    36   ...    99   100












Since each item in list (i) corresponds to one item in list (ii) and vice versa,
these two lists have the same length.
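
The correspondence x ↔ 3x can be checked mechanically without counting either list. This is a small sketch with made-up variable names; it tests only that tripling list (ii) reproduces list (i) exactly.

```python
list_i = [x for x in range(100, 301) if x % 3 == 0]   # multiples of 3 from 100 to 300
list_ii = list(range(34, 101))                        # integers from 34 to 100

# Apply the rule x -> 3x to every entry of list (ii).
tripled = [3 * x for x in list_ii]

print(tripled == list_i)
```

Because tripling is reversible (divide by 3), a match here means the two lists are paired off one-to-one.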





Most relevant to our purposes are examples where we can create a correspon- 
dence between questions about different structures. 


Example 9 Without answering either question, explain why (i) and (ii) have
the same answer. 


(i) How many ways are there to distribute three balls—one red, one blue, and one 
green—to 10 people? (Someone getting more than one ball is allowed.) 
(ii) How many integers are there from 0 to 999, inclusive? 


SOLUTION Since in (i) we are considering a set of distributions of balls 
among people and in (ii) we are looking at a set of numbers, we are obviously 
not counting the same set, but we claim the two sets that are being counted 
do have the same size. Certainly, this could be the case “accidentally,” but we 
are interested in finding a reason for it. And if we can find a reason for it, then 
as before, there’s no need to answer both questions. In this example, we can 
imagine the people in (i) being numbered 0, 1, ..., 9, and we will record three
numbers from left to right as "who gets the red ball,” "who gets the blue ball," 
and “who gets the green ball.” 








For example, the distribution in which Person 3 gets red, Person 7 gets blue, and
Person 5 gets green corresponds to the integer 375.

[Illustration: three sample distributions, each matched with its three-digit integer]

So for each way of distributing the balls, there is a corresponding integer
between 0 and 999. Observing that every such integer corresponds to a unique
distribution of balls, we can conclude that (i) and (ii) are counting sets of the
same size.
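
The rule in this example can be written as a function and tested exhaustively. In the sketch below, a distribution is modeled as a triple (red, blue, green) of person numbers from 0 to 9; that encoding and the function name are assumptions made for illustration.

```python
from itertools import product


def to_integer(distribution):
    """Read (who gets red, who gets blue, who gets green) as a three-digit integer."""
    red, blue, green = distribution
    return 100 * red + 10 * blue + green


distributions = list(product(range(10), repeat=3))
images = [to_integer(d) for d in distributions]

# Every integer from 0 to 999 should be hit exactly once.
print(sorted(images) == list(range(1000)))
```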





This sort of argument always requires two components: a rule establishing a 
correspondence between the problems, and an informal explanation as to why every 
element in the set associated with each problem corresponds to a unique element 
in the set associated with the other problem. The previous example illustrates the 
usefulness of this technique as well—we have reduced the complicated-sounding 
problem (i) to a trivial one (ii). This is why the recognition of equivalence between 
two problems is fundamentally important. It allows us to "transform" an unfamiliar 
problem into a familiar one.

We saw above that sometimes establishing that two problems will have the same
answer can be useful. In no subject is this more true than in combinatorics. The next
example shows the power of this idea even in the absence of any good way to deal 
with either problem. 
















Example 10 Without answering either question, explain why (i) and (ii) have
the same answer.


(i) How many sets of size 2 can be made using elements from {1, 2, 3, ..., 9}?
(ii) How many sets of size 7 can be made using elements from {1, 2, 3, ..., 9}?








SOLUTION We must describe how to link each entry in one list with a unique
entry in the other list, so let's try it by considering a generic entry in the list
described by (i). Let S denote the set {1, 2, 3, ..., 9}. With each two-element
subset T = {a, b} of S, associate the seven-element set S − T, consisting of the
seven elements in {1, 2, 3, ..., 9} other than a and b. For example,

{3, 5} ↔ {1, 2, 4, 6, 7, 8, 9}
{1, 9} ↔ {2, 3, 4, 5, 6, 7, 8}
{2, 3} ↔ {1, 4, 5, 6, 7, 8, 9}
{6, 7} ↔ {1, 2, 3, 4, 5, 8, 9}

This rule is fully and uniquely reversible, so this links the entries in the two
lists with a one-to-one correspondence. □
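The complement rule of Example 10 can be checked mechanically. The following is a minimal sketch, not part of the original text; the names `S` and `complement` are chosen here for illustration. It confirms that the rule sends two-element subsets onto seven-element subsets and that applying it twice returns the starting set, which is what "fully and uniquely reversible" means.

```python
from itertools import combinations

S = frozenset(range(1, 10))  # the set {1, 2, ..., 9}

def complement(subset):
    """Send a subset T of S to S - T (illustrative helper)."""
    return S - frozenset(subset)

two_sets = {frozenset(c) for c in combinations(S, 2)}
seven_sets = {frozenset(c) for c in combinations(S, 7)}

images = {complement(t) for t in two_sets}

# Every seven-element set is hit, and no two two-element sets share an image.
assert images == seven_sets
assert len(images) == len(two_sets)

# The rule undoes itself, so it is reversible.
assert all(complement(complement(t)) == t for t in two_sets)
```

Note that the check never needs to know how many sets there are, only that the two collections match up one-to-one.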


Example 11 Without answering either question, explain why (i) and (ii) have
the same answer.

(i) How many different outcomes are there in flipping a coin five times in a row?
(ii) How many sets can be made using elements from {1, 2, 3, 4, 5}?

SOLUTION Picture five numbered blank spaces into which you will enter
"H" or "T" as shown below:

___ ___ ___ ___ ___
 1   2   3   4   5

We will describe how to build an entry in the list for (ii) given a generic
entry in the list for (i). A generic entry for (i) can be written as an ordered
list of length 5 using H's and T's to stand for "heads" and "tails." For example,
THHTH is such an entry. We place the entry into the five numbered blank
spaces, then simply write down (inside the { } brackets) the positions of the
H's. In our example of THHTH, the H's occur in the second, third, and fifth
positions, so we write down the set {2, 3, 5}. □
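The rule of Example 11 and its reverse can be written as two short functions. This is a sketch under the assumption that tosses are recorded as strings of "H" and "T"; the function names are illustrative and do not come from the text.

```python
from itertools import product

def tosses_to_subset(tosses):
    """Return the set of positions (counting from 1) that hold an H."""
    return frozenset(i for i, c in enumerate(tosses, start=1) if c == "H")

def subset_to_tosses(subset, n=5):
    """Reverse rule: put H in the listed positions and T everywhere else."""
    return "".join("H" if i in subset else "T" for i in range(1, n + 1))

all_tosses = ["".join(p) for p in product("HT", repeat=5)]

# Going forward and then backward returns the original toss sequence.
assert all(subset_to_tosses(tosses_to_subset(t)) == t for t in all_tosses)

# Different toss sequences always produce different subsets.
assert len({tosses_to_subset(t) for t in all_tosses}) == len(all_tosses)
```

For instance, `tosses_to_subset("THHTH")` gives the set {2, 3, 5} from the example, and the sequence of five tails corresponds to the empty set.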


Practice Problem 4 Illustrate the rule of the preceding example by filling in the
missing entries in Table 5-7.


We conclude this section with one final example that describes a complicated 
correspondence that we will see again in this chapter. 








5.1 Introduction 379





Result of coin tosses     | THHTT | HTTTT |        |   |       |
Subset of {1, 2, 3, 4, 5} |       |       | {2, 5} | ∅ |       | {1, 4, 5}

(The fifth toss entry is HHHHH.)

Table 5-7 Table for Practice Problem 4





Example 12 The following are equivalent:

(i) How many positive integer solutions are there to x + y + z = 21?
(ii) How many two-element subsets of {1, 2, ..., 20} are there?

SOLUTION We must describe how to link each entry in one list with a unique
entry in the other list, so let's try it by considering a generic object described
by (i), an ordered list of three positive integers x, y, z for which x + y + z = 21.
For example, 1 + 1 + 19, 2 + 3 + 16, and 3 + 2 + 16 are all different solutions
counted in (i). Notice that once x and y are chosen, there is no choice about
the value of z since the three numbers must sum to 21.

Our correspondence will be to associate with each solution x, y, z the subset
{x, x + y}. Since y cannot be zero, we can be sure that x and x + y are two
different values, and since z is positive, the largest x + y can be is 20. Hence, we
can be sure that the set {x, x + y} is a two-element subset of {1, 2, ..., 19, 20},
so this set will certainly be listed in (ii) above.

Table 5-8 illustrates some examples of this correspondence. To see that this
rule is reversible, we consider an item in (ii) and show that there is an item in (i)
corresponding to it. For example, consider the subset {14, 17}. By the way the
rule was given, it is clear that x must be 14, the smaller element of the set. Then
since x + y must equal 17, we see that y is 3. Finally, x + y + z = 21 shows that
z is 4. It is easy to check that the set linked to this solution 14 + 3 + 4 = 21 using
the above correspondence is none other than the set {14, 17}. We can describe
this pattern in general terms as follows. Given a subset {a, b} of {1, 2, ..., 20}
with a < b, let x = a and y = b − a, and then let z = 21 − b.

We can consequently be sure that our rule matches the answers to questions
(i) and (ii) in a one-to-one manner. Hence, we can be sure that these two
questions have the same answer even though we do not yet know what this
answer is! □
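The correspondence in Example 12 and its reverse can also be tested by brute force. The sketch below is an illustration only; the function names are assumptions made here. It lists every positive solution of x + y + z = 21, applies the rule {x, x + y}, and checks that the images are exactly the two-element subsets of {1, 2, ..., 20}.

```python
from itertools import combinations

def solution_to_subset(x, y, z):
    """Forward rule from the example: (x, y, z) goes to {x, x + y}."""
    return frozenset({x, x + y})

def subset_to_solution(subset):
    """Reverse rule: for {a, b} with a < b, take x = a, y = b - a, z = 21 - b."""
    a, b = sorted(subset)
    return (a, b - a, 21 - b)

# All ordered triples of positive integers with x + y + z = 21.
solutions = [(x, y, 21 - x - y) for x in range(1, 20) for y in range(1, 21 - x)]

subsets = {frozenset(c) for c in combinations(range(1, 21), 2)}

# The forward rule lands on every two-element subset of {1, ..., 20}.
assert {solution_to_subset(*s) for s in solutions} == subsets

# The reverse rule recovers the solution we started from.
assert all(subset_to_solution(solution_to_subset(*s)) == s for s in solutions)
```

The subset {14, 17} discussed above comes back as the solution (14, 3, 4), matching the text.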























Practice Problem 5 In Example 12, we checked the reversibility of the original
rule using the following reasoning: Given a set {a, b} with a < b, let x = a, y = b − a,








Solution to x + y + z = 21 | Subset {a, b} of {1, 2, ..., 20}
1 + 1 + 19                 | {1, 2}
3 + 2 + 16                 | {3, 5}
2 + 3 + 16                 | {2, 5}
16 + 1 + 4                 | {16, 17}
18 + 2 + 1                 | {18, 20}

Table 5-8 Table for Example 12







Set {a, b}       | {1, 2} | {19, 20} | {1, 20} | {1, 10} | {10, 20} | {11, 12}
Solution x, y, z |        |          |         |         |          |
Set {x, x + y}   |        |          |         |         |          |

Table 5-9 Table for Practice Problem 5





and z = 21 − b, and then the solution x, y, z will correspond to {a, b}. For each of the
two-element subsets in Table 5-9, give the corresponding solution to x + y + z = 21,
and check that the set {x, x + y} is, in fact, {a, b}.


Solutions to Practice Problems 
1. (a) A five-card poker hand has no repetition and the order in which the cards
are received is irrelevant, so this is a set.

(b) A two-card blackjack hand has no repetition, but since the first card is "down"
and the second is "up" for all to see, the order does matter for the game, so
this is a permutation.

(c) This depends on the sport: the order of games certainly matters, but repetition
in a schedule is allowed in some sports (like baseball) and not others (like the
schedule of Venus Williams's tennis matches at Wimbledon, a single-elimination
tournament). Hence, this can be either a permutation or simply an ordered list.

(d) Anyone who has dumped his or her candy on the floor and sorted it knows that
the only thing that matters is how many of each piece of candy you get. Hence,
this is an unordered list (of candy types) in which repetition is allowed.

2. (a) In Table 5-10, each column contains those permutations for which the person
whose name is at the top of the column receives the first prize.

(b) In Table 5-11, each column contains those sets for which the person whose name
is at the top of the column is the first of the winners in alphabetical order.

















Table 5-10 Solution for Practice 
Problem 2(a) 





Table 5-11 Solution for Practice 
Problem 2(b) 


3. In each case we can make a table.

(a) The list of all permutations of the three letters in the word HAT can be
arranged as in Table 5-12 so that entries in the same column start with the same
letter. This gives us three columns, each of which will have two rows, for a total
of six objects.

(b) The list of all different-looking permutations of the four letters in the word
BOOK can be arranged as in Table 5-13 so that entries in the same column start
with the same letter. This gives us three columns, but not every column has the
same number of rows. Altogether this gives us 3 + 3 + 6 = 12 entries.

(c) The list of all three-element sets using letters from the word GAMES can be
arranged so that entries in the same column have the same first letter in
alphabetical order. For example, in the E column goes every three-element set of
letters for which E is the first letter in the set in alphabetical order. In
this case, that means those sets that contain an E but

H   | A   | T
HAT | AHT | THA
HTA | ATH | TAH

Table 5-12 Solution for Practice Problem 3(a)











not an A. With this organization, there will be three
columns (do you see why there is no M column and
no S column?), each of which has a different number


Exercises for Section 5.1 


1. As in Examples 2 and 3, convert each question into a question of the form
"How many elements are in the set..." In case of ambiguity, clearly state any
assumptions you make about order and repetition. If the corresponding set is
large, just describe how the elements are listed, and list five or six sample
elements of the set.

(a) As a reward for a job well done, a child is allowed to reach into a bag and
grab any two candy bars. The bag contains five each of the following: KrazyKat
Bar, MilkyMorning Bar, LusciousLemon Bar. How many different ways are there for
the child to make







A         | E         | G
{A, G, M} | {E, G, M} | {G, S, M}
{A, G, S} | {E, S, G} |
{A, S, M} | {E, S, M} |
{A, E, G} |           |
{A, E, M} |           |
{A, E, S} |           |

Table 5-14 Solution for Practice Problem 3(c)


of rows, so the best we can do is organize our list
as in Table 5-14. This table does not have a nice
shape, but we can tell that it is complete. There are
6 + 3 + 1 = 10 such three-element sets.

4. The completed table is shown in Table 5-15.

5. The completed table is shown in Table 5-16.





(b) If the child in the previous problem plans to give the first bar she takes
out of the bag to her brother and eat the second one herself, how many different
ways are there for her to make her choice?

(c) When packing for a short trip, a traveler chooses three shirts from a closet
that contains nine shirts. How many ways are there to do this?

(d) A shopper is buying three shirts from a store that stocks nine different
types of shirts. How many ways are there to do this, assuming the shopper is
willing to buy more than one of the same shirt?



































her choice? 

Result of coin tosses     | THHTT  | HTTTT | THTTH  | TTTTT | HHHHH           | HTTHH
Subset of {1, 2, 3, 4, 5} | {2, 3} | {1}   | {2, 5} | ∅     | {1, 2, 3, 4, 5} | {1, 4, 5}

Table 5-15 Table for Practice Problem 4

Set {a, b}       | {1, 2}     | {19, 20}   | {1, 20}    | {1, 10}    | {10, 20}    | {11, 12}
Solution x, y, z | 1 + 1 + 19 | 19 + 1 + 1 | 1 + 19 + 1 | 1 + 9 + 11 | 10 + 10 + 1 | 11 + 1 + 9
Set {x, x + y}   | {1, 2}     | {19, 20}   | {1, 20}    | {1, 10}    | {10, 20}    | {11, 12}

Table 5-16 Solution for Practice Problem 5













(e) An ice cream shop offers eight flavors of ice cream. How many different
two-scoop dishes are possible if the two scoops are placed side by side in the
dish?

(f) An ice cream shop offers eight flavors of ice cream. How many different
two-scoop dishes are possible if the two scoops are placed one on top of the
other?

2. Follow the directions for Exercise 1.

(a) A state lottery game consists of drawing a ping pong ball from each of three
machines containing ten balls labeled from 0 to 9. How many possible outcomes
are there for the game? (A winning ticket must match the numbers drawn in the
same order they were drawn.)

(b) A security box guarding a sensitive area has three buttons colored red,
green, and blue. To enter the area, you must enter a security code, pressing
four buttons in succession. How many different security codes are there?

(c) A store sells apples, bananas, and oranges. I want to buy two pieces of
fruit to have for lunch. How many possible ways are there to do this?

(d) In the previous problem, if I'm buying one piece for myself and one for my
best friend, how many possible ways are there for me to buy two pieces of
fruit?

(e) Near the end of a clearance sale, a store has only three bicycles left for
sale: a black one, a green one, and a red one. I want to buy two bicycles. How
many possible ways are there to do this?

(f) In the previous problem, if I'm buying one bike for myself and one for my
best friend, how many possible ways are there for me to buy two bikes?

3. For the situations described in Exercises 1 and 2, decide which of the four
structure types (set, unordered list, permutation, or ordered list) best
characterizes the situation.

4. Decide which of the four structure types (set, unordered list, permutation,
or ordered list) best characterizes the objects in each of the following
situations. Can any of the following have more than one answer?

(a) Dealing a 13-card hand for the card games bridge or hearts

(b) Selecting three officers (president, vice president, and secretary) for a club

(c) Rolling a pair of dice

(d) … 12 marbles from a jar containing … green, and blue marbles

(e) Ordering pizza toppings from a menu of choices

(f) Setting a batting order for a baseball team





5. Example 6 presents an organized list of all outcomes of rolling a red
six-sided die and a green six-sided die. Use that list to answer these questions.

(a) How many possible outcomes are there?

(b) Of these, for how many is the sum of the values on the two dice equal to 7?
For how many is the sum of the values equal to 11? If you roll a pair of dice,
are you more likely to roll a 7 or to roll an 11?

(c) For how many of the outcomes is the sum of the values 5 or less? What
percentage of the possible outcomes does this represent?

6. Example 6 presents an organized list of all outcomes of rolling a red
six-sided die and a green six-sided die. Use that list to answer these questions.

(a) Of the listed outcomes, how many are "doubles" (i.e., both dice have the same
value)? What percentage of the possible outcomes does this represent?

(b) How many are doubles and have the sum of the two dice values less than 4?

(c) How many have a 5 on exactly one of the dice?

(d) How many have a 5 on at least one of the dice?

7. In Example 7, we presented a tree as a handy representation of the possible
results of tossing a penny, a nickel, and a dime. Use that tree to answer these
questions.

(a) How many possible outcomes are there?
(b) Of these, how many have exactly one head?
(c) How many have at least one head?
(d) How many have exactly two heads?

8. Use the tree from Example 7 to answer these questions.

(a) Of the outcomes shown, for how many does the result on the penny match the
result on the dime?

(b) Which is more likely, that the result on the penny matches the result on the
dime, or that they do not match?

(c) For how many of the outcomes do all three coins match?

(d) For how many of the outcomes do exactly two of the coins match?

(e) For how many outcomes is the number of tails less than the number of heads?

9. The tree in Figure 5-2 represents a "best of 3" tennis match between two
players labeled A and B. In the ordered list representation, the letters A and B
indicate a set won by Player A or B, respectively, where the first letter refers
to the first set, the second letter to the second set, and the third letter to
the third set when necessary. Use this tree to answer the questions that follow.

(a) How many different results are there?
(b) Of these, for how many does the match end after two games?














Figure 5-2 Game tree for Exercise 9.


(c) For how many does Player A win in exactly three games?

(d) Explain why the answer to part (c) is the same as the answer to this
question: "How many ordered lists of length 2 taken from {A, B} have exactly
one A?" (Hint: To win in exactly three games, how many of the first two games
must Player A have won?)

10. The idea in Exercise 9 can be easily extended to a "best of 5" match in which
the first player to win three sets wins the match. Answer the following
questions about this type of match:

(a) Give a tree to represent a "best of 5" match between Players A and B.

(b) How many ways are there for the match to end after three games?

(c) How many ways are there for Player A to win in exactly four games?

(d) Explain why the answer for part (c) is the same as the answer to the
question "How many ordered lists of length 3 taken from {A, B} have exactly
two A's?"

(e) How many ways are there for Player A to win in exactly five games?

(f) Fill in the blanks. The answer to part (e) is the same as the answer to the
question "How many ordered lists of length ____ taken from {A, B} have
exactly ____ A's?"

11. In Example 4, we presented an organized list of all permutations of the
letters in the word MATH. Use that list to answer these questions.

(a) How many permutations are there? Can you answer this without actually
counting them all?

(b) Make a copy of the table in Example 4, and change each H to a T. Now some of
the entries in the table are the same. For example, both MATH and MAHT have
become MATT. Divide the original table entries into groups using the rule "Two
original entries go in the same group if they now look the same on your copy."
How many groups are there? How big is each group?

(c) Use part (b) to answer the question "How many distinct arrangements are
there for the letters in the word MATT?" as a division problem.










12. Answer these questions by organizing the items to be counted in any way you
like.

(a) How many different outfits can John form if he has three shirts (red, green,
and yellow) to choose from, and two pairs of pants (black, white)?

(b) A new board game has a standard six-sided die, and a spinner with three
colors: red, white, and blue. A player takes a turn by tossing the die and
spinning the spinner. How many different possible results are there?

(c) A security box guarding a sensitive area has three buttons colored red,
green, and blue. To enter the area, you must enter a security code, pressing
four buttons in succession. How many different security codes are there?

(d) A shopper wants to buy a total of three pieces of fruit, and he only likes
apples, bananas, and oranges. How many different ways are there for him to
select the three pieces of fruit?

(e) How many arrangements are there of the letters in the word HEAR?

(f) Using your answer to part (e) as a starting point, how many arrangements are
there of the letters in the word HEARD?

13. Answer these questions by organizing the items to be counted in any way you
like.

(a) Sara does an online search for flights to visit her grandmother. She is
given a choice of four flights to her grandmother's city, and a choice of three
flights for the return trip. How many different ways are there for her to
schedule her trip?

(b) How many different-looking arrangements are there of the letters in the word
DEED?

(c) How many arrangements are there of the letters in the word GAMES?

(d) How many different-looking arrangements are there of the letters in the word
PUZZLE?

(e) If we did a national survey asking people for their favorite two days of the
week, how many different responses could we possibly get?

14. A binary sequence is simply an ordered list using only digits chosen from
{0, 1}, where we usually suppress the commas that ordinarily separate list
items. For example, 010, 110, and 011 are different binary sequences of
length 3.

(a) How many binary sequences are there of length 3?

(b) How many binary sequences are there of length 4? Of these, how many end
with a 1?

(c) How many binary sequences of length 4 use exactly two 1's and two 0's?



















(d) How many binary sequences of length 5 do not have two adjacent 1's?

(e) How many binary sequences of length 5 look the same reading them forward as
backward? (For example, 11011 counts since its reverse is also 11011, but 10110
does not count since it is not identical with its reverse, 01101.)

15. Answer these questions about the list of positive integers less than
100,000.

(a) How many multiples of 7 are there?
(b) How many times does the digit 9 appear?
(c) How many perfect squares are there?

16. Each of the following problems is about counting the factors of numbers in
an organized way:

(a) How many positive factors does 300 have? (Hint: Make a table with three
columns: those divisible by 4, those divisible by 2 but not 4, and those not
divisible by 2. Use the fact that 300 = 2^2 · 3 · 5^2 to help you organize your
table.)

(b) How many positive factors does 600 have?

(c) How many positive factors does 9,576 have? (Hint: 9,576 = 2^3 · 3^2 · 7 · 19.)

(d) A whole number is square-free if it is evenly divisible by no perfect
squares (other than 1).

i. How many of the positive factors of 300 are square-free?

ii. How many of the positive factors of 9,576 are square-free?

17. This exercise explores a method that can be used to attack problems of the
type given in Exercise 16.

(a) How many ordered lists of length 2 with entries taken from the set {0, 1, 2}
are there?

(b) The answer to part (a) is the same as the answer to the question "How many
positive factors does 36 have?" To see this, link each ordered list a, b to the
positive factor 2^a 3^b. Fill in Table 5-17 with examples of how this
correspondence works.

(c) Explain in your own words why this correspondence matches each ordered list
entry to exactly one positive factor of 36, and is reversible. (Hint: Write
36 = 4 · 9 = 2^2 · 3^2.)

18. These questions have the same answer.

i. How many distinguishable arrangements of the letters in OIIOO are there?
ii. How many two-element subsets of {1, 2, 3, 4, 5} are there?

A function that demonstrates this fact takes a given arrangement of the letters
in OIIOO, and returns the list of positions containing the I's.






































Ordered list | Factor of 36
0, 0         | 2^0 3^0
0, 1         |
1, 2         |
2, 2         |
             | 4
             | 6
             | 36

Table 5-17 Table for Exercise 17


(a) Describe in words the reverse function.
(b) Fill in Table 5-18 to illustrate the correspondence.

19. These questions have the same answer.

i. How many eight-digit binary sequences have three 1's, no two of which are
adjacent?

ii. How many three-element subsets of {1, 2, 3, 4, 5, 6} are there?

Here is a function that demonstrates this fact: Given an eight-digit binary
sequence containing three 1's, no two of which are adjacent, write down the
three positions containing the 1's. Decrease the second number you wrote by 1,
and decrease the third number by 2.

(a) Describe in words the reverse function.

(b) Fill in Table 5-19 to illustrate the correspondence.









































OIIOO | {2, 3}
OOOII |
OOIOI |
IOOOI |
      | {1, 4}
      | {3, 4}
      | {1, 3}
      | {2, 5}

Table 5-18 Table for Exercise 18









































10101000 | {1, 2, 3}
01010100 |
00100101 |
00010101 |
         | {1, 3, 5}
         | {1, 3, 6}
         | {3, 4, 5}
         | {2, 4, 6}

Table 5-19 Table for Exercise 19


20. Explain why (i) and (ii) have the same answer in each of the pairs of
questions below. You do not need to actually answer either question.

(a) i. How many distinguishable arrangements of the letters in ABABA are there?
ii. How many five-digit binary sequences are there with exactly three 1's?

(b) i. How many two-element subsets of the set {1, 2, 3, 4, 5} are there?
ii. How many three-element subsets of the set {1, 2, 3, 4, 5} are there?

(c) i. How many permutations of all five of the objects in {a, 1, x, 3, 9} are
there?
ii. How many permutations of length 4 of the objects in {a, 1, x, 3, 9} are
there?

(d) i. How many ways are there to flip three heads in five tosses of a coin?
ii. How many three-element subsets of {1, 2, 3, 4, 5} are there?

21. Explain why (i) and (ii) have the same answer in each of the pairs of
questions below. You do not need to actually answer either question.














(a) Rooks attack each other if they are on the same row or column of a
chessboard.

i. How many ways can eight rooks (which all look the same) be placed on a
chessboard so that no two rooks can attack each other?

ii. How many arrangements are there of the letters ABCDEFGH?

(b) A positive factor of an integer n is a positive integer by which n can be
evenly divided.

i. How many positive factors does 537,824 have? (Hint: This number is 2^5 · 7^5.)

ii. How many different outcomes result from rolling a pair of distinguishable
six-sided dice?

(c) i. How many different paths can a rook take to move from the lower-left
corner to the upper-right corner of a chessboard if it never moves downward or
leftward?

ii. How many different arrangements are there of the following characters?

AAAAAAABBBBBBB

22. The following problems look at the factors of a special kind of number that
occurs frequently in discrete mathematics. The general explanation of the
pattern is quite difficult to formalize at this level, but students who like a
good challenge should enjoy working on it.

(a) What is the largest power of 2 that evenly divides 11!? NOTE: We write n!
(read "n factorial") for the product n(n − 1)(n − 2) ··· (2)(1).

(b) What is the largest power of 2 that evenly divides 234!?

(c) Notice that if you write 11 in base two, it is 1011, which has three 1's in
it, and 11 − 3 = 8, and that 2^8 is the answer to the question in part (a). Now
notice that 234 in base two is 11101010, which has five 1's in it, and
234 − 5 = 229, and that 2^229 is the answer to part (b). Show that this is no
coincidence: establish that for any positive integer n, the above process will
produce the exponent of the highest power of 2 that evenly divides n!.
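Before attempting the proof asked for in part (c), it can help to test the pattern numerically. The sketch below is only an experiment, not a proof, and its function names are made up for this illustration. It finds the exponent of 2 in n! by repeated division and compares it with n minus the number of 1's in the binary form of n.

```python
from math import factorial

def power_of_two_in_factorial(n):
    """Exponent of the highest power of 2 dividing n!, by direct division."""
    value = factorial(n)
    exponent = 0
    while value % 2 == 0:
        value //= 2
        exponent += 1
    return exponent

def n_minus_binary_ones(n):
    """n minus the count of 1's in the base-two representation of n."""
    return n - bin(n).count("1")

# The two values quoted in the exercise.
assert power_of_two_in_factorial(11) == n_minus_binary_ones(11) == 8
assert power_of_two_in_factorial(234) == n_minus_binary_ones(234) == 229

# The pattern holds for every n tried here, which is evidence but not a proof.
assert all(power_of_two_in_factorial(n) == n_minus_binary_ones(n)
           for n in range(1, 300))
```

A check over finitely many values cannot replace the general argument, but it does confirm that the claim is worth proving.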






















5.2 Basic Rules for Counting 





In this section we apply our ideas about organization in counting to answer some
standard kinds of problems. In particular, we will derive "formulas" for two of the
finite structures from the previous section.

Example 1 In a small Southern state, a license plate consists of one or two
letters from {A, L, B, M} followed by four or three (respectively) digits. How many
license plates are possible?


SOLUTION Let's call a license plate "type I" if it has one initial letter and
"type II" if it has two initial letters. The list of the type I license plates can
be organized as shown in Table 5-20. This is a table with 10,000 rows and four
columns, so we can say it has 40,000 entries.

Table 5-21 shows a similar organization of the type II license plates. It will
only have 1,000 rows, but it will have more columns. In particular, the first row of
the table will consist of the letter combinations shown in Table 5-21 along with
the digits 000. Thus, the table of type II license plates will have 1,000 rows and
16 columns (one for each of the letter combinations). We can conclude that
there are 16,000 type II plates. Combining this with the number of type I plates,
we determine that the answer to the original question is 40,000 + 16,000 =
56,000. □
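The count in Example 1 is small enough to confirm by listing every plate. The following sketch does so; the variable names are illustrative assumptions and the plates are represented as plain strings.

```python
from itertools import product

LETTERS = "ALBM"
DIGITS = "0123456789"

# Type I: one letter followed by four digits.
type_one = ["".join(p) for p in product(LETTERS, DIGITS, DIGITS, DIGITS, DIGITS)]

# Type II: two letters followed by three digits.
type_two = ["".join(p) for p in product(LETTERS, LETTERS, DIGITS, DIGITS, DIGITS)]

assert len(type_one) == 4 * 10_000   # four columns, 10,000 rows
assert len(type_two) == 16 * 1_000   # sixteen columns, 1,000 rows

# No plate is of both types, so the two counts can simply be added.
assert set(type_one).isdisjoint(type_two)
total = len(type_one) + len(type_two)
assert total == 56_000
```

The disjointness check matters: adding the two counts is only justified because no plate appears in both lists.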




















The above example used an organized way of counting the possible license
plates, but we had to make some decisions along the way as to how to break up the
problem. Also, when we counted the type II license plates, we did not even write
down the list. Instead, we just thought about what the list would look like in tabular
form. This was enough to get a visual image of the list, allowing us to subsequently
determine the size of the list. The basic organizational techniques in this problem
come up so often that it is convenient to give them names so that we can talk about
them more easily.


The Rule of Products 


Formally, the rule of products states that the number of ordered pairs with a first 
coordinate from set A and a second coordinate from set B is equal to the product of 
the number of elements in each set. Informally, the rule says that if we can organize 





A0000 | L0000 | B0000 | M0000
A0001 | L0001 | B0001 | M0001
...   | ...   | ...   | ...
A9999 | L9999 | B9999 | M9999

Table 5-20 Organizing the Type I Plates in Example 1













AA | LA | BA | MA
AL | LL | BL | ML
AB | LB | BB | MB
AM | LM | BM | MM

Table 5-21 Organizing the Type II Plates in Example 1


our list into a table with m rows and n columns, then we know that the list has (m)(n)
entries total. We saw this in Example 1 when we counted each type of license plate.


Example 2 How many ways are there to order a meal consisting of one sandwich
and one beverage at a restaurant that serves five different sandwiches and six
different beverages?








SOLUTION Call the sandwiches 1, 2, 3, 4, 5 and the beverages A, B, C, D, E,
F. Then the orders could be written as in Table 5-22. Since this is a table with
six rows and five columns, the list has a total of 30 entries. □


There is a more practical way to state the rule of products that does not involve 
writing down the actual table: 


Rule of products: If each entry in a list can be created by first selecting one of
x objects and then one of y objects, then the list has a total of (x)(y) entries. In
terms of sets, this means that n(A × B) = n(A) · n(B) for all finite sets A and
B.

In the example above, the entries are made by first selecting one of the six objects
in {A, B, C, D, E, F} and then one of the five objects in {1, 2, 3, 4, 5}, so the list has
length (5)(6) = 30.

The advantage to this latter way of thinking of the product rule is that it is easier
to picture adding "steps" to a selection process than it is to add dimensions to a table.










A1 | A2 | A3 | A4 | A5
B1 | B2 | B3 | B4 | B5
C1 | C2 | C3 | C4 | C5
D1 | D2 | D3 | D4 | D5
E1 | E2 | E3 | E4 | E5
F1 | F2 | F3 | F4 | F5

Table 5-22 Organizing Orders in Example 2










[Figure: selection tree with three steps: 1. choose a ones digit; 2. choose a different tens digit; 3. choose a different hundreds digit; followed by the resulting number chosen.]

Figure 5-3 Part of the selection tree.


Example 3 How many numbers between 100 and 1,000 have three distinct
odd digits? (For example, 153 should be counted but not 133 or 123.)

SOLUTION Each of these numbers can be constructed in a three-step process.
First choose a number from the five digits in {1, 3, 5, 7, 9} and make this the
ones digit. Next choose the tens digit from the four digits left after our ones digit
is removed from {1, 3, 5, 7, 9}. Third, choose a hundreds digit from the three
digits left after our first two choices have been removed from {1, 3, 5, 7, 9}.
Applying the product rule gives us

5 · 4 · 3 = 60

such numbers. □
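A direct search gives an independent check on the answer to Example 3. This is a small sketch with illustrative names; it tests every number from 100 to 999 rather than following the three-step selection.

```python
ODD_DIGITS = set("13579")

def has_three_distinct_odd_digits(n):
    """True when n is a three-digit number whose digits are odd and all different."""
    digits = str(n)
    return len(digits) == 3 and set(digits) <= ODD_DIGITS and len(set(digits)) == 3

matches = [n for n in range(100, 1000) if has_three_distinct_odd_digits(n)]

# Agrees with the product rule count 5 * 4 * 3.
assert len(matches) == 5 * 4 * 3
```

The search and the product rule arrive at the same total by entirely different routes, which is a useful habit when a count is in doubt.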





A three-step process corresponds to a three-dimensional table if we hold onto
our original analogy. A more versatile visual image for these multistep processes is
a selection tree as shown in Figure 5-3 for the example above.

The branches shown indicate that 7 was chosen as the ones digit and 1 was
chosen as the tens digit. At the right side of the tree we see all final outcomes that
correspond to these choices. If you can visualize the entire tree, you will see that each
of the five original branches has four branches, giving a total of 20 branches after the
second step. Each of these 20 branches, in turn, has three branches in the last step,
so altogether the final list at the right will consist of (5)(4)(3) = 60 numbers, just as
our solution predicted.








Practice Problem 1 Use the product rule to answer these questions. 


(a) How many binary sequences of length 3 are there? 

(b) In how many ways can one be dealt the first two cards in blackjack, first card 
face-down and second card face-up? Assume that you are using a standard deck 
of 52 cards. 


The Rule of Sums 


Formally, the rule of sums states that the number of elements in the union of two
disjoint sets is equal to the sum of the number of elements in each set. Informally,
this means that if the list being enumerated can be broken into two disjoint parts,
then the number of entries in the whole list is the sum of the number of entries in
the two parts. We applied this rule in Example 1 when we added the number of type
I plates to the number of type II plates to get the total number of license plates.

















Rule of sums: If the list to count can be split into two disjoint pieces of size x
and y, then the original list has x + y entries. In terms of sets, we can write this
as n(A ∪ B) = n(A) + n(B), provided A and B are disjoint.


Recall that the product rule, stated for a two-step process, had an obvious
generalization to a multiple-step process. Similarly, we stated the rule of sums for two
disjoint pieces, but it generalizes to more than two pieces, provided that every two
pieces are disjoint.

Example 4 How many positive integers less than 1000 consist of distinct digits
from {1, 3, 7, 9}?





SOLUTION We split our problem into three disjoint pieces: those with one
digit, those with two digits, and those with three digits. So the solution is the
sum of (i) the number of one-digit numbers using distinct digits from {1, 3, 7, 9},
(ii) the number of two-digit numbers using distinct digits from {1, 3, 7, 9}, and
(iii) the number of three-digit numbers using distinct digits from {1, 3, 7, 9}, and
each of (i), (ii), and (iii) can be calculated using the product rule as above. (Try
it now!) So the answer is 4 + (4)(3) + (4)(3)(2) = 40. □
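The split into one-, two-, and three-digit numbers in Example 4 can be mirrored in code. This sketch (names are illustrative) builds each piece separately, adds the sizes as the rule of sums prescribes, and compares the total with a plain search over the integers below 1000.

```python
from itertools import permutations

DIGITS = "1379"

# One disjoint piece for each possible length.
pieces = {k: ["".join(p) for p in permutations(DIGITS, k)] for k in (1, 2, 3)}
counts = {k: len(v) for k, v in pieces.items()}

# Rule of sums: the pieces are disjoint, so their sizes add.
total = sum(counts.values())

# Independent check by searching 1..999 directly.
brute = [n for n in range(1, 1000)
         if set(str(n)) <= set(DIGITS) and len(set(str(n))) == len(str(n))]

assert counts == {1: 4, 2: 4 * 3, 3: 4 * 3 * 2}
assert total == len(brute)
```

Each piece is itself counted by the product rule, so the example uses both rules together.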















We now continue with more complex counting problems that combine the sum 
and product rules. 


Example 5 In how many ways can one win a dice game played with three
distinguishable dice in which a winning roll is one that has at least two values that are
the same?





SOLUTION If we think of the dice as being colored red, green, and white,
then we can represent an outcome as an ordered list of length 3 with elements
taken from {1, 2, 3, 4, 5, 6} in which the first number is the roll of the red die, the
second number is the roll of the green die, and the third number is the roll of
the white die. In this manner every winning roll looks like one of the following:
XXY, XYX, YXX, or XXX, where X and Y are different numbers from {1, 2, 3, 4, 5, 6}.
In each of the first three of these cases, there are six ways to choose X and five
ways to choose Y, while in the fourth case there are only six outcomes altogether
(namely 111, 222, 333, 444, 555, and 666). There are a total of (3)(6)(5) + 6 = 96
winning rolls.
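
The count in Example 5 can be double-checked by listing all 216 rolls and keeping the winners. The sketch below is a brute-force check, not the textbook's method; the helper name `winning_rolls` is hypothetical.

```python
from itertools import product


def winning_rolls():
    """List the rolls of three distinguishable dice with a repeated value."""
    # A roll (red, green, white) wins when at least two dice agree,
    # that is, when it shows fewer than three distinct values.
    return [r for r in product(range(1, 7), repeat=3) if len(set(r)) < 3]


print(len(winning_rolls()))  # 96
```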














Practice Problem 2 If we roll a six-sided die three times and record the result as
an ordered list of length 3, how many of the possible outcomes contain exactly one 1?
(Hint: Divide the list into three disjoint sets: those where the 1 is the first roll, those
where it is the second, and those where it is the third.)


Example 6 How many three-letter sequences of Greek letters do not consist
of three of the same letter? (There are 24 letters in the Greek alphabet.)


SOLUTION Each three-letter sequence of Greek letters is either (i) a string
of three of the same letter or (ii) a string that we are supposed to count. So by








390 Chapter 5 / Combinatorics 


the sum rule, the total number of three-letter sequences of Greek letters is the 
number of sequences of type (i) plus the number of sequences of type (ii). 

By the product rule, the total number of three-letter sequences of Greek
letters is (24)(24)(24), and clearly the number of sequences of type (i) is 24 since
there is one such sequence for each letter. Therefore, the number of sequences
of type (ii) is $24^3 - 24 = 13{,}800$.
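
The subtraction in Example 6 can be mirrored in a few lines of code. This is a minimal sketch under the assumption that letters can be stood in for by the integers 0 to 23; the function name `count_not_all_same` is invented for illustration.

```python
from itertools import product


def count_not_all_same(alphabet_size, length=3):
    """Count sequences of the given length that are not one letter repeated."""
    letters = range(alphabet_size)  # stand-ins for the actual letters
    total = alphabet_size ** length  # product rule
    # Type (i): sequences made of a single repeated letter.
    all_same = sum(1 for s in product(letters, repeat=length) if len(set(s)) == 1)
    return total - all_same  # what is left is type (ii)


print(count_not_all_same(24))  # 13800
```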





Example 6 is a slight twist of the sum rule that is fairly common. If a problem 
asks for the number of a certain type of objects that have a specific property, then 
by knowing the total number of this type of object and the number of these objects 
that do not have the property, we can find the answer to the original problem by 
simple subtraction. In fact, any time the total number of objects is obvious, instead of 
answering a given counting question, we can answer the complementary problem that 
asks how many of the objects are not counted by the original question. Since every 
problem has an associated complementary problem, one always has the freedom to 
choose whether the original problem or its complement seems more approachable.
This is sometimes just a matter of personal taste, but often it makes the difference
between an elegant solution and a mess. 


Rule of complements: If there are x objects, and y of those objects have a 
particular property, then the number of those objects that do not have that 
particular property is x — y. In terms of sets, using U for the universal set, we 
can write this as n(A’) = n(U) — n(A) for all sets A with elements from U. 


Example 7 How many five-digit numbers use distinct digits from {0, ..., 6}?
How many of these are odd? How many are even?


SOLUTION Place the digits left to right since the leftmost (ten thousands’
place) digit is the only one with a special restriction (i.e., it cannot be 0). The
product rule applied to this five-step process will give us (6)(6)(5)(4)(3) = 2,160.
For the second part, do the same thing, but this time place the ones’ digit
first (it now has the greatest restriction since it must be odd). The leftmost
digit should be placed next (since it is the second most restricted), and then
the final digits can be placed in any order. These five steps can be completed in
(3)(5)(5)(4)(3) = 900 ways.
For the third part, since every five-digit number using the digits {0, ..., 6}
is either odd or even, the answer to the third part can be found by subtracting
the previous answer from the total: 2,160 − 900 = 1,260.
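
All three answers in Example 7 can be confirmed by scanning every five-digit number. The sketch below assumes the digit set {0, ..., 6} stated in the solution; the function name `five_digit_counts` is hypothetical.

```python
def five_digit_counts():
    """Return (total, odd, even) counts for Example 7 by direct scanning."""
    allowed = set("0123456")
    total = odd = 0
    for n in range(10000, 100000):  # five-digit numbers never start with 0
        s = str(n)
        if len(set(s)) == 5 and set(s) <= allowed:
            total += 1
            odd += n % 2  # adds 1 exactly when n is odd
    return total, odd, total - odd  # even count by the rule of complements


print(five_digit_counts())  # (2160, 900, 1260)
```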








Practice Problem 3 How many rolls of three distinguishable dice result in the 
largest number showing being either a 5 or a 6? (For example, the roll 554 counts 
since the largest number showing is 5, but the roll 334 does not count since the largest 
number showing is 4.) 


One difficulty in using the sum rule comes from the fact that sometimes the 
“natural” way to break a list into pieces does not create disjoint pieces. Since we saw 
that the sum rule can be stated in terms of the size of the union of sets, you might 
recall from Section 3.1 that a more general fact is true. This more general statement 
makes the rule of sums easier to implement in practice. 
















Example 8 If we roll a six-sided die three times and record the result as an
ordered list of length 3, in how many of the $6^3 = 216$ possible outcomes are there
exactly one 1 or exactly one 6?


SOLUTION We will use the set notation from Section 3.1. Let A be the set
of all rolls in which there is exactly one 6, and let B be the set of all rolls
in which there is exactly one 1. It is easy (see Practice Problem 2) to count
$n(A) = 3 \cdot 5^2 = 75$ and $n(B) = 3 \cdot 5^2 = 75$, but we cannot apply the sum rule to
answer the question because A and B are not disjoint. However, it is possible
to account for the overlap between A and B. Namely, $A \cap B$ is the set of all rolls
in which there is exactly one 6 and exactly one 1, so $n(A \cap B) = 3 \cdot 2 \cdot 4 = 24$
(see Exercise 5). Hence to find $n(A \cup B)$, we can add $n(A) + n(B)$ and then
subtract the overlap, $n(A \cap B)$. That is,

$$n(A \cup B) = n(A) + n(B) - n(A \cap B) = 75 + 75 - 24 = 126.$$
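
The sets A and B from Example 8 are small enough to build explicitly, which makes the overlap visible. This is an illustrative check using Python's built-in set operations; the variable names follow the example.

```python
from itertools import product

rolls = list(product(range(1, 7), repeat=3))  # all 216 ordered outcomes

A = {r for r in rolls if r.count(6) == 1}  # exactly one 6
B = {r for r in rolls if r.count(1) == 1}  # exactly one 1

print(len(A), len(B), len(A & B), len(A | B))  # 75 75 24 126

# The union is smaller than n(A) + n(B) by exactly the overlap.
assert len(A | B) == len(A) + len(B) - len(A & B)
```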





It is very useful to be able to break a problem into cases that
are not necessarily disjoint. Note that the following formal statement is still true even
when the pieces are disjoint, so this is truly a more general rule of sums.














Rule of sums with overlap: If the list to count can be split into two
pieces of size x and y, and the pieces have z objects in common, then the original
list has x + y − z entries. In terms of sets, we can write this as $n(A \cup B) =
n(A) + n(B) - n(A \cap B)$ for all sets A and B.








Although this rule does generalize to more than two pieces, the generalization 
is not as easy to apply. In general, if you divide a problem into more than two pieces, 
make sure there is no overlap among the pieces. 


Algorithms for Counting 


As we progressed through the examples in this section, we informally developed 
a way of counting by imagining ourselves building each entry in a list through a 
sequence of steps. The idea of writing an informal algorithm for generating objects 
is of fundamental importance in discrete math. For those familiar with computer 
programming, what we are doing is imagining writing a program to print the entire 
list that we want. For example, the following “pseudo-code” would print the list 
associated with the problem, “How many odd numbers having distinct digits are 
between 100 and 1,000?" 











For each U from {1,3,5,7,9} do
    For each H from {1,2,3,4,5,6,7,8,9} with H ≠ U do
        For each T from {0,1,2,3,4,5,6,7,8,9} with T ≠ H and T ≠ U do
            Print H,T,U
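
For readers who want to run the pseudo-code, here is one possible Python rendering. It is a sketch, not part of the original text: the function name `odd_distinct_three_digit` is made up, and the numbers are collected in a list instead of printed so they can be counted.

```python
def odd_distinct_three_digit():
    """Generate the odd numbers between 100 and 1,000 with distinct digits."""
    results = []
    for u in (1, 3, 5, 7, 9):          # ones' digit: must be odd
        for h in range(1, 10):         # hundreds' digit: nonzero, differs from u
            if h == u:
                continue
            for t in range(10):        # tens' digit: differs from h and u
                if t != h and t != u:
                    results.append(100 * h + 10 * t + u)
    return results


numbers = odd_distinct_three_digit()
print(len(numbers))  # 320
```

Each loop has the same number of iterations regardless of the outer choices (5, then 8, then 8), which is what lets the product rule apply.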













We definitely do not care to write code for all our problems, so we will usually say
something like “Choose a ones’ digit from {1, 3, 5, 7, 9}, choose a different hundreds’
digit from {1, 2, ..., 9}, and choose a tens’ digit from {0, 1, ..., 9} different from the
first two choices.” This describes the same algorithm in English. Now we can look at
either description and say that there are five ways to do the first step, eight ways to
do the second step (regardless of what choice was made for the first step), and eight
ways to do the third step (again regardless of what choices were made in the first two
steps). By the product rule, this algorithm generates (5)(8)(8) = 320 samples.

The parenthetical conditions above are very important. It must be the case that 
the number of choices at each stage is independent of previous choices. 








Example 9 For the problem, “How many three-digit numbers greater than 200 can be formed using
distinct digits from {0, ..., 6}?” we could write the following three-step algorithm:
[1] Choose a ones’ digit.

[2] Choose a different tens’ digit.

[3] Choose a hundreds’ digit different from the other two.





Analyze this algorithm using the product rule. 


SOLUTION When we analyze the algorithm, we see that there are seven
choices in completing the first step, six choices in the second step, leaving us
only to count the choices in the last step. But here is the problem: If we choose,
say, 3 as our ones’ digit and 5 as our tens’ digit, then the hundreds’ digit must be
chosen from the three objects in {2, 4, 6}. On the other hand, if we chose 2 as
our ones’ digit and 0 as our tens’ digit, then the hundreds’ digit must be chosen
from the four objects in {3, 4, 5, 6}. So the number of choices for the hundreds’
digit depends on the previous choices.

All we can conclude from this attempted solution is that this particular
algorithm escapes analysis with the product rule, so either the algorithm or
the analysis should be changed for this problem. See Exercise 28 at the end of
this section to fix the solution.
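
The dependence described in this solution can be tabulated. The sketch below rests on an assumption read off from the two cases in the solution, namely that the hundreds' digit must come from {2, ..., 6}; the function name `hundreds_choice_counts` is hypothetical.

```python
from collections import Counter

DIGITS = range(7)  # the digits 0 through 6


def hundreds_choice_counts():
    """Tally how many hundreds' digits remain after each (ones, tens) choice."""
    counts = Counter()
    for ones in DIGITS:
        for tens in DIGITS:
            if tens == ones:
                continue
            # Assumption: the hundreds' digit must be at least 2 and unused.
            options = [h for h in DIGITS if h >= 2 and h not in (ones, tens)]
            counts[len(options)] += 1
    return counts


print(sorted(hundreds_choice_counts().items()))  # [(3, 20), (4, 20), (5, 2)]
```

Since the third step has 3, 4, or 5 options depending on the earlier choices, no single factor can be used for it in the product rule.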





Practice Problem 4 Explain why the following algorithm does not correctly gen- 
erate the list of all results of rolling a sum of 10 on three distinguishable six-sided dice. 
(We will represent the outcomes of the dice rolls as an ordered list of length 3 with 
elements taken from {1, 2, 3, 4, 5, 6}.)


[a] Choose any element of {1, 2, 3, 4, 5, 6} for the first roll.
[b] Choose any element of {1, 2, 3, 4, 5, 6} for the second roll.
[c] Fill in the third number by subtracting the sum of the previous two numbers from
10.


Applying the product rule to this algorithm tells us that there will be (6)(6)(1) = 36 
ways to roll three dice to get a sum of 10. This is wrong! 


General Formulas to Count Ordered Lists and Permutations 


We are now ready to apply the concepts in this section to derive "counting formulas" 
for some of the basic finite structures we introduced in Section 5.1. 










Theorem 1 The number of ordered lists from {1, ..., n} of length r is $n^r$.


PROOF Think of the r places for entries in the ordered list. Choose any number
from {1, ..., n} for the first space, any number from {1, ..., n} for the second space,
and so on for all r spaces. By the product rule, there will be

$$\underbrace{(n)(n) \cdots (n)}_{r \text{ times}} = n^r$$

of these ordered lists.


Example 10 How many r-digit binary sequences (i.e., ordered lists from {0, 1}
of length r) are there?


SOLUTION By Theorem 1, there are $2^r$ of them.
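
Theorem 1 is easy to spot-check for small values. The sketch below uses `itertools.product` from the Python standard library to build the ordered lists; the helper name `ordered_lists` is invented for this note.

```python
from itertools import product


def ordered_lists(n, r):
    """All ordered lists of length r with entries from {1, ..., n}."""
    return list(product(range(1, n + 1), repeat=r))


# Spot-check Theorem 1 on a few small cases.
for n, r in [(2, 3), (6, 3), (4, 2)]:
    assert len(ordered_lists(n, r)) == n ** r

print(len(ordered_lists(2, 3)))  # 8, the binary sequences of length 3 (relabeled 1 and 2)
```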





Definition 


1. Remember that a permutation is just an ordered list in which no element
is repeated. The number of permutations* from {1, ..., n} of length r is
denoted by P(n, r).

2. We write n! for $P(n, n) = (n)(n - 1) \cdots (3)(2)(1)$ with the stipulation that
0! = 1. We read the notation n! as “n factorial.”








Theorem 2 For any nonnegative integers n and r, $P(n, r) = n \cdot (n - 1) \cdot (n - 2) \cdots (n - r + 1)$.
Note that if n < r, then this product will be 0, consistent with our
definition; and that if n ≥ r, this product can be written as $\frac{n!}{(n - r)!}$.


PROOF Thinking of it in the same way as in the proof of Theorem 1 except 
with the constraint that at each step we may only choose from objects not already 
selected, we have n choices for the first space, n — 1 choices for the second space, 


and so on, giving us the desired formula (by the product rule) after all r spaces
have been filled.
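
The product in Theorem 2 translates directly into a loop. The following is a minimal sketch: the function name `P` mirrors the text's notation, and the comparison against `math.perm` assumes Python 3.8 or later, where that function is available.

```python
import math


def P(n, r):
    """Number of permutations of length r from {1, ..., n}, as a product."""
    result = 1
    for k in range(r):
        result *= n - k  # factors n, n - 1, ..., n - r + 1
    return result


print(P(20, 9))  # 60949324800
print(P(7, 7))   # 5040
print(P(3, 5))   # 0, because the factor (3 - 3) appears when n < r

# Cross-check against the standard library.
assert all(P(n, r) == math.perm(n, r) for n in range(8) for r in range(8))
```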





Example 11 How many batting orders for a baseball team are possible from
a roster of 20 players?


SOLUTION A batting order is an ordered list from the set of available players
of length 9 without repetition, that is, a permutation. So by Theorem 2, there are
$P(20, 9) = \frac{20!}{11!} = (20)(19) \cdots (12) \approx 6 \times 10^{10}$ of them.


Example 12 How many ways are there to arrange seven people in a line?


SOLUTION Call the people 1, 2, 3, ..., and 7. An arrangement in the line
of these people corresponds to a permutation from {1, ..., 7} of length 7, and
by Theorem 2, there are P(7, 7) = 7! = 5,040 of these.


* Other notation that is sometimes used includes ${}_nP_r$ and $P^n_r$.







Example 13 In how many ways can three married couples arrange
themselves in a movie theater line, assuming that spouses always stand together?


SOLUTION Call the couples Smith, Jones, and Williams. Form the movie
lines in a four-step process as follows. Choose a permutation of the three names
in 3! = 6 ways, choose an order for the couple first in this permutation to stand
in 2! = 2 ways, choose an order for the couple second in the permutation to
stand in 2! = 2 ways, and choose an order for the couple last in the permutation
to stand in 2! = 2 ways. By the product rule, there are (6)(2)(2)(2) =
48 ways.
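
The four-step count in Example 13 can be compared against a brute-force search over all 6! = 720 line-ups. The labels such as "S1" and "S2" below are placeholders invented for this sketch, as is the helper `couples_together`.

```python
from itertools import permutations

# Hypothetical labels: two spouses for each of Smith, Jones, Williams.
COUPLES = [("S1", "S2"), ("J1", "J2"), ("W1", "W2")]


def couples_together(line):
    """True when every pair of spouses stands in adjacent positions."""
    pos = {person: i for i, person in enumerate(line)}
    return all(abs(pos[a] - pos[b]) == 1 for a, b in COUPLES)


people = [p for couple in COUPLES for p in couple]
count = sum(1 for line in permutations(people) if couples_together(line))
print(count)  # 48
```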








Practice Problem 5 You have a geography test consisting of a column of 10 
countries and a column of 20 export products—your task is to match each country 
to its top export, and you are instructed that exactly 10 of the export products will be 
correct answers (each for only one listed country) and the other 10 are red herrings.*
If you have not studied at all, how many different ways are there to complete the test
by just guessing?


Solutions to Practice Problems 

1 (a) We can form the binary sequence in three steps, from
left to right: (i) Select the first digit, (ii) select the second
digit, (iii) select the third digit. There are two
ways to do each step, and thus 2 · 2 · 2 = 8 binary
sequences. This is illustrated by the tree shown in Figure 5-4.
For example, choosing a 0 in the first step, a 1
in the second step, and a 1 in the third step will result
in the sequence 011 shown in blue in Figure 5-4.

(b) This problem is implicitly a two-step process: deal
the face-down card, then deal the face-up card.
There are clearly 52 ways to do the first step. Then,
no matter which card has been dealt, there are 51
cards left, so there are 51 ways to do the second step.
Hence, the number of different two-card blackjack
hands is 52 × 51 = 2,652.





Figure 5-4 Tree for Practice Problem 1(a).


2 There are 1 · 5 · 5 outcomes where the only 1 is the first roll (one
choice for the first roll, five for the second, and five for
the third). Similarly, there are 5 · 1 · 5 where the only
1 is the second roll, and 5 · 5 · 1 where the only 1 is the
third roll. The total is $3 \cdot 5^2 = 75$.

3 As in Example 5, we will represent an outcome as an
ordered list of length 3 with elements taken from {1, 2,
3, 4, 5, 6}. The largest number rolled must be either 5 or
6, so in order to not be counted, a roll would have to be
represented as an ordered list of length 3 from the set
{1, 2, 3, 4}. There are $4^3 = 64$ such ordered lists, leaving
$6^3 - 4^3 = 152$ rolls that must have a 5 or 6.

4 Follow the algorithm's instructions to see what it is doing.
For example, if we choose 3 in the first step and 4 in
the second step, we end up with the “roll” 343, but if we
choose 1 in the first step and 1 in the second step, we end
up with 118, which is not a possible dice roll. Similarly,
if we choose 6 in the first step and 5 in the second step,
we cannot have a negative result on the third die! The
problem with this algorithm is that it generates several
things that we do not want. Hence, the answer of 36 it
gives us is too large. The correct answer is 27, which we
will see how to get in the next section.
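
Running the faulty algorithm from Practice Problem 4 makes the over-count concrete. This is a small sketch with made-up names (`generated_by_algorithm`, `candidates`, `valid`); it only filters the 36 generated triples for legal third rolls.

```python
from itertools import product


def generated_by_algorithm():
    """Follow steps [a], [b], [c] literally, with no validity check."""
    return [(a, b, 10 - a - b) for a, b in product(range(1, 7), repeat=2)]


candidates = generated_by_algorithm()
# Keep only triples whose third entry is an actual die face.
valid = [c for c in candidates if 1 <= c[2] <= 6]

print(len(candidates), len(valid))  # 36 27
print((1, 1, 8) in candidates)      # True: the impossible "roll" 118
```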

5 For each of the 10 countries, you are choosing one
of the 20 products, so we can represent your answer
sheet as an ordered list of length 10 where the elements
come from the 20 products without repetition. The total
number of these answer sheets is then P(20, 10) =
670,442,572,800, a big enough number that studying is
definitely better than guessing!


* Unless the top export of a country is red herring, in which case this item is a wild goose.








Exercises for Section 5.2 




Be clear on how you are organizing your thoughts and applying the basic counting rules in answering the
following questions.


1. (a) How many ordered lists with entries from {1, ..., 6}
of length 10 are there?

(b) How many of the above never have a 6 appear?

2. (a) A true-false test contains 10 questions. In how many 
ways can a student answer the questions if every 
question is answered? 

(b) A true-false test contains 10 questions. In how many
ways can a student answer the questions if some
questions may be left unanswered?

(c) A matching test contains 10 questions, and each answer
must be used exactly once. In how many ways
can a student answer the questions if every question
is answered?

3. A certain mid-Atlantic state has a simple rule for its
license plates: Use three letters followed by four digits.
(a) How many possible plates are there?

(b) Suppose the state court has ruled that 97 of the pos- 
sible three-letter combinations are offensive and 
cannot be used. Now how many possible plates are 
there? 

4. A certain large Northeastern state has a strange rule
for the two-letter combinations that are legal for its
license plates. To keep people's plates from accidentally
spelling possibly offensive words, they use the rule
that any license plate that has a vowel as its first letter
must also have a vowel as its second letter. For example,
CD and OE are legal combinations, but ON is not. How
many two-letter prefixes are possible on license plates
in this state? (They consider only the five letters A, E,
I, O, and U to be vowels.)

5. If we roll a six-sided die three times and record the result
as an ordered list of length 3, in how many of the
$6^3 = 216$ possible outcomes are there exactly one 1 and
exactly one 6? (Hint: Form the entries in the list using
three steps: Decide which position will contain the 1,
decide which position will contain the 6, and fill in the
remaining position.)

6. In how many ways can a club with 17 members elect
a president, vice president, and secretary (assuming no
person can fill more than one office) for each situation
described?

(a) There are no restrictions.

(b) Susan has removed herself from consideration for
president due to a busy schedule, but she is willing
to serve in either of the other offices.

(c) Sam has indicated that he will serve as president
only if Mary is named as the vice president.

(d) The club's bylaws require that last year’s vice president
becomes this year's president.

7. An organization has 10 male and 7 female members.
In how many ways can the organization elect a president,
vice president, and secretary for each situation
described?

(a) The president must be female.

(b) The president can be either sex, but the vice president
must be female and the secretary must be male.

(c) The president and vice president must be of the
same sex.

(d) All three officers may not be of the same sex.

8. An organization has 8 math majors, 12 computer science
majors, and 6 science majors. In how many ways
can the organization elect a president, vice president,
and secretary for each situation described?
(a) The three officers are all the same major.
(b) At least one of the officers must be a science major.
(c) Either the president or the vice president must be
a math major.

9. We will represent the results of three tosses of a six-sided
die as an ordered list of length 3 with entries from
{1, 2, 3, 4, 5, 6}.
(a) How many different results are possible?

(b) In how many of these are all three tosses different?

10. We will represent the results of four tosses of a coin as
an ordered list of length 4 with entries from {H, T}.
(a) How many different results are possible?
(b) In how many of these are the first two tosses the
same?

11. We will represent the results of five cards being dealt
from a standard deck of 52 cards as a permutation of
length 5 with entries from the set of cards.
(a) How many different results are possible?
(b) In how many of these does the first card have the
same value as the last card?
(c) In how many do all five cards have the same suit?

12. (a) How many arrangements are there of the letters in
the word MATCH?
(b) Of these, how many have the letters M and A side
by side in that order? (Hint: Think of MA as a
single object, so you are counting arrangements of
the four objects MA, T, C, and H.)
(c) How many have the letters M and A side by side
but in the order AM?








13. (a) How many arrangements are there of the letters in
the word EMPHATIC?

(b) Of these, how many have the letters E and M side
by side in either EM or ME order?

14. A blackjack hand consists of two cards, the first of which
is dealt face-down and the second face-up.

(a) In how many blackjack hands is the face-up card an
ace and the face-down card a 10?

(b) In how many blackjack hands does the face-up card
have a value of jack, queen, or king, while the face-down
card has a value between 2 and 9, inclusive?

(c) In how many blackjack hands is the face-up card an
ace and the face-down card a club?

15. A combination lock has three numbers in the combination,
each in the range 1 to 50.

(a) How many different combinations are there?

(b) How many of the combinations have the first and
second number matching?

(c) How many of the combinations have exactly two of
the three numbers matching?

16. There are 16 marbles numbered 1 to 16 in a box. Marbles
1 to 5 are red, marbles 6 to 8 are green, and marbles 9 to
16 are blue. I draw out four marbles, one at a time without
replacing them, and record the result as an ordered
list of four color/number combinations. For example, I
record R4, G6, R1, B10 if the marbles drawn are, in order,
no. 4, no. 6, no. 1, and no. 10.

(a) How many possible results are there?

(b) Of these, for how many are both the first and last
marbles red?

(c) For how many are the first and second marbles
different colors?

(d) For how many are all four marbles the same color?

17. In Exercise 16, how do your answers to the questions
change if each drawn marble is replaced in the box before
the next marble is drawn?

18. On a TV game show, nine scarves numbered 1 to 9 are
placed in a basket and thoroughly mixed. Then they are
drawn out one at a time.

(a) How many possible results are there?

(b) Of these, how many have all five odd numbers first,
followed by four even numbers?

19. How many of the results counted in Exercise 18 follow
the pattern “even, odd, even, odd, even, odd, even, odd,

20. A guest speaker in a department seminar chooses six audience
members. One after another he asks them their
birth month, writing their responses as an ordered list
of length 6.

(a) How many different results are possible?

(b) Of these, for how many are there no duplicate
months listed?

(c) What percent of the possible results have a duplicate
month? If the speaker always bets each audience
a dollar that there will be a duplicate, is this a
good bet for the speaker?

21. The professor in your class asks each of the 30 students
in the class his or her birthday, writing their responses
as an ordered list of length 30. Assume no one is born
on February 29.

(a) How many different results are possible?

(b) Of these, for how many are there no duplicate birthdays
listed?

(c) What percent of the possible results have a duplicate
birthday? Is this percent higher than you expected,
lower than you expected, or about what you
expected?

22. If we toss a coin 10 times in succession and record the
outcomes as an ordered list of length 10 using entries
from {H, T}, how many of the $2^{10}$ possible outcomes
satisfy each of these conditions?

(a) Begin with three “heads” in a row.

(b) End with three “tails” in a row.

(c) Begin with three “heads” in a row and end with
three “tails” in a row.

(d) Begin with three “heads” in a row or end with three
“tails” in a row.





23. A certain club is forming a recruitment committee consisting
of five of its members. They have calculated that
there are 8,568 different ways to form this committee.
Of these, 700 have exactly one woman, 2,520 have exactly
two women, 3,360 have exactly three women, 1,680
have exactly four women, and 252 have exactly five
women.
(a) How many committees have at least three women?
(b) How many committees have no women?
(c) How many committees have at most two women?

24. The club of the preceding exercise has two members
named Jack and Jill. They have calculated that 2,380 of
the committees have Jack on them, 2,380 have Jill, 1,820
have Jack but not Jill, 1,820 have Jill but not Jack, and
560 have both Jack and Jill.
(a) How many committees have either Jack or Jill?
(b) How many committees have neither Jack nor Jill?
(c) Jack and Jill are carpooling, so they insist that if
either one is on the committee, the other person
must also be on the committee. How many committees
meet this condition?

(d) Jack and Jill have had a fight. Jack says, “If Jill is
on the committee, I won't be.” Jill says, “If Jack is
on the committee, I won't be.” How many of the
committees meet this condition?

25. The following questions follow up on the principle of
inclusion-exclusion, discussed in Section 3.1 and Example 8
of this section.

(a) Of the $6^3 = 216$ results of rolling a six-sided die
three times discussed in Example 8, how many of
them include getting exactly one 1, exactly one 2,
or exactly one 6?

(b) There are $6^4 = 1{,}296$ results of rolling a six-sided
die four times in succession. How many of these include
getting exactly one 1, exactly one 2, or exactly
one 6?


26. Write each of the following as a product (e.g., P(9, 4) =
9 · 8 · 7 · 6), and then evaluate the product:

(a) P(19, 5)

(b) P(6, 6)

(c) P(120, 17)

27. Fill in the blanks to write each of the following products
in the form P(n, r):

(a) 9 · 8 · 7 · 6 = P(__, __)

(b) 10 · 9 · 8 · 7 · 6 · 5 · 4 · 3 · 2 · 1 = P(__, __)

(c) 365 · 364 · 363 ··· 338 = P(__, __)


ay 
4) (n 1) (m) (m= 
P 


1):(m-2)(mm-3)- 





(km) = 





(e) ke (k= 1) (k= 2) PC , 
phased! 
28. Solve the problem in Example 9 by changing the algorithm
so that the product rule can be applied.

29. Here is an algorithm for Exercise 7(b), which counts
the ways to elect officers in an organization of 10 males
and 7 females, if the president can be either sex, but
the vice president must be female and the secretary
must be male: “There are 17 choices for the president,
7 for the vice president, and 10 for the secretary, giving
17 · 7 · 10 = 1,190 different results.” This is incorrect.
Explain why, and give a correct algorithm.

30. Here is an algorithm for Exercise 11(b), which counts
the five-card hands where the first card dealt has the
same value as the last card dealt: “There are 52 choices
for the first card, 51 for the second, 50 for the third, 49 for
the fourth, and 3 for the last, giving 52 · 51 · 50 · 49 · 3 =
19,492,200 possible hands.” This is incorrect. Explain
why, and give a correct algorithm.













31. Here is an algorithm for Exercise 8(c), which counts the
ways to elect officers where either the president or vice
president must be a math major. (The organization has
8 math majors, 12 computer science majors, and 6 science
majors.) “If the president is a math major, there are 8
ways to choose the president, 25 for the vice president,
and 24 for the secretary. There are the same number
when the vice president is a math major, so the total
is 2 · 8 · 25 · 24 = 9,600.” This is incorrect. Explain why,
and give a correct algorithm.

32. Here is an algorithm for Exercise 24(a): “There are
2,380 committees that have Jack, and 2,380 that have
Jill. By the sum rule, there are 2,380 + 2,380 = 4,760
that have either Jack or Jill.” Explain the error, and
correct the algorithm.

33. For part (d) of Exercise 24, a student reasons correctly
that we want to count committees that do not
include both Jack and Jill, and that this is logically the
same as “don’t have Jack, or don't have Jill.” However,
his counting algorithm is incorrect. Explain why,
and correct it. The algorithm is: “By the complement
rule, there are 8,568 − 2,380 = 6,188 committees that
don’t have Jack, and 8,568 − 2,380 = 6,188 committees
that don't have Jill. By the sum rule, there are
6,188 + 6,188 = 12,376 that either don't have Jack, or
don't have Jill.”

34. How many five-digit numbers have all distinct digits and 
are odd? 

35. How many numbers from the set {1, 2, ..., 9,999} use
exactly two different digits? (For example, 1,121 should
be counted, but 3,333 and 1,231 should not.)

36. How many four-digit numbers use the digit 7? 

37. How many positive integers have distinct digits, all of 
which are odd? 

38. How many positive integers have distinct digits? 

39. (a) How many four-digit odd numbers have digits only 

from the set [1. 
(b) How many of these have at least one digit repeated?
(c) How many of these have only one digit repeated?
40. (a) Using only the digits 1, 2, 3, 4, 5, and 6, how many
five-digit numbers can be formed?
(b) How many of these have at least one 5?
(c) How many of them have either no 5 or no 6?

41. In how many ways can five families of four stand in a
movie theater line if each family must stay together?

42. How many subsets of {1, 2, 3, 4, 5} have two elements?
How does this compare with the number of permutations
of length 2 with entries from {1, 2, 3, 4, 5}?

43. Prove by induction that for all n ≥ 2, the number of
two-element subsets of {1, ..., n} is $\frac{n(n-1)}{2}$.




























5.3 Combinations and the Binomial Theorem 





We have seen that the rules of sums and products are remarkably simple building
blocks that can be used to solve many enumeration problems. In this section, we will
extend these rules to do even more.





Counting with Equivalence Classes 





Example 1 How many two-element subsets of {1, 2, 3, 4} are there?


The difference between permutations and subsets is simply a matter of whether
we care about the order of the entries: in a permutation we do and in a subset we
do not. Specifically, we consider 2 3 and 3 2 to be different permutations while we
consider {2, 3} and {3, 2} to be the same set.

We saw in Section 4.5 that when we have a notion of “equivalence” on a set of
objects, we call this an equivalence relation. More important, we call a set of objects
that are equivalent to one another an equivalence class. In this example, there is an
equivalence relation on the set of all permutations of length two with entries from
{1, 2, 3, 4}. This equivalence relation can be described as “would look the same if
they were sets.” For example, we would say that

{2 3, 3 2}

is an equivalence class since the permutations therein would look the same if they
were sets.
Here are some other equivalence classes for this equivalence relation:
• {1 3, 3 1}
• {2 4, 4 2}
• and others


This provides the key idea for answering the question in Example 1.


SOLUTION (To Example 1) We know that the number of permutations of
length 2 with entries taken from {1, 2, 3, 4} is P(4, 2) = 4 · 3 = 12. If we use the
equivalence relation “would be the same if they were sets,” each equivalence
class contains two permutations, so we will have $\frac{12}{2} = 6$ equivalence classes.
Table 5-23 shows the 12 permutations in the 6 equivalence classes. Each equivalence
class (by its very definition) identifies a single two-element subset of
{1, 2, 3, 4}. In this problem, there is a small enough number of these to simply
list:

{1, 2}, {1, 3}, {1, 4}, {2, 3}, {2, 4}, {3, 4}

So the answer to the question is 6, but more important, we have developed an
approach to a more general question.
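
The grouping of permutations into equivalence classes can be carried out mechanically. In this sketch each class is keyed by a `frozenset`, which plays the role of “what the permutation would look like as a set”; the function name `equivalence_classes` is invented for illustration.

```python
from collections import defaultdict
from itertools import permutations


def equivalence_classes(n, r):
    """Group the length-r permutations from {1, ..., n} by their entry sets."""
    classes = defaultdict(list)
    for perm in permutations(range(1, n + 1), r):
        classes[frozenset(perm)].append(perm)  # same set, same class
    return classes


classes = equivalence_classes(4, 2)
print(len(classes))                           # 6 classes, one per subset
print({len(c) for c in classes.values()})     # {2}: every class has 2 members
print(classes[frozenset({2, 3})])             # [(2, 3), (3, 2)]
```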





" In the language of Section 45, equivalence classes are simply the parts of the partition induced by an 
‘equivalence relation, The important thing is that equivalence classes form a collection of nonempty 
subsets ofS, for which every element of S is in exactly one subset, or class. 
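The grouping used in the solution to Example 1 can be checked by brute force. The following is a minimal Python sketch, not part of the original text; the helper name `classes_by_underlying_set` is our own. It sorts every length-2 permutation of {1, 2, 3, 4} into the class of permutations that "would be the same if they were sets."

```python
from collections import defaultdict
from itertools import permutations


def classes_by_underlying_set(items, r):
    """Group the length-r permutations of `items` by the set of entries they use.

    Two permutations land in the same class exactly when they would look
    the same if they were sets.
    """
    classes = defaultdict(list)
    for perm in permutations(items, r):
        classes[frozenset(perm)].append(perm)
    return classes


classes = classes_by_underlying_set([1, 2, 3, 4], 2)

# Print each class next to the two-element subset it identifies.
for subset, members in sorted(classes.items(), key=lambda kv: sorted(kv[0])):
    print(sorted(subset), members)

# The number of classes is the number of two-element subsets.
print(len(classes))
```

Each class printed holds exactly two permutations, so the count of classes is the 12 permutations divided by 2.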








3 Combinations and the Binomial Theorem 399. 








{12, 21} | {13, 31} | {14, 41} | {23, 32} | {24, 42} | {34, 43}

Table 5-23 Equivalence Classes in Example 1


Practice Problem 1 Use the idea of equivalence classes to explain why the number
of three-element subsets of {1, 2, 3, 4, 5} is P(5, 3)/3!.








Definition 


• We will write C(n, r) (usually read "n choose r") for the number* of sets
from {1, ..., n} of size r.


• Sets from {1, ..., n} of size r are also called r-combinations from
{1, 2, ..., n}.








Theorem 1 C(n, r) = P(n, r)/r!. Note that if n ≥ r, this can also be written as n!/(r! (n − r)!).


PROOF We imagine stretched out before us the permutations of length r with
entries taken from {1, 2, ..., n}, and we concentrate on the equivalence classes
using the same equivalence relation as in Example 1. A permutation of length
r from {1, 2, ..., n} consists of an ordered list with r different objects; hence, we
know that any of the r! ways to arrange these r objects will look the same as the
original permutation if they were sets (i.e., if the order of them did not matter).
Therefore, each of the equivalence classes on the P(n, r) total permutations
is made up of r! permutations, and each of these equivalence classes corresponds
to a single subset of {1, 2, ..., n}. Hence, the number of r-element subsets of
{1, 2, ..., n} is

P(n, r)/r!.
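As a numerical sanity check on Theorem 1 (not a substitute for the proof), the sketch below computes P(n, r)/r! and compares it with two independent counts. The function name `choose` is our own; `math.perm` and `math.comb` are standard-library functions available in Python 3.8 and later.

```python
from itertools import combinations
from math import comb, factorial, perm


def choose(n, r):
    """C(n, r) computed as in Theorem 1: P(n, r) permutations, r! per class."""
    return perm(n, r) // factorial(r)


# Compare with the library value and with an explicit listing of subsets.
for n in range(8):
    for r in range(n + 1):
        listed = len(list(combinations(range(1, n + 1), r)))
        assert choose(n, r) == comb(n, r) == listed

print(choose(4, 2))    # two-element subsets of {1, 2, 3, 4}
print(choose(100, 5))  # five-member committees from 100 people
```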





Example 2 How many five-person committees can be formed from the
100-member U.S. Senate?


SOLUTION This is the definition of C(100, 5): choose five members from
the 100-element set, where order does not matter and repetitions are not allowed.
By Theorem 1, this is P(100, 5)/5! = 100!/(5! · 95!) ≈ 7.5 × 10^7.


Practice Problem 2 In draw poker, a player receives a hand of five cards from a 
standard deck. A flush is a hand that has all five of its cards of the same suit. (There 


* Other notations for this number are nCr and the binomial coefficient symbol (n over r).








400 Chapter 5 / Combinatorics 


are four suits in a standard deck, with 13 cards in each suit.) How many draw poker 
hands are flushes? 


Before we begin to solve problems using our nice formula for C (n, r), let's look 
at one more example of the idea of counting with equivalence classes. 


Example 3 In how many ways can six children hold hands
in a circle?


SOLUTION Note that given two such arrangements, as long as all the children
are holding the same hands on the same sides in the two configurations,
the arrangements are considered the same. For example, the two arrangements
in Figure 5-5 are considered the same.

Figure 5-5 Equivalent circular arrangements.

Now if the children stood in a line instead of a circle, we know they can be
arranged in 6! = 720 ways. Let's say that two of these linear arrangements are
"equivalent" if they amount to the same thing when you wrap them in a circle.
(Imagine the children holding hands in their line and then the two children
on the ends holding hands to complete the circle so that "left-to-right" in the
line becomes "clockwise" in the circle.) For example, Table 5-24 shows the
arrangements in a line that all become the same circular arrangement shown
in Figure 5-5. In fact, each equivalence class will consist of six arrangements
in a line, and each equivalence class corresponds to one circular arrangement.
Therefore, the number of circular arrangements is 6!/6 = 120.
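The "wrap the line into a circle" equivalence can also be explored computationally. In the sketch below (our own illustration, with hypothetical helper names), each linear arrangement is replaced by one representative of its class, namely the rotation that comes first in dictionary order, and the distinct representatives are counted.

```python
from itertools import permutations


def canonical_rotation(line):
    """Return the rotation of `line` that is smallest in dictionary order.

    All rotations of a line wrap into the same circle, so this picks one
    fixed representative from each equivalence class.
    """
    return min(line[i:] + line[:i] for i in range(len(line)))


def circular_arrangements(children):
    """Set of class representatives, one per circular arrangement."""
    return {canonical_rotation(p) for p in permutations(children)}


circles = circular_arrangements("ABCDEF")
print(len(circles))  # number of circular arrangements of six children

# The six lines of Table 5-24 all share one representative.
rotations = ["ABCDEF", "BCDEFA", "CDEFAB", "DEFABC", "EFABCD", "FABCDE"]
print({canonical_rotation(tuple(r)) for r in rotations})
```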











Applications of Combinations 


There are many counting problems where the solution can be viewed as choosing
r-element subsets of a given set of size n. We can also combine this idea with the
rules of sums, products, and complements developed in the preceding section.


Example 4 A group consisting of 10 women and 8 men needs to select a five-person
steering committee. Clearly, there are C(18, 5) possible committees. Of these, how
many are possible for each of these situations?


1, The committee contains exactly three women. 
2. The committee contains at least three women. 


3. Jack and Jill refuse to work together, so the committee does not contain both of 
them. 


SOLUTION 


1. We fill the committee in a two-step process: select three women, and then
select two men. There are C(10, 3) ways to do the first step, then C(8, 2) ways





ABCDEF | BCDEFA | CDEFAB | DEFABC | EFABCD | FABCDE

Table 5-24 An Equivalence Class of Linear Arrangements










to do the second step. By the products rule, the total number of committees
is
C(10, 3) · C(8, 2) = 120 · 28
= 3,360.
2. We divide our solution into disjoint sets: those with exactly three women,
those with exactly four women, and those with exactly five women. For each
part, we use reasoning similar to part 1. The total is
C(10, 3) · C(8, 2) + C(10, 4) · C(8, 1) + C(10, 5) · C(8, 0)
= 120 · 28 + 210 · 8 + 252 · 1
= 5,292.
3. There are at least three ways to attack this problem. The details are considered
in Exercise 21.
(a) Divide the list into three disjoint parts: those that have Jack but not Jill,
those that have Jill but not Jack, and those that have neither.
(b) Solve the complementary problem.
(c) Divide the list into two overlapping parts: (i) those that don't have Jack,
and (ii) those that don't have Jill. Then use the "sums rule with overlap."
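Parts 1 and 2 of Example 4 are small enough to verify by listing every committee. The sketch below is our own illustration: the labels W1, ..., W10 and M1, ..., M8 are placeholder names for the ten women and eight men, and `women_count` is a hypothetical helper.

```python
from itertools import combinations
from math import comb

# Placeholder names: ten women and eight men.
women = [f"W{i}" for i in range(1, 11)]
men = [f"M{i}" for i in range(1, 9)]


def women_count(committee):
    """Number of women on a committee (labels starting with 'W')."""
    return sum(1 for person in committee if person.startswith("W"))


committees = list(combinations(women + men, 5))

exactly_three = sum(1 for c in committees if women_count(c) == 3)
at_least_three = sum(1 for c in committees if women_count(c) >= 3)

# Compare the brute-force counts with the products-rule and sums-rule formulas.
print(len(committees), comb(18, 5))
print(exactly_three, comb(10, 3) * comb(8, 2))
print(at_least_three,
      sum(comb(10, w) * comb(8, 5 - w) for w in (3, 4, 5)))
```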








Practice Problem 3 How many of the committees described in Example 4 contain
at least one man?


This type of reasoning easily extends to situations where there are more than
two types of objects to place in the set.


Example 5 If we select a committee of size 10 from a group that contains 25
Democrats, 28 Republicans, and 14 Independents, how many of the possible committees
will have exactly 5 Democrats, 4 Republicans, and 1 Independent?





SOLUTION Apply the product rule to the three-step process: (i) choose the
Democrats, (ii) then the Republicans, and (iii) then the Independents. There
are C(25, 5) · C(28, 4) · C(14, 1) such committees.


We can even apply reasoning about choosing subsets to problems that do not
appear to be about sets. In the second part of the next example, we count ordered lists
using an algorithm where one step involves combinations. This indicates an important
correspondence between sets and ordered lists that will be used a great deal in the
remainder of this chapter.





Example 6 A coin is tossed five times, and the results are recorded as an
ordered list from {H, T} of length 5.


1. How many possible outcomes are there? 
2. Of these, how many contain exactly three heads? 










3. Generalize the previous result to explain why C(5, 0) + C(5, 1) + C(5, 2) +
C(5, 3) + C(5, 4) + C(5, 5) = 2^5.


SOLUTION 


1. By the product rule, there are 2 · 2 · 2 · 2 · 2 = 2^5 = 32 possible outcomes.


2. We number the positions in the list with the numbers 1 to 5, and record
the positions that contain heads. For example, HHTHT corresponds to the
selection {1, 2, 4}. The question can thus be translated to "How many ways
are there to choose the three positions to be occupied by the heads?" This is
the same question as "How many three-element subsets of {1, 2, 3, 4, 5} are
there?" The answer is C(5, 3) = 10.

3. By part 1, there are 2^5 possible outcomes. We count those outcomes a different
way, by dividing them into six disjoint sets: those with no heads, those
with one head, and so on. By reasoning similar to part 2, there are C(5, 0)
with no heads, C(5, 1) with one head, and so on. By the sum rule, the total is
C(5, 0) + C(5, 1) + C(5, 2) + C(5, 3) + C(5, 4) + C(5, 5). Since the two answers
must be the same, this establishes the result.
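The correspondence in part 2 between outcomes and sets of head positions, and the two ways of counting in part 3, can be illustrated with a short enumeration. This is our own sketch; `head_positions` is a hypothetical helper name.

```python
from itertools import product
from math import comb

# All ordered lists of length 5 from {H, T}.
outcomes = list(product("HT", repeat=5))


def head_positions(outcome):
    """The set of positions (numbered from 1) that contain heads."""
    return {i for i, side in enumerate(outcome, start=1) if side == "H"}


three_heads = [o for o in outcomes if len(head_positions(o)) == 3]

# Part 3: split the outcomes into six disjoint sets by number of heads.
by_heads = [sum(1 for o in outcomes if len(head_positions(o)) == k)
            for k in range(6)]

print(head_positions("HHTHT"))
print(len(outcomes), len(three_heads))
print(by_heads, [comb(5, k) for k in range(6)], sum(by_heads))
```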








The Binomial Theorem and Arithmetic Triangle 


Combinations are also important outside the setting of a discrete math course. The
most familiar use of them is probably their role in the so-called binomial theorem.
This simple result explains algebraically how to multiply out the polynomial

(1 + x)^n = (1 + x) · (1 + x) · (1 + x) ··· (1 + x)     (n factors)

Believe it or not, this fact was of central importance to the mathematicians who laid
down the foundations for the development of calculus* in the seventeenth century
and earlier.

In algebra, a binomial is simply a polynomial with two terms. Thanks to the
distributive property, the product of binomials (1 + x) · (1 + x) · (1 + x) expands
algebraically to become

(1 + x) · (1 + x) · (1 + x) = 1·1·1 + 1·1·x + 1·x·1 + x·1·1
                              + 1·x·x + x·1·x + x·x·1 + x·x·x


This expansion can be simplified by combining terms. For example, the terms
x·1·1, 1·x·1, and 1·1·x each simplify to x, so when they are added together,
the result simplifies to 3x. Notice that each term in the resulting sum is formed by
choosing either the 1 or the x from each of the three binomials being multiplied.
Each of the terms in the example above comes from choosing one x from among the
three available x's, so we could have guessed ahead of time that there were going to
be C(3, 1) of these terms to be combined.


* In fact, an early great discovery of Sir Isaac Newton (1643-1727) himself was the generalization of this
fact for values of n other than natural numbers.


              1
            1   1
          1   2   1
        1   3   3   1
      1   4   6   4   1
    1   5  10  10   5   1
  1   6  15  20  15   6   1

Figure 5-6 The first seven rows of the AT.

There is nothing special about this example, of course. This happens for any
number of 1 + x terms being expanded. For example, when the product

(1 + x)^4 = (1 + x) · (1 + x) · (1 + x) · (1 + x)

is expanded to

(1 + x)^4 = 1 + 4x + 6x^2 + 4x^3 + x^4
          = C(4, 0) + C(4, 1)x + C(4, 2)x^2 + C(4, 3)x^3 + C(4, 4)x^4

we see that the coefficient of x^k is C(4, k) for each x^k on the right-hand side.


This general connection between the numbers C(n, k) and the expansion of
(1 + x)^n is the binomial theorem.








Theorem 2 (The Binomial Theorem) The coefficient of the x^k term
in the expansion of (1 + x)^n is C(n, k). More formally,

(1 + x)^n = C(n, 0) + C(n, 1)x + C(n, 2)x^2 + ··· + C(n, n)x^n
          = Σ (k = 0 to n) C(n, k) x^k
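The reasoning behind the binomial theorem is that every term of the expanded product comes from choosing either the 1 or the x from each factor. The sketch below, our own illustration with a hypothetical function name, carries out that choice literally for small n and tallies how many choices produce each power of x.

```python
from collections import Counter
from itertools import product
from math import comb


def expand_by_choices(n):
    """Coefficients of (1 + x)^n, found by listing all 2^n ways to pick
    '1' or 'x' from each of the n factors and counting the x's picked."""
    counts = Counter(choice.count("x") for choice in product("1x", repeat=n))
    return [counts[k] for k in range(n + 1)]


print(expand_by_choices(3))  # coefficients of 1, x, x^2, x^3
print(expand_by_choices(4))

# The tally for x^k should agree with C(n, k).
for n in range(11):
    assert expand_by_choices(n) == [comb(n, k) for k in range(n + 1)]
```

Listing all 2^n choices is only practical for small n; it is meant to mirror the argument, not to compute coefficients efficiently.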











Practice Problem 4 What is the coefficient of x^8 in


(a) The expansion of (1 + x)^10?
(b) The expansion of (1 + x^2)^10?


The binomial theorem is usually accompanied by the more visually appealing
arithmetic triangle (AT), also called Pascal's triangle* or Tartaglia's triangle. The
triangle is simply a table of the values of C(n, k) in the shape of a triangle, so mathematically
it is nothing new for us. However, there are many patterns within the
triangle that might not be apparent without this visual representation. The first few
rows are shown in Figure 5-6.

In order to have the proper correspondence, we refer to the row and entry
numbers starting with 0. That is, the top row is row 0 and the leftmost entry in each
row is entry 0. Using this convention, we see that entry k in row n is the number
C(n, k).








Practice Problem 5 What does the binomial theorem state if we substitute the 
value 1 for the variable x? How can this be interpreted as a fact about the arithmetic 
triangle? 


* It might seem ironic, but we use the term arithmetic triangle out of respect for Blaise Pascal (1623-1662).
After all, this is the term he himself used when he wrote about it in 1653.







When we form the triangle, we are struck by a pattern: When two successive
entries in a row are added, one gets the entry immediately below them. Formally, it
seems that


C(n, k) + C(n, k + 1) = C(n + 1, k + 1)     (*)


For example, we expect that entry 2 in row 5 will be C(4, 2) + C(4, 1) = 6 + 4, and
if we check using our counting formula for C(5, 2), we get C(5, 2) = (5 · 4)/2! = 10, which is
the same thing.

To account for this pattern, we need look no further than the binomial theorem
itself. We know that (1 + x)^5 = (1 + x) · (1 + x)^4, so the binomial theorem tells us
that

Σ (k = 0 to 5) C(5, k) x^k = (1 + x) · (1 + C(4, 1)x + C(4, 2)x^2 + C(4, 3)x^3 + x^4)
                           = (1 + x) · (1 + 4x + 6x^2 + 4x^3 + x^4)
                           = 1 + (4 + 1)x + (6 + 4)x^2 + (4 + 6)x^3 + (1 + 4)x^4 + x^5





By comparing the coefficients on the left-hand side of the long equation above
with those on the right-hand side, we see the pattern clearly. For example, the
coefficient of x^2 on the left is C(5, 2), while the coefficient of x^2 on the right is
C(4, 2) + C(4, 1). This means that the additive property of forming the arithmetic
triangle can be accounted for by the way we multiply polynomials.
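The link between multiplying by (1 + x) and adding neighboring entries can be made concrete. In this sketch (our own; the function names are hypothetical), a row of the triangle is stored as a list of polynomial coefficients, and the next row is produced by multiplying that polynomial by (1 + x).

```python
from math import comb


def multiply_by_one_plus_x(coeffs):
    """Multiply the polynomial with coefficient list `coeffs` by (1 + x).

    Each coefficient c of x^k contributes c to x^k (from the 1) and
    c to x^(k+1) (from the x), which is the addition rule (*).
    """
    result = [0] * (len(coeffs) + 1)
    for k, c in enumerate(coeffs):
        result[k] += c
        result[k + 1] += c
    return result


def arithmetic_triangle(rows):
    """Rows 0 through rows - 1 of the arithmetic triangle."""
    triangle = [[1]]
    while len(triangle) < rows:
        triangle.append(multiply_by_one_plus_x(triangle[-1]))
    return triangle


triangle = arithmetic_triangle(7)
for row in triangle:
    print(row)

# Entry k of row n should be C(n, k).
assert all(row == [comb(n, k) for k in range(n + 1)]
           for n, row in enumerate(triangle))
```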





Practice Problem 6 Multiply (1 + x)^5 = 1 + 5x + 10x^2 + 10x^3 + 5x^4 + x^5 by
(1 + x), and compare the resulting coefficients to row 6 of the arithmetic triangle.


Another way to account for patterns in the arithmetic triangle is with our old 
friend, mathematical induction, 


Example 7 Show that the following is true for all n ≥ 2:
C(2, 2) + C(3, 2) + C(4, 2) + ··· + C(n, 2) = C(n + 1, 3)


SOLUTION Let P(n) be the statement "C(2, 2) + C(3, 2) + C(4, 2) + ··· +
C(n, 2) = C(n + 1, 3)." The first statement P(2) simply states, "C(2, 2) =
C(3, 3)," which is certainly true. Now let m > 2 be given such that
P(2), P(3), ..., P(m − 1) have all been checked to be true, and we are ready to
consider statement P(m).

(C(2, 2) + C(3, 2) + ··· + C(m − 1, 2)) + C(m, 2)
= C(m, 3) + C(m, 2)     by P(m − 1)
= C(m + 1, 3)     by (*)


This shows that statement P(m) is true, completing the induction.
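A numerical check cannot replace the induction, but it is a quick way to catch a misstated identity before trying to prove it. The sketch below is our own; `column_sum` is a hypothetical helper that adds C(k, k) + C(k + 1, k) + ··· + C(n, k), with k = 2 matching Example 7.

```python
from math import comb


def column_sum(n, k=2):
    """C(k, k) + C(k + 1, k) + ... + C(n, k)."""
    return sum(comb(j, k) for j in range(k, n + 1))


# Statement P(n) of Example 7 for n = 2, ..., 49.
assert all(column_sum(n) == comb(n + 1, 3) for n in range(2, 50))

print(column_sum(2), comb(3, 3))
print(column_sum(6), comb(7, 3))
```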







Practice Problem 7 Use mathematical induction to prove that the following is
true for all n ≥ 4:


C(4, 4) + C(5, 4) + ··· + C(n, 4) = C(n + 1, 5)


We conclude this introduction to the arithmetic triangle with a magic trick at- 
tributed to Harry Lorayne in [25]. 


Example 8 A spectator chooses five cards, each with a value from 1 (ace) to 10 (no face
cards), and lays them in a face-up row. The magician then removes a card and places it
face-down well above the other cards. The spectators are given the following instructions:


• Form a row of four face-up cards above those that are there by adding the values
of two consecutive cards in the row of five and subtracting 10 whenever the sum
exceeds 10. Find a card that has this value and place it above and between the two
cards.


• Use the same rule to form a row of three face-up cards and then a row of two face-up
cards.


• When the rule is applied one last time, the performer turns over his selected card to
reveal that it is the correct value for the top of the triangle.
How does the magician determine which card to place at the top?
SOLUTION If the cards are A, B, C, D, and E, the prediction is determined
by computing

A + E + 4 × (B + D − C)

with appropriate subtraction of the 10's when this expression exceeds 10.

For example, if the initial cards are 1, 4, 3, 5, and 7, the performer computes

1 + 7 + 4 × (4 + 5 − 3) = 32


which means his prediction is 2. The spectator obediently forms the row 5, 7, 8,
2 according to the given rules, followed by the row 2, 5, 10, and then the row 7,
5. At this point it is clear that the top card should be a 2, which the performer
reveals to be the case. But why does this work?
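Before looking for the reason, one can at least confirm that the prediction formula never fails. The sketch below is our own simulation with hypothetical function names. One assumption goes beyond the text: when A + E + 4 × (B + D − C) is zero or negative, the code adds 10's instead of subtracting them, so that the prediction is always a card value from 1 to 10.

```python
from itertools import product


def to_card_value(total):
    """Shift `total` by multiples of 10 into the range 1..10.

    For totals above 10 this is the text's 'subtract 10' rule; for totals
    of 0 or less it adds 10's instead (our assumption).
    """
    return (total - 1) % 10 + 1


def next_row(row):
    """Row formed by adding each pair of consecutive cards."""
    return [to_card_value(a + b) for a, b in zip(row, row[1:])]


def top_card(row):
    """Apply the rule until a single card remains."""
    while len(row) > 1:
        row = next_row(row)
    return row[0]


def predict(a, b, c, d, e):
    """The magician's prediction A + E + 4 x (B + D - C), reduced to 1..10."""
    return to_card_value(a + e + 4 * (b + d - c))


def trick_always_works():
    """Check the prediction against every possible row of five cards."""
    return all(predict(*cards) == top_card(list(cards))
               for cards in product(range(1, 11), repeat=5))


print(next_row([1, 4, 3, 5, 7]))
print(top_card([1, 4, 3, 5, 7]), predict(1, 4, 3, 5, 7))
print(trick_always_works())
```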


Solutions to Practice Problems 





1. Instead of giving a table that includes all 60 of the permutations
counted by P(5, 3), we will focus on the equivalence
classes. For example, the permutations in

{135, 153, 315, 351, 513, 531}

would all be the same if they were sets (and to no other
permutation would this apply), so this is an equivalence
class. The fact that there are six items in this class is
no accident since we know there are 3! arrangements
of any three distinct objects. Hence, every equivalence
class consists of six permutations, and each equivalence
class corresponds to only one set ({1, 3, 5} in the example).
Therefore, the number of three-element subsets of
{1, 2, 3, 4, 5} is 60/6 = 10.

2. If we were creating a flush on purpose, we could do it
with the following two-step algorithm:

• Choose a suit.

• Choose five cards from this suit for your hand.

Since there are four ways to complete the first step and
C(13, 5) = (13 · 12 · 11 · 10 · 9)/5! = 1,287 ways to complete the
second step, by the products rule there are a total of
(4)(1,287) = 5,148 flushes possible.

3. Although we could find the sum of the number with one
man, the number with two men, ..., and the number
with five men, it is easier to solve the complementary
problem. The solution then is the difference between
the total number of committees and the number with no
men. This is C(18, 5) − C(10, 5) = 8,568 − 252 = 8,316.

4. (a) (1 + x)^10 = C(10, 0) + C(10, 1)x + C(10, 2)x^2 +
C(10, 3)x^3 + ···, so the coefficient of x^8 is
C(10, 8) = 45.

(b) (1 + x^2)^10 = C(10, 0) + C(10, 1)(x^2) + C(10, 2)(x^2)^2
+ C(10, 3)(x^2)^3 + ···. Since (x^2)^4 = x^8, the coefficient
of x^8 in this expansion is C(10, 4) = 210.

5. If x = 1, the binomial theorem states

2^n = C(n, 0) + C(n, 1) + C(n, 2) + ··· + C(n, n)

which shows up in the arithmetic triangle as the fact that
the sum of the entries in row n is 2^n.

6. Multiplying 1 + 5x + 10x^2 + 10x^3 + 5x^4 + x^5 by (1 + x)
results in

1 + (1 + 5)x + (5 + 10)x^2 + (10 + 10)x^3 + (10 + 5)x^4
+ (5 + 1)x^5 + x^6

which has coefficients 1, 6, 15, 20, 15, 6, 1, identical to
row 6 of the arithmetic triangle.

7. Let P(n) be the statement "C(4, 4) + C(5, 4) +
C(6, 4) + ··· + C(n, 4) = C(n + 1, 5)." The first statement
P(4) simply states, "C(4, 4) = C(5, 5)," which
is certainly true. Now let m > 4 be given such that
P(4), P(5), ..., P(m − 1) have all been checked to be
true, and consider the next statement P(m).

(C(4, 4) + C(5, 4) + ··· + C(m − 1, 4)) + C(m, 4)
= C(m, 5) + C(m, 4)     by P(m − 1)
= C(m + 1, 5)     by the Pascal triangle pattern

This shows that P(m) is true, completing the induction.


Exercises for Section 5.3


1. (a) List all the permutations of length 2 from the set
{a, b, c, d, e}.

(b) Organize the permutations into equivalence classes
for the equivalence relation "would be the same if
they were sets."

(c) How big is each equivalence class? How many are
there?

(d) How many subsets of {a, b, c, d, e} have two elements?

2. (a) How many permutations of length 3 from the set
{a, b, c, d, e} are there?

(b) For the permutation acd of length 3, list all the elements
of its equivalence class for the equivalence
relation "would be the same if they were sets."

(c) Repeat part (b) for the permutation dae.

(d) How big is each equivalence class formed as in parts
(b) and (c)?

(e) How many subsets of {a, b, c, d, e} have three elements?

3. Refer to Example 3.

(a) List all elements in the equivalence class for the
following order of the children in a line: ACDBFE.

(b) List all elements in the equivalence class for the
following order of the children in a line: ADFBCE.

4. Four people (Al, Betty, Cindy, and Dan) are to be seated
at a round table.

(a) List all the possible orderings of the four names,
organized into equivalence classes by the relation
"the same if placed in a circle."
(b) How many ways are there to accomplish the seating?

5. In how many ways can four married couples be seated at
a round table so that spouses sit together? (Hint: Do
this in two steps. Place the couples' surnames in a circle,
then for each couple decide whether the husband sits
to the left or the right of his wife.)

6. How many ways are there for five married couples to
stand in a circle with the condition that all spouses are
next to each other?

7. In how many ways can four boys and four girls stand in
a circle if they must alternate boy-girl-boy-girl?

8. In how many ways can four boys and four girls stand
in a circle if all the boys stand together and all the girls
stand together?

9. How many subsets of {1, 3, 5, 7, 9} have two elements?

10. How many subsets of {1, ..., 6} have an odd number of
elements?

11. In how many ways can six shuttle vans line up at the
airport?

12. How many variations in first-, second-, and third-place
finishes are possible in a 100-yard dash with six runners?













13. How many committees of three people can be formed
from a club with 17 members?

14. On a TV show, there are eight contestants. One of them
(call her Jill) has won a prize dinner, and is allowed to
select three of the others to join her. How many ways
are there for her to make her selection?

15. The 9 men and 12 women in the Math Club need to
form a fundraising committee.

(a) How many possible committees of four people can
be formed?

(b) How many of these four-person committees have
the same number of women as men?

(c) How many four-person committees have more
women than men?

16. This problem refers to the Math Club from the previous
problem. Jill (a woman) is the president, and Jack
(a man) is the vice president. Answer each of the questions
above with the added condition that Jack or Jill
but not both must serve on the committee.

17. A bag contains a dozen oranges, two of which are rotten.
A sample of three oranges is taken from the bag.

(a) In how many ways can the sample be taken (how
many different samples are there)?

(b) Of these, how many contain exactly one rotten orange?

(c) How many contain exactly two rotten oranges?

(d) How many contain no rotten oranges?

18. Suppose a shipment of 100 computers contains four defective
computers, and we choose a sample of six computers.

(a) How many different samples are there?

(b) Of these, how many samples contain all four defective
computers? What percent of the total does this
represent?

(c) How many samples contain one or more defective
computers? What percent of the total does this represent?

19. You conduct an experiment in which you interview a
large number of families, each of which has eight children.
For each family, you write down the gender (M
for male, F for female) for the eight children, in order
from oldest to youngest.

(a) How many possible results are there?

(b) Of these, how many consist of either all M's or all
F's?

(c) How many have exactly three M's?

20. You are tracking your favorite baseball player by writing
down his performance in 10 successive plate appearances
using an ordered list of length 10.




(a) If you use H for any time he gets a hit and N when
he doesn't, how many possible results are there?

(b) Of the total, how many include exactly six hits?

(c) Of the total, how many of them include fewer than
three hits?

21. Finish part 3 of Example 4. That is, use each of the three
suggested strategies, and verify that all give the same
numerical answer.

22. Three members (Mary, Sue, and Tom) of a 20-person
office are carpooling, so they insist on never working
separately. That is, whenever one of them is on a committee,
all three must be. How many committees of size
7 meet this requirement?

23. How many committees of five men and four women can
be formed from an organization with 43 women and 47
men?

24. Suppose in the previous scenario that among the members
there are 20 married couples and there is a rule
prohibiting spouses from serving on the same committee.
How many five-man, four-woman committees can
be formed under this new restriction?

25. In a standard deck of 52 cards, how many five-card hands
have at least one king?

26. There are five red, three green, and eight blue marbles
in a box.

(a) In how many ways can a sample of four be selected,
without replacement and without regard to order?
(That is, reach in and grab four marbles at once.)

(b) Of these, how many have all four selected the same
color?

(c) How many have two colors, with two of each color?

27. A committee of size 6 is being formed from a group of
10 Republicans, 8 Democrats, and 4 Independents.

(a) How many different committees are possible?

(b) Of these, how many have exactly two of each political
persuasion?

28. Your friend spins a spinner with three colors (red, green,
blue), and you record the results on six consecutive
spins.

(a) How many possible results are there?

(b) Of these, how many have exactly two of each color?

29. One game in the Pennsylvania Lottery requires the purchase
of a ticket that has 11 different numbers taken
from {1, 2, 3, ..., 79, 80}. The order of the numbers does
not matter.

(a) How many different lottery tickets are there?

(b) Once a week, the lottery draws seven different numbers
from a rolling barrel of ping pong balls numbered
1 through 80. How many different lottery
tickets can contain all seven winning numbers?








wg 








inatorics 


ratio of possible tickets to possible 

1 

e-digit numbers have distinct digits 
of the digit 9? How many of these 
decreasing order? 

ficient of z° in the expansion of 


ficient of i^ in the expansion of 
the coefficient of r* in this same cx- 


f every other entry in row n of the 
? What does the binomial theorem 
How are these questions related? 
lowing problem for some small val- 
explain the general statement based 
corem, 


ficient of (Ls) in the expansion 





(enr 
h 
tical induction that for all n = 0, 
1) * C(r. 2) Cin n) = 2" 
tical induction that for all n > 0, 
£€ (0,2) € 3:C(n3) ove 
)e n.a 





tical induction that for all > 1, 
C(n.2) &3:Cin 3) c 

"= nxi +y 

ting about differential calculus, show 


| Exercise 37 follows from the bino- 
s the idea of derivative, 


39. What entry in row 7 is the same as C(5, 2) · C(2, 0) +
C(5, 1) · C(2, 1) + C(5, 0) · C(2, 2)? (Think about the
binomial theorem applied to all parts of the true equation
(1 + x)^5 (1 + x)^2 = (1 + x)^7 to see why this is
true.)

40. Generalize the previous problem to give a general description
of the "multiplication" of two rows of the triangle.

41. Another true statement about polynomials is that

x · (1 + (1 + x) + (1 + x)^2 + ··· + (1 + x)^n)
= (1 + x)^(n+1) − 1

(a) Explain why the above equation is true of polynomials.

(b) Compare the coefficients of x^2 on either side of this
equation, and explain why this proves

C(1, 1) + C(2, 1) + C(3, 1) + ··· + C(n, 1)
= C(n + 1, 2)

(c) What fact about the arithmetic triangle is explained
by comparing coefficients of x^3 in this same polynomial
identity?

(d) State the general rule at work in the last two exercises
as a pattern in the arithmetic triangle. (Draw
a picture.)

42. Let k be a fixed positive integer. Prove the following
statement by induction on n ≥ k:

C(k, k) + C(k + 1, k) + C(k + 2, k) + ··· + C(n, k)
= C(n + 1, k + 1)

Draw a picture to illustrate this statement in the arithmetic
triangle.

43. Use the mod 10 arithmetic system from Section 2.7 to
explain the secret of the card trick in Example 8.

5.4 Binary Sequences 





One of the fundamental building blocks in combinatorics is the binary sequence. We
have already seen that the number of binary sequences of length n is 2^n, and we have
enough tools at hand to count the number of binary sequences with any prescribed
number of 0's and 1's.


Example 1 How many binary sequences are there with exactly five 1's and three 0's?


SOLUTION We are trying to count ordered lists, but it will not work to simply
place the eight symbols left-to-right. Instead, think of eight blank spaces as
shown below:

__  __  __  __  __  __  __  __
1   2   3   4   5   6   7   8

Now the algorithm is only two steps long: (i) Choose five of the eight spaces
and put the five 1's there, and then (ii) put the three 0's in the remaining three
spaces. There are therefore C(8, 5) × C(3, 3) = C(8, 5) such binary sequences.

Alternatively, you could (i) choose three of the eight spaces and put the
three 0's there, and (ii) put the five 1's in the remaining five spaces. This
algorithm then produces C(8, 3) × C(5, 5) = C(8, 3) binary sequences. Note that
we have implicitly shown that C(8, 5) = C(8, 3) since they are both answers to
the same problem!














Theorem 1 The number of binary sequences with r 1's and n − r 0's is C(n, r)
or C(n, n − r).


PROOF This is a generalization of Example 1.
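The two-step algorithm of Example 1 translates directly into code: choose the positions for the 1's, then fill every other position with 0. The following sketch is our own illustration; `sequences_with_ones` is a hypothetical name, and positions are numbered from 0 here to match Python indexing.

```python
from itertools import combinations, product
from math import comb


def sequences_with_ones(n, r):
    """All binary sequences of length n with exactly r 1's.

    Step (i): choose r of the n positions for the 1's.
    Step (ii): put 0's in the remaining n - r positions.
    """
    sequences = []
    for positions in combinations(range(n), r):
        symbols = ["0"] * n
        for p in positions:
            symbols[p] = "1"
        sequences.append("".join(symbols))
    return sequences


five_ones = sequences_with_ones(8, 5)
print(len(five_ones), comb(8, 5), comb(8, 3))

# Independent check: filter all 2^8 binary sequences of length 8.
filtered = ["".join(s) for s in product("01", repeat=8) if s.count("1") == 5]
print(sorted(five_ones) == sorted(filtered))
```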





Practice Problem 1 Use Theorem 1 along with the sums rule to find the number
of binary sequences of length 8 that use an odd number of 1's.


The same type of algorithm that generates binary sequences can be used to solve
other types of counting problems.

Example 2


1. How many ordered lists of 10 letters chosen from {m, a, t} have exactly three m's?
2. Of these, how many also have exactly four a's?


SOLUTION 





1. Again, we are trying to count ordered lists with repetitions allowed, but we
have the same peculiar restriction: we do not have an unlimited number of
each letter. We must use the letter m exactly three times, so it will not do
to place a first letter, a second letter, and so on. Instead, imagine 10 blank
spaces and perform the following two-step algorithm:

__  __  __  __  __  __  __  __  __  __
1   2   3   4   5   6   7   8   9   10

Step 1. Choose three of the 10 spaces and put the three m's in these spaces.

Step 2. Fill the remaining seven spaces from left to right with letters chosen
from {a, t}.

For example, if we choose spaces {2, 4, 9} in the first step, we have

__  m   __  m   __  __  __  __  m   __
1   2   3   4   5   6   7   8   9   10

There are now seven empty slots to fill with a's and t's, and this would be
true no matter which spaces were chosen in the first step. Reasoning as in
Example 1, we see that there are C(10, 3) ways to do the first step of the
algorithm. The second step corresponds to selecting an ordered list of length
7 from a set of two elements, for which the count is 2 · 2 · 2 · 2 · 2 · 2 · 2 = 2^7.
By the rule of products, there are C(10, 3) · 2^7 = 15,360 ways to complete
the two-step process.

2. The algorithm is very similar.

Step 1. Choose three of the 10 spaces and put the three m's in these spaces.
Step 2. Choose four of the remaining seven spaces and place the four a's
in those spaces.
Step 3. Finally, place t's in the remaining three spaces.

For example, if we choose {2, 4, 9} in the first step, and {1, 5, 6, 10} in the
second step, we have

a   m   __  m   a   a   __  __  m   a
1   2   3   4   5   6   7   8   9   10

After placing the t's, we have the ordered list amtmaattma.

Again, there are C(10, 3) ways to do the first step. In the second step we
choose four of the seven remaining spaces, and there are C(7, 4) ways to do
this. There is only 1 · 1 · 1 = 1^3 = 1 way to do the third step. The total count
is C(10, 3) · C(7, 4) · 1 = 4,200.
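Both counts in Example 2 can be confirmed by filtering all 3^10 = 59,049 ordered lists of length 10 from {m, a, t}. This sketch is our own illustration; the variable names are hypothetical.

```python
from itertools import product
from math import comb

three_m = 0          # lists with exactly three m's
three_m_four_a = 0   # of those, lists that also have exactly four a's

for letters in product("mat", repeat=10):
    if letters.count("m") == 3:
        three_m += 1
        if letters.count("a") == 4:
            three_m_four_a += 1

# Compare with the counts produced by the two algorithms in the solution.
print(three_m, comb(10, 3) * 2 ** 7)
print(three_m_four_a, comb(10, 3) * comb(7, 4))
```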








Example 3 How many different arrangements of the letters in the word
MISSISSIPPI are there?
SOLUTION An arrangement of the letters in this word will be an ordered
list of length 11, chosen from the letters {M, I, S, P} that make up the word,
with the restriction that there must be one M, four I's, four S's, and two P's. We
imagine 11 empty spaces, and perform a four-step algorithm.
Step 1. Choose one of the 11 spaces, and put the M in this space.
Step 2. Choose four of the remaining 10 spaces, and put the four I's in
these spaces.
Step 3. Choose four of the remaining six spaces, and place the four S's in
these spaces.
Step 4. Finally, place the 2 P's in the remaining two spaces.
For example, if we choose space {4} in the first step, and then spaces
{2, 3, 8, 10} in the second step and {1, 6, 7, 11} in the third step, we will have

S   I   I   M   __  S   S   I   __  I   S
1   2   3   4   5   6   7   8   9   10  11

With only the P's left to place, we will have the arrangement SIIMPSSIPIS
after all four steps.

The analysis of this algorithm is simple. There are C(11, 1) ways to do the
first step, C(10, 4) ways to do the second step, C(6, 4) ways to do the third step,
and C(2, 2) ways to do the fourth step. So the final answer to this question is
the product

C(11, 1) · C(10, 4) · C(6, 4) · C(2, 2) = (11)(210)(15)(1) = 34,650
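The four-step algorithm generalizes to any word: place one letter type at a time, each time choosing spaces from those still empty. The sketch below is our own; `count_arrangements` is a hypothetical helper. Because listing all orderings of an 11-letter word is slow, the brute-force comparison uses the shorter word BANANA, which is our choice and not from the text.

```python
from collections import Counter
from itertools import permutations
from math import comb


def count_arrangements(word):
    """Number of distinct arrangements of the letters of `word`.

    For each letter type in turn, choose which of the remaining empty
    spaces receive its copies, and multiply the counts (products rule).
    """
    remaining = len(word)
    total = 1
    for copies in Counter(word).values():
        total *= comb(remaining, copies)
        remaining -= copies
    return total


print(count_arrangements("MISSISSIPPI"))

# Brute-force check on a short word: list every ordering, drop duplicates.
print(count_arrangements("BANANA"), len(set(permutations("BANANA"))))
```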

Practice Problem 2
(a) Show that the answer to the preceding example does not depend on the particular
order in which the letters are placed by using an algorithm that places first the P's,
then the S's, then the M, and finally the I's.
(b) How many ways are there to arrange the letters in the word ALABAMA?


Unordered Lists with Repetition Allowed 


We can also apply binary sequences to find a counting formula for the last of the four 
types of finite structures described at the beginning of this chapter, unordered lists 
in which repetition is allowed. The prototypical example of this structure is a bag of 
grocery items, where there may be several of the same type of object (like apples) 
and the order in which the objects are placed in the bag is irrelevant. Because of this 
analogy, unordered lists with repetition allowed are also known as bags. 

We will count these structures by representing them with binary sequences and 
then applying the knowledge about binary sequences we have developed in this 
section. It is best to look first at a simple example, so we begin by revisiting an idea 
we first encountered in Section 5.1 


Example 4 Without answering either question, explain why these two questions
have the same answer.


(i) How many ways can one fill a bag with 10 pieces of fruit at a store that sells only
apples, bananas, and peaches? (We are assuming that the store has at least 10 of 
each type of fruit available.) 

(ii) How many solutions in nonnegative integers are there to the following equation? 


a+b+c=10 


SOLUTION To show that questions (i) and (ii) have the same answer, we can
represent a bag of fruit (counted by (i)) as a list of three numbers a, b, c in
which a represents the number of apples, b the number of bananas, and c the
number of peaches. In every such list, a, b, and c will be nonnegative integers
that sum to 10. This sets up a one-to-one correspondence between bags of fruit
and solutions to the equation. Here are a couple of specific examples of this
correspondence.

• A bag with four apples, one banana, and five peaches corresponds to the
equation 4 + 1 + 5 = 10.

• The solution 8 + 0 + 2 = 10 corresponds to a bag with eight apples, no
bananas, and two peaches.


Example 5 Without answering either question, explain why these two questions
have the same answer.


(2, 1, 7) ↔ 001010000000
(1, 7, 2) ↔ 010000000100
(5, 0, 5) ↔ 000001100000
(3, 7, 0) ↔ 000100000001
...
(0, 10, 0) ↔ 100000000001

Figure 5-7 The correspondence in Example 5.


(ii) How many solutions in nonnegative integers are there to the following equation?
a + b + c = 10


(iii) How many binary sequences of length 12 have exactly two 1's and ten 0's?


SOLUTION Set up a correspondence by linking a solution (a, b, c) to the
equation in (ii) to the binary sequence made of a 0's, then a 1, then b more 0's,
then another 1, and finally c 0's.


(a, b, c) ↔ 0...0 1 0...0 1 0...0   (blocks of a, b, and c zeros, respectively)


Since a + b + c = 10, the resulting sequence will be of length 12, and will contain
exactly two 1's and ten 0's. For example, part of this correspondence is shown
in Figure 5-7. This "linking" is a one-to-one correspondence.
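The linking described in this solution can be written out as a pair of small functions. This is only an illustrative sketch; the names encode and decode are hypothetical and do not come from the text.

```python
from math import comb

def encode(solution):
    """Blocks of 0's whose lengths are the entries of the solution, separated by single 1's."""
    return "1".join("0" * part for part in solution)

def decode(bits):
    """Recover the solution by measuring the blocks of 0's between the 1's."""
    return tuple(len(block) for block in bits.split("1"))

# Every nonnegative solution of a + b + c = 10.
solutions = [(a, b, 10 - a - b) for a in range(11) for b in range(11 - a)]
codes = {encode(s) for s in solutions}

print(encode((2, 1, 7)))
print(decode("100000000001"))
print(len(solutions), len(codes), comb(12, 2))
```

Distinct solutions give distinct sequences, and decoding reverses encoding, which is what makes the linking a one-to-one correspondence.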


The significance of these correspondences should now be clear. By Theorem 1,
we know that the answer to question (iii) is C(12, 2). Therefore, this is also the
number of solutions described in question (ii), and hence it is also the number of
bags described by question (i).

We summarize the more general connection between these three counting problems
in the following lemma.





Lemma 2 The following three questions have the same answer for all natural
number values of n and r:


• How many unordered lists with repetitions allowed (bags) of size r with entries from
a set of size n are there?

• How many nonnegative integer solutions to the equation x_1 + x_2 + ... + x_n = r are
there?

• How many binary sequences of length r + n - 1 with exactly r 0's are there?


Practice Problem 3 For the particular questions in Examples 4 and 5, illustrate 
the correspondences between the bags of fruit, the equation solutions, and the binary 
sequences by filling in the missing entries in Table 5-25. 


This allows us to see the power of the one-to-one correspondence in combinatorics.
Because we know how to answer one of the three questions in Lemma 2, we
know how to answer all three.


























Bag of Fruit | Equation | Binary Sequence
1 apple, 3 bananas, 6 peaches | |
| 4 + 2 + 4 = 10 |
| | 100100000000
| | 000000000011

Table 5-25 Table for Practice Problem 3











Theorem 3 Let natural numbers n and r be given.


1. The number of solutions to the equation x_1 + x_2 + ... + x_n = r using nonnegative
integers is C(r + n - 1, r).

2. The number of unordered lists of length r taken from a set of size n, with repetitions
allowed, is C(r + n - 1, r).

3. The number of bags of r pieces of fruit that can be bought at a store with n types of
fruit available is C(r + n - 1, r).


PROOF This simply combines Lemma 2 with Theorem 1.
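For small values of n and r, the formula in Theorem 3 can be compared against a direct enumeration. The sketch below is a sanity check rather than a proof; the helper name count_bags is hypothetical, and it relies on the standard library's combinations_with_replacement, which generates exactly the unordered lists with repetition allowed.

```python
from itertools import combinations_with_replacement
from math import comb

def count_bags(n, r):
    """Count unordered lists of length r from a set of size n by listing them all."""
    return sum(1 for _ in combinations_with_replacement(range(n), r))

for n in range(1, 6):
    for r in range(1, 7):
        assert count_bags(n, r) == comb(r + n - 1, r)

print(count_bags(3, 10), comb(10 + 3 - 1, 10))
```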


Applications of Theorem 3 


Many people find the counting formula derived in Theorem 3 difficult to use in specific
situations, so we will give some practical advice for a variety of problems. Note that
it is perfectly correct to simply use Theorem 3 as it is written, but if you have trouble
remembering it, you should consider our recommendations for answering these types
of questions.





Example 6 How many bags of 20 pieces of candy can be bought at a store
that sells four types of candy?


SOLUTION While it is possible to match this up directly with Theorem 3, it
is easier to relate it to more fundamental facts as follows:
Step 1. To what nonnegative integer equation do these bags of candy
correspond?
Solution. If we let w, x, y, and z represent the number of pieces
of the four types of candy, the equation would be w + x + y + z = 20.
Step 2. Choose a particular solution to this equation, and draw a picture of
a binary sequence that corresponds to this solution. To remember
how to do this, imagine a string of twenty 0's that must be broken
into four parts with sizes w, x, y, and z, respectively. To partition
the string of twenty 0's into four parts, insert three 1's as dividers
within the string so that w 0's occur before the first 1, x 0's occur
between the first and second 1's, and so on.
Solution. Using this idea, we see that the solution w = 3, x = 0,
y = 10, z = 7 corresponds to the binary sequence

00011000000000010000000

Step 3. What are the length and the number of 1's in this binary sequence?
How many binary sequences are there with these attributes?
Solution. This sequence has length 23 with twenty 0's and three
1's. There are a total of C(23, 20) or, equivalently, C(23, 3) of these.








So the correct first step in a problem of this type is to find a representation of
the objects being counted as solutions to a nonnegative integer equation. Sometimes
this objective requires some creativity, as the remaining examples illustrate.














Example 7 How many bags of 10 pieces of fruit can be purchased at a store
that carries apples, bananas, peaches, and pears if we insist on getting at least one of 
each type of fruit? 


SOLUTION We can form such a bag of fruit in two steps: (i) Put one of each
type of fruit into the bag, and (ii) put in six more pieces of any type. There's only
one way to do the first step. For the second step, we can let a, b, c, and d represent
the number of additional apples, bananas, peaches, and pears, respectively.
Clearly, a + b + c + d = 6 and the variables can be any nonnegative integers,
so these solutions can be represented as strings of six 0's, into which three 1's
have been placed as dividers. There are a total of C(6 + 4 - 1, 6) such binary
sequences. Hence, there are C(9, 6) = 84 bags of this type.
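The two-step argument can be compared with a direct count of the bags that contain at least one of each fruit. This is a brute-force sketch for illustration only; the variable names are hypothetical.

```python
from itertools import product
from math import comb

# A bag is recorded as (apples, bananas, peaches, pears), each at least 1.
direct_count = sum(
    1 for bag in product(range(1, 11), repeat=4) if sum(bag) == 10
)

# Two-step count: one of each fruit first, then six more pieces of any type.
two_step_count = comb(6 + 4 - 1, 6)

print(direct_count, two_step_count)
```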





Note that because of the original analogy between bags and solutions to equa- 
tions, this same problem could be presented in the following different way. 


Example 8 How many positive integer solutions are there to the equation
w + x + y + z = 10?
SOLUTION The problem has the same answer, C(9, 6), as Example 7.
Practice Problem 4 Use the idea in Example 7 to answer the following question:


"How many nonnegative integer solutions are there to the equation w + x + y + z = 10
if w is required to be at least 3 and x must be positive?"





Our final application comes from games that involve rolling dice. We consider
these partly because the next chapter focuses on probabilities involved in games, and
many games use dice to generate their moves. This kind of problem also illustrates
that some very natural questions have some surprisingly difficult answers.





Example 9 How many different outcomes are possible if I toss a regular six-sided
die four times? In how many of these do the four rolls sum to 14?
SOLUTION We can represent an outcome as an ordered list from {1, ..., 6}
of length 4; that is, as x_1, x_2, x_3, x_4, where x_i is the result of the i-th toss of the
die. There are 6^4 = 1,296 of these.
To answer the second question, we think of the above representation of the
four rolls of the die; then the condition that the sum is 14 corresponds to having
x_1 + x_2 + x_3 + x_4 = 14. So if we can say how many solutions (x_1, x_2, x_3, x_4) there
are to this equation with the condition that each x_i is from {1, ..., 6}, then this
will answer the original question.
Let's first think about how many solutions to x_1 + x_2 + x_3 + x_4 = 14 there
are with only the condition that each x_i ≥ 1. Example 8 essentially showed that
this is the same thing as the number of nonnegative integer solutions to the
equation x_1 + x_2 + x_3 + x_4 = 10, which according to Theorem 3 is
C(10 + 4 - 1, 10) = 286.
Now back to the original problem; that is, we wish to have each x_i ≤ 6 too.
In the above analysis, we counted many solutions that could not correspond




















to dice rolls. For example, we counted (1, 1, 3, 9) since it corresponds to the
following binary sequence with fourteen 0's and three 1's.

01010001000000000

The strategy we will use will be to subtract from the total number 286 the
number of solutions that do not correspond to die rolls. So we wish to count
the number of solutions to x_1 + x_2 + x_3 + x_4 = 14 (with each x_i ≥ 1) that have
either x_1 ≥ 7 or x_2 ≥ 7 or x_3 ≥ 7 or x_4 ≥ 7. Note that it is impossible to have, say,
x_1 ≥ 7 and x_2 ≥ 7 at the same time in a solution to x_1 + x_2 + x_3 + x_4 = 14, so
the "bad" solutions (i.e., those not corresponding to die rolls) can be separated
into four types, depending on which of the four quantities is too big.

Now how many solutions to x_1 + x_2 + x_3 + x_4 = 14 (with each x_i ≥ 1) are
there in which x_1 ≥ 7? To answer this, it is easiest to think about this as a problem
about bags of fruit. This is equivalent to asking "How many bags of 14 pieces
of fruit can be bought from a store that carries apples, bananas, cantaloupes, and
peaches, if we insist on getting at least one of each type of fruit and we must
get at least seven apples?" Example 7 suggests that we form these bags in two
steps: First, place seven apples, one banana, one cantaloupe, and one peach into
the bag; second, place four more pieces freely into the bag. There is only one
way to do the first step, and by Theorem 3 there are C(4 + 4 - 1, 4) ways to do
the second step, so there are C(7, 4) = 35 total bags of this type.

This means that there are 35 solutions to x_1 + x_2 + x_3 + x_4 = 14 (with each
x_i ≥ 1) in which x_1 ≥ 7. Similarly, there are also 35 with x_2 ≥ 7, 35 with x_3 ≥ 7,
and 35 with x_4 ≥ 7, giving us a total of 4 × 35 = 140 "bad" solutions.
Therefore, there are 286 - 140 = 146 solutions with each x_i from {1, ..., 6},
and hence 146 ways that the die rolls can sum to 14.
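The subtraction argument can be checked against a direct enumeration of all 6^4 outcomes. The following is a small sketch under that brute-force assumption; the variable names are illustrative only.

```python
from itertools import product
from math import comb

all_solutions = comb(10 + 4 - 1, 10)     # solutions with each x_i >= 1
bad_solutions = 4 * comb(4 + 4 - 1, 4)   # some x_i >= 7 (never two at once)
by_subtraction = all_solutions - bad_solutions

# Direct count over every outcome of four die rolls.
by_enumeration = sum(
    1 for rolls in product(range(1, 7), repeat=4) if sum(rolls) == 14
)

print(all_solutions, bad_solutions, by_subtraction, by_enumeration)
```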





In this example, we were lucky that it was impossible to have two or more
variables that were "bad" at the same time in the previous problem. If this had
happened (e.g., if the sum of the four dice had to be 16 instead of 14), we would have
had to incorporate the principle of inclusion-exclusion first discussed in Section 3.1.
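To illustrate what incorporating inclusion-exclusion might look like, here is a hedged sketch that is not taken from the text. The function name dice_sum_count and its parameters are hypothetical. It first gives each die one pip (as in Example 8), then alternately subtracts and adds the counts in which j chosen dice are forced to be at least 7.

```python
from itertools import product
from math import comb

def dice_sum_count(num_dice, target, sides=6):
    """Count rolls of distinguishable dice with a given sum by inclusion-exclusion."""
    free = target - num_dice              # pips left after each die gets 1
    total = 0
    for j in range(num_dice + 1):         # j = number of dice forced to be too big
        rest = free - sides * j           # pips left after forcing j dice past the limit
        if rest < 0:
            break
        total += (-1) ** j * comb(num_dice, j) * comb(rest + num_dice - 1, rest)
    return total

def brute_force(num_dice, target, sides=6):
    """Direct enumeration, used only as a cross-check."""
    return sum(
        1 for rolls in product(range(1, sides + 1), repeat=num_dice)
        if sum(rolls) == target
    )

print(dice_sum_count(4, 14), dice_sum_count(4, 16), brute_force(4, 16))
```

For a sum of 14 only the j = 0 and j = 1 terms are nonzero, which reproduces 286 - 140; for a sum of 16 the j = 2 term also contributes.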


Practice Problem 5 In how many ways can one roll a sum of 17 on six
distinguishable six-sided dice?


Summary of Counting Basic Structures 


In the first four sections of this chapter, we have introduced four types of finite 
structures and provided “formulas” that count them. Table 5-26 summarizes what 
we have determined so far. In each case, the entries in the structure come from the
set {1, 2, ..., n}.


Solutions to Practice Problems 





1 The binary sequences described must have either one 1, three 1's, or five 1's.
Applying Theorem 1 for each of these cases, we have C(5, 1) binary sequences of
length 5 with one 1, C(5, 3) binary sequences of length 5 with three 1's, and
C(5, 5) binary sequences of length 5 with five 1's. The answer then is
C(5, 1) + C(5, 3) + C(5, 5).














What? | How Many?
Ordered lists of length r | n^r
Permutations of length r | P(n, r)
Unordered lists of size r | C(r + n - 1, r)
Sets of size r | C(n, r)

Table 5-26 A Summary of Counting Formulas


2 (a) This algorithm gives C(11, 2) · C(9, 4) · C(5, 1) · C(4, 4) = 34,650, which is
the same count as in the example.
(b) We imagine seven blanks into which we place the letters in ALABAMA
according to the following algorithm:

• Choose four of the seven blanks, and place A's in them.
• Choose one of the remaining three blanks, and place the L in it.
• Choose one of the remaining two blanks, and place the B in it.
• Choose the one remaining blank, and place the M in it.

There are C(7, 4) ways to do the first step, three ways to do the second step,
two ways to do the third step, and one way to do the last step, so by the product
rule there are C(7, 4) · 3 · 2 · 1 = 210 distinguishable arrangements.

3 Table 5-27 shows the completed table.

4 First put the required three apples and one banana into the bag, then purchase
six additional pieces of fruit. There is one way to do the first step. The second step
corresponds to binary sequences with six 0's (the fruit) and three 1's (the dividers
separating the groups of 0's). There are C(9, 6) = C(9, 3) = 84 different bags of fruit
meeting the conditions.

5 As in Example 9, we first convert the equation

x_1 + x_2 + x_3 + x_4 + x_5 + x_6 = 17

with positive integer solutions to the corresponding equation

x_1 + x_2 + x_3 + x_4 + x_5 + x_6 = 11

with nonnegative solutions. Now following Example 9, the number of nonnegative
solutions to this equation, with each x_i ≤ 5, is

C(16, 5) - 6 · C(10, 5) = 2856


Exercises for Section 5.4


1. How many binary sequences of length 8 satisfy each of the following conditions?
(a) Exactly six 1's
(b) Exactly two 0's
(c) At least two 1's

2. How many binary sequences of length 10 satisfy each of the following conditions?
(a) The same number of 0's and 1's
(b) At most two 0's
(c) At least eight 1's


























Bag of Fruit | Equation | Binary Sequence
1 apple, 3 bananas, 6 peaches | 1 + 3 + 6 = 10 | 010001000000
4 apples, 2 bananas, 4 peaches | 4 + 2 + 4 = 10 | 000010010000
2 bananas, 8 peaches | 0 + 2 + 8 = 10 | 100100000000
10 apples | 10 + 0 + 0 = 10 | 000000000011

Table 5-27 Solution to Practice Problem 3

















3. A ternary sequence is a sequence of digits chosen from {0, 1, 2}. How many
ternary sequences of length 12 satisfy each listed condition?
(a) Exactly eleven 1's
(b) Exactly three 1's and two 0's
(c) No 1's

4. You are tracking your favorite baseball player by writing down his performance
in 10 successive plate appearances using an ordered list of length 10.
(a) If you use H for a hit, S for a strikeout, B for a base-on-balls, and O for
anything else, how many results are possible?
(b) Of the total, how many have exactly three H's?
(c) Of the total, how many have exactly four H's, exactly one S, exactly two B's,
and exactly three O's?

5. How many ordered lists, of length 10 with repetitions allowed, are there
satisfying each condition?
(a) Using the set {a, b, c, d}
(b) Using the set {x, y, z} and having exactly three x's
(c) Using the set {0, 1, 2, 3, 4}, having exactly two 0's and three 2's, and not
beginning with 0

6. (Compute exact numerical answers for (a) and (b), and then answer (c) using
complete sentences.)
(a) How many binary sequences of length at most 5 have exactly three 1's?
(b) How many binary sequences of length 6 have exactly four 1's?
(c) Why do these two questions have the same answer?

7. How many 10-digit numbers are there that use exactly three 1's, three 2's, and
four 3's?

8. How many properly written numbers (i.e., leading 0's are not allowed) use each
digit {0, 1, 2, ..., 9} exactly once?

9. How many properly written numbers use each digit {0, 1, 2, ..., 9} exactly twice?

10. How many arrangements are there of the letters in the word MATHEMATICS?


11. How many arrangements are there of the letters from your last name?

12. How many arrangements are there of the letters in the word MISSISSIPPI that
begin with either an I or an S?
13. In Example 3 and Practice Problem 2, we extended the idea for counting binary
sequences to count any kind of sequence made with a fixed set of symbols. In this
exercise, we will develop an alternative way to answer these questions using our
more fundamental notion of equivalence classes.

(a) There are 7! = 5,040 arrangements of the letters in ALABAMA, given that the
seven letters all look different. On the other hand, if we did not use the goofy
fonts, some arrangements, like ALAAMAB and ALAAMAB for example, would look the
same. How many of the arrangements of the seven goofy letters look like ALABAMA
when the goofiness is removed?

(b) Use the answer to part (a) along with the fact that there are 5,040 different
arrangements of the seven distinct letters in ALABAMA, to determine the number of
distinguishable arrangements of the letters in ALABAMA.

14. For Examples 4 and 5, illustrate the correspondence between the original bag
of fruit, the equation solution, and the final binary sequence by filling in the
missing entries in Table 5-28.

15. I purchase a bag of 12 pieces of fruit from a store that sells apples, bananas,
peaches, oranges, pears, and pineapples. Using the strategy outlined in the section:

(a) Describe the equation and binary sequence that would represent these bags:
(i) two of each kind of fruit; (ii) six apples and six pears; (iii) 12 oranges.

(b) Describe the bags and equations corresponding to these binary sequences:
(i) 00110011001000000; (ii) 1000000101010010; (iii) 00010001000100011.

16. How many ways are there to fill a bag with 20 pieces of fruit at a store that
sells apples, bananas, and peaches?


Bag of fruit | Equation | Binary sequence
5 apples, 5 bananas | |
| | 1011000000000
| | 100000000001

Table 5-28 Table for Exercise 14


17. How many ways are there to fill a bag with 20 pieces of fruit at a store that
sells apples, bananas, peaches, and oranges?

18. How many ways are there to fill a bag with 20 pieces of fruit at FruitMart, a
store that sells 30 types of fruit?

19. How many solutions using nonnegative integers are there to the equation [...]?

20. How many solutions using nonnegative integers are there to the equation [...]?

21. How many solutions using nonnegative integers are there to the equation [...]?

22. How many different bags of produce can I bring back from the store, assuming
that there are apples, bananas, oranges, and peaches available, I buy at least one
of [...]?

23. Repeat the preceding exercise, but this time assuming [...].

24. How many solutions using positive integers are there to the equation
x + y + z = 13?

25. How many positive integer solutions are there to the equation [...]?
(Hint: Break the problem into four cases depending on whether c = 1, c = 2, c = 3,
or c = 4.)

26. Translate this problem from Section 5.1 into a "bag" problem, and solve it:
"How many ways can two winners be chosen for prizes from a class consisting of just
four people {Andrew, Bob, Carly, Diane}, assuming that the prizes are identical door
prizes (so it doesn't matter who gets which prize) and that the same person could
win both?" (Hint: First rewrite the problem in the form "How many unordered lists of
length ___ can be made from the set ___ with repetitions allowed?")

27. Translate the following into a "bag" problem and solve: The Computer Club has
15 members, and plans to give away 35 identical refrigerator magnets to the members.
How many ways are there to accomplish this if each member must receive at least one
magnet?

28. How many ways are there to distribute 200 apples among 43 women and 47 men if
each woman must get at least two apples and each man must get at least one apple?

29. How many ways are there to distribute 200 apples among 42 Democrats, 47
Republicans, and 5 Independents if each Democrat and Republican must get at least
one apple and at least one Independent gets less than four apples?

30. Generalize Example 8 to find a general formula for the number of positive
integer solutions to the equation [...].

31. There are 6^4 = 1,296 different outcomes from rolling four distinguishable
ordinary (six-sided) dice. In how many of these is the sum of the dice 12?





5.5 Recursive Counting 








At the beginning of Section 5.1, we related the ingenuity displayed by Gauss as
a child in computing the sum of the first 100 positive integers. This sort of clever
organization has been exploited throughout this chapter to derive all sorts of slick
devices for counting complicated sets of objects. In this section, we will examine
a form of reasoning that is valuable in many aspects of discrete mathematics but
especially in the study of computer science.








Examples 


Despite contradictory rumors, recursion is not a fabricated tool to make beginning
students dizzy. The human brain naturally uses recursive reasoning in many situations.
For example, when Gauss announced that the sum 1 + 2 + 3 + ... + 100 was 5,050,
the rest of the class was probably suitably impressed. However, suppose the teacher
had then responded with "Alright Mr. Smarty-Pants, Prince of Mathematics, now do
this sum!"

1 + 2 + 3 + ... + 99 + 100 + 101

No student in the class would have started over again. They would have simply added
101 to the 5,050 that Gauss already computed and arrived at 5,151 all at the same time.

We will model other problems with this same point of view. That is, we will
imagine that the answer to the proposed question is known when n = 100, and we
will think of a way to use this information to arrive at the answer when n = 101.
Of course, there is nothing sacred about the number 100 so our explanation ought
to be completely general. If at this point this seems closely related to mathematical
induction, you are on the right path.











Example 1 Find a recursive model for the number of games that must be played
in the first round of a round-robin tournament with n teams. (In the first round of a
round-robin tournament, each team plays all other teams once.)





SOLUTION Let a_n denote the number of first-round games in a tournament
of n teams. For example, a_1 = 0 and a_2 = 1. Let's imagine that we know everything
there is to know about tournaments with n - 1 teams, but we are now
faced with n teams to play in the first round. Use the letter Z to denote the last
team to enter. Without Z there would have been n - 1 teams in the tournament
so there would have been a_{n-1} first-round games, but Z is there and must play
each of those n - 1 teams as well; this accounts for an additional n - 1 games.
There is a total then of a_{n-1} + (n - 1) first-round games in a tournament with
n teams. More simply stated,

a_n = a_{n-1} + (n - 1)

This along with the knowledge that a_1 = 0 is enough to find the value of a_n for
any n.
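The recursive model translates directly into a recursive function. The sketch below uses a hypothetical function name, and the comparison with C(n, 2) (one game for each pair of teams) is a cross-check that is not derived in this example.

```python
from math import comb

def first_round_games(n):
    """a_n = a_(n-1) + (n - 1), with a_1 = 0."""
    if n == 1:
        return 0
    return first_round_games(n - 1) + (n - 1)

for n in range(1, 9):
    print(n, first_round_games(n), comb(n, 2))
```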


Example 2 Find a recursive model for P(n, r), the number of r-permutations
from {1, ..., n}.


SOLUTION The presence of two variables makes this a little more elusive,
but it's not too hard. Imagine we already are experts on permutations of all
lengths on sets of size n - 1, and we wish to enumerate the r-permutations of
{1, ..., n}. We will construct these with a two-step algorithm:

• Choose an element from {1, ..., n} for the first position.

• Place an (r - 1)-permutation from the remaining (n - 1)-element set in the
remaining r - 1 positions.

As this algorithm will form all r-permutations from {1, ..., n}, the product
rule tells us that

P(n, r) = n · P(n - 1, r - 1)

This along with the fact that P(n, 0) = 1 for all n is enough to generate any
value of P(n, r), so this is an adequate recursive model.
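A short sketch of this recursive model follows. The function name is hypothetical, and the comparison with math.perm from the Python standard library (available in Python 3.8 and later) is only a cross-check for 0 ≤ r ≤ n.

```python
from math import perm

def permutations_count(n, r):
    """P(n, r) = n * P(n - 1, r - 1), with P(n, 0) = 1 for all n."""
    if r == 0:
        return 1
    return n * permutations_count(n - 1, r - 1)

print(permutations_count(5, 2), perm(5, 2))
print(permutations_count(6, 6), perm(6, 6))
```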





Practice Problem 1 Let d_{n,k} denote the number of ways to roll a sum of n using
k distinguishable regular (six-sided) dice. Explain why the following relationship
is true:

d_{n,k} = d_{n-1,k-1} + d_{n-2,k-1} + d_{n-3,k-1} + d_{n-4,k-1} + d_{n-5,k-1} + d_{n-6,k-1}
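The relationship can be turned into a memoized recursive function once some initial values are chosen. The sketch below is one possible choice, not the text's: it assumes the base case that zero dice can produce only the sum 0 (in exactly one way), and the function name is hypothetical.

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def dice_ways(n, k):
    """d_{n,k}: ways to roll a sum of n with k distinguishable six-sided dice."""
    if k == 0:
        # Assumed initial values: no dice can only "sum" to 0.
        return 1 if n == 0 else 0
    # One die shows face 1..6; the other k - 1 dice make up the rest.
    return sum(dice_ways(n - face, k - 1) for face in range(1, 7))

print(dice_ways(14, 4))
print(dice_ways(17, 6))
```

The two printed values can be compared with Example 9 and Practice Problem 5 of Section 5.4.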


Example 3 Let b_n be the number of subsets of {1, 2, ..., n} that do not contain
consecutive numbers as elements. For example, if n = 4, the eight sets

{1, 3}, {2, 4}, {1, 4}, {1}, {2}, {3}, {4}, ∅

all have this property so we would say b_4 = 8. Find a recursive model for b_n.


SOLUTION It is easy to verify that b_1 = 2 (by listing the sets {1} and ∅) and
b_2 = 3 (by listing the sets {1}, {2}, and ∅). Now for a specific n > 2, imagine
that we know everything about these kinds of subsets of {1, 2, ..., k} for all
k < n, and we are trying to understand subsets of {1, 2, ..., n}. Any subset of
{1, 2, ..., n} either contains the element n or it does not.

• If a subset contains n, then in order to include no consecutive numbers, the
remaining elements of the subset must come from {1, 2, ..., n - 2} and
include no consecutive numbers. Because of our (imagined) expertise, we
know there are b_{n-2} ways to choose these remaining elements.

• If a subset does not contain n, then it is actually a subset of {1, 2, ..., n - 1}
with no consecutive numbers, and our (imagined) expertise tells us there
are b_{n-1} such subsets.


These two cases account for all subsets of {1, 2, ..., n} with no consecutive
numbers, so b_n = b_{n-1} + b_{n-2}.
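The recursive model for b_n can be compared with a direct enumeration of subsets for small n. This is an illustrative sketch with hypothetical function names; the brute-force version simply lists every subset and discards those containing two consecutive numbers.

```python
from itertools import combinations

def no_consecutive(n):
    """b_n = b_(n-1) + b_(n-2), with b_1 = 2 and b_2 = 3."""
    if n == 1:
        return 2
    if n == 2:
        return 3
    return no_consecutive(n - 1) + no_consecutive(n - 2)

def no_consecutive_brute(n):
    """List all subsets of {1, ..., n} and keep those with no consecutive pair."""
    count = 0
    for size in range(n + 1):
        for subset in combinations(range(1, n + 1), size):
            if all(later - earlier > 1 for earlier, later in zip(subset, subset[1:])):
                count += 1
    return count

print([no_consecutive(n) for n in range(1, 9)])
print([no_consecutive_brute(n) for n in range(1, 9)])
```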














Practice Problem 2 Let c_n denote the number of binary sequences of length n.
Use a recursive model to explain why

c_n = 2c_{n-1}


Example 4 The classic puzzle "The Towers of Hanoi" consists of n disks
of different diameters and three pegs. (Figure 5-8 illustrates n = 6.) The only rule is
that no larger disk can be stacked on a smaller disk. The objective is to move all the
disks from the leftmost peg to the rightmost peg one disk at a time. Let H_n denote the
number of required moves to accomplish the objective, and find a recursive model
for H_n.
SOLUTION It is easy to see that H_1 = 1. Assume that we are masters of the
Tower of Hanoi with n - 1 disks, and that someone has given us a Tower of
Hanoi puzzle with n disks stacked (legally) on the leftmost peg. The puzzle can
be solved in the following steps:


Figure 5-8 Towers of Hanoi.

n | 1 | 2 | 3 | 4 | 5 | 6
H_n | 1 | 3 | 7 | 15 | 31 | 63

Table 5-29 Number of Moves for the n-disk Tower of Hanoi


• Move the topmost n - 1 disks from the left to the center peg.
• Move the bottommost (i.e., largest) disk from the left peg to the right peg.
• Move the n - 1 disks from the center peg to the right peg.

Since the first step of this operation is equivalent to solving an (n - 1)-disk
Tower of Hanoi puzzle, we know from our assumed expertise that there are H_{n-1}
required moves to accomplish the first step. Clearly, only one move is required
to accomplish the second step, and again the last step requires the "solution" of
an (n - 1)-disk puzzle and consequently requires H_{n-1} additional moves. These
three steps then solve the puzzle with n disks, so we have H_n = H_{n-1} + 1 + H_{n-1},
or

H_n = 2H_{n-1} + 1
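The three steps can be sketched as a recursive procedure that lists the individual moves, so that the number of moves can be counted directly. The function name and peg labels are hypothetical, and the n = 0 base case (no disks, no moves) is an assumption added for convenience; the text starts from H_1 = 1.

```python
def hanoi_moves(n, source="left", target="right", spare="center"):
    """Return the list of (from_peg, to_peg) moves that transfers n disks."""
    if n == 0:
        return []  # assumed base case: nothing to move
    return (
        hanoi_moves(n - 1, source, spare, target)    # top n - 1 disks out of the way
        + [(source, target)]                         # largest disk to its destination
        + hanoi_moves(n - 1, spare, target, source)  # n - 1 disks back on top of it
    )

for n in range(1, 7):
    print(n, len(hanoi_moves(n)))
```

The printed counts satisfy H_n = 2H_{n-1} + 1 by construction, since each call makes two recursive calls on n - 1 disks and adds one move.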


Example 4 illustrates an interesting phenomenon. The recursive description of
H_n was easy to justify, but if we had written out a few of the H_n numbers, we would
see the strong pattern shown in Table 5-29. We leave the proof of this pattern for
more practice with induction.


Practice Problem 3 Use induction to show that the H_n of Example 4 satisfies
H_n = 2^n - 1.





Example 5 In tennis, a player must essentially win the game by two points.
Specifically, if the game is tied late in the game (this is called being "tied at
deuce"), then the game continues until one player has won two points more than the
other player. In how many different ways can a tennis game tied at deuce still be
tied at deuce 10 points later?


SOLUTION Let T_n be the number of ways this can happen after n points. If
we look at some examples, we might see a pattern among the values of T_n.

• If the game is tied at deuce, then the game cannot be tied again after only
one point; therefore, we would say that T_1 = 0. Extending this reasoning,
we can see that in fact after any odd number of points have been played,
the game cannot be tied again; hence, T_n = 0 whenever n is odd.

• Let's consider T_2, the number of ways that a game tied at deuce is tied at
deuce again two points later. We will represent the play of the games as a
list of letters from {A, B}, where A represents a point won by one player
and B represents a point won by the other player. The only sequences of
two points that will leave the game tied again are AB and BA; hence,
T_2 = 2.


• We can deal with T_4 in a similar way by simply listing all possibilities for
four points: ABAB, ABBA, BAAB, and BABA. Therefore, T_4 = 4.


Now suppose that we have mastered counting the possibilities for games
lasting fewer than n points, and we are confronted with having to count how many
games that start at deuce are at deuce again after n points (assuming n is an
even number). These games look like one of the following:

• AB ____, where the blank contains a sequence representing a game
that starts tied at deuce and is again tied at deuce n - 2 points later. We
know that there are T_{n-2} possibilities to go in the blank.

• BA ____, where the blank contains a sequence representing a game
that starts tied at deuce and is again tied at deuce n - 2 points later. We
know once again that there are T_{n-2} possibilities to go in the blank.

Since in each of these two cases there are T_{n-2} possibilities, the total number
of ways this can happen is 2 · T_{n-2}. In other words,

T_n = 2 · T_{n-2}

This along with the facts that T_1 = 0 and T_2 = 2 is enough to determine the
value of T_n for any n.
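As a check on this recursive model, the sketch below compares it with a direct enumeration of point sequences. The function names are hypothetical. The brute-force version assumes, as in the discussion above, that play stops as soon as one player leads by two points, so a sequence counts only if the lead never reaches two and is zero at the end.

```python
from itertools import product

def tied_again(n):
    """T_n = 2 * T_(n-2), with T_1 = 0 and T_2 = 2."""
    if n == 1:
        return 0
    if n == 2:
        return 2
    return 2 * tied_again(n - 2)

def tied_again_brute(n):
    """Enumerate all sequences of n points won by A or B."""
    count = 0
    for points in product("AB", repeat=n):
        lead = 0
        for winner in points:
            lead += 1 if winner == "A" else -1
            if abs(lead) == 2:  # someone has won, so the game is over
                break
        else:
            if lead == 0:
                count += 1
    return count

print(tied_again(10), tied_again_brute(10))
```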


Solutions to Practice Problems 


1. Imagine one die is a different color than the others, say it's red.
   A roll of all $k$ dice that sum to $n$ falls into one of six cases,
   depending on the outcome of the red die.
   • If the red die is a 1, then the other $k - 1$ dice must sum
     to $n - 1$; there are $d_{n-1,k-1}$ ways for this to happen.
   • If the red die is a 2, then the other $k - 1$ dice must sum
     to $n - 2$; there are $d_{n-2,k-1}$ ways for this to happen.
   • If the red die is a 3, then the other $k - 1$ dice must sum
     to $n - 3$; there are $d_{n-3,k-1}$ ways for this to happen.
   • If the red die is a 4, then the other $k - 1$ dice must sum
     to $n - 4$; there are $d_{n-4,k-1}$ ways for this to happen.
   • If the red die is a 5, then the other $k - 1$ dice must sum
     to $n - 5$; there are $d_{n-5,k-1}$ ways for this to happen.
   • If the red die is a 6, then the other $k - 1$ dice must sum
     to $n - 6$; there are $d_{n-6,k-1}$ ways for this to happen.
   Altogether this means that

       $d_{n,k} = d_{n-1,k-1} + d_{n-2,k-1} + d_{n-3,k-1} + d_{n-4,k-1} + d_{n-5,k-1} + d_{n-6,k-1}$

2. Each binary sequence of length $n$ consists of either 0 or 1 as its
   leftmost entry followed by a binary sequence of length $n - 1$. Hence,
   the number of binary sequences of length $n$ is twice the number of
   binary sequences of length $n - 1$.

3. From Example 4, we know that $H_1 = 1$ and $H_n = 2H_{n-1} + 1$.
   Let $P(n)$ be the statement "$H_n = 2^n - 1$."
   Since $2^1 - 1 = 1$, we know that $P(1)$ is true. Now suppose
   $P(1), P(2), \ldots, P(n - 1)$ have all been checked to
   be true for some given integer $n \ge 2$. Then

       $H_n = 2H_{n-1} + 1$             from Example 4
           $= 2(2^{n-1} - 1) + 1$       by $P(n - 1)$
           $= 2^n - 1$


Exercises for Section 5.5


1. Find a recursive model for $d_n$, the number of $n$-digit
   numbers that do not use the digit 0.


2. For Practice Problem 1, give enough information about
   "initial values" so that all values of $d_{n,k}$ can be determined.











3. For Example 5, find a formula in terms of $n$ for $T_n$, and
   prove your formula is correct using mathematical induction.

4. Use the recurrence relation $P(n, r) = n \cdot P(n - 1, r - 1)$
   (with $P(n, 0) = 1$ for all $n \ge 0$) from Example 2 to
   prove by induction that $P(n, n) = n!$ for all $n \ge 0$.

5. Use the recurrence relation $P(n, r) = n \cdot P(n - 1, r - 1)$
   (with $P(n, 0) = 1$ for all $n \ge 0$) from Example 2 to
   prove each of the following:

   (a) For all $n \ge 1$, $P(n, 1) = n$.

   (b) For all $n \ge 2$, $P(n, 2) = n \cdot (n - 1)$. (You will need
       to use (a).)

   (c) For all $n \ge 3$, $P(n, 3) = n \cdot (n - 1) \cdot (n - 2)$. (You
       will need to use (b).)

6. Recall that $C(n, k)$ is defined to be the number of $k$-element
   subsets of the set $\{1, 2, \ldots, n\}$. These can be
   generated using the following recursive algorithm: (i)
   Decide whether to use the largest possible number $n$
   or not; (ii) fill in the remaining elements chosen from
   $\{1, 2, \ldots, n - 1\}$ to form a $k$-element set. Explain how
   this accounts for the fact that

       $C(n, k) = C(n - 1, k) + C(n - 1, k - 1)$

7. Find a recursive model for $c_n$, the number of ways to
   cover a $2 \times n$ "chessboard" with "dominoes" that each
   cover a $1 \times 2$ area.

8. Let $d_n$ denote the number of ways that $n$ letters can be
   put into $n$ envelopes so that no letter goes in the correct
   envelope. The numbers $d_n$ are traditionally called
   derangement numbers.

   The great Swiss mathematician Leonhard Euler
   (1707-1783) showed that these numbers satisfy the
   recurrence relation

       $d_n = (n - 1)(d_{n-1} + d_{n-2})$   for $n \ge 3$

56 Excursion: Solving Recurrence Relatons 423 


   Find the values of $d_1$ and $d_2$, and then use this recursive
   description to compute the value of $d_5$.

9. Prove by induction on $n \ge 1$ that the derangement numbers
   from the previous exercise satisfy

       $d_n = n! \left( 1 - \frac{1}{1!} + \frac{1}{2!} - \frac{1}{3!} + \cdots + \frac{(-1)^n}{n!} \right)$


10. Find a recursive model for $t_n$, the number of ways in
    which $2n$ tennis players can be partitioned into $n$ first-round
    matches in a tournament.

11. Find a recursive model for $W_n$, the number of ways that a
    tennis game tied at deuce is over after exactly $n$ more
    points are played. (This problem uses terminology from
    Example 5.)

12. Find a recursive model (in terms of $n$ and $k$) for $f_{n,k}$, the
    number of ways to fill a bag with $n$ pieces of fruit at a
    store that sells $k$ types of fruit.

13. Find a recursive model for $s_n$, the number of binary
    sequences of length $n$ that have no consecutive 1's.

14. Find a recursive model for $w_n$, the number of properly
    written $n$-digit numbers that do not have consecutive
    1's as digits.

15. A partition of the set $X = \{1, 2, \ldots, n\}$ is a collection
    $S_1, S_2, \ldots$ of nonempty subsets of $X$ that do not overlap
    (i.e., when $i \ne j$, $S_i \cap S_j = \emptyset$) but that together
    account for the entire set (i.e., $S_1 \cup S_2 \cup \cdots = X$). Find a
    recursive model for counting $B_n$, the number of different
    partitions of $\{1, 2, \ldots, n\}$. (These are called the Bell
    numbers in combinatorics.)




















5.6 Excursion: Solving Recurrence Relations





"This section is focused on the issue of finding closed formulas for recursively defined 
sequences. This problem is a central fixture in Section 1.2 of this book, although we
were not brave enough at the time to attempt any formal procedures for finding closed
formulas. In that section and the subsequent sections on mathematical induction in
Chapter 2, we contented ourselves with building an understanding of, and being able
to prove, the relationships between the two types of sequence definitions. In this
section, we will develop some methods for moving from recursive descriptions to
closed formulas, and we will see some applications of these methods.


424 Chapter 5 / Combinatorics 


Difference Tables 


When we played "guess the next number" in Section 1.2, we often looked at the
differences between successive terms to look for a pattern. This sometimes led to a
recursive description, but it rarely led immediately to a closed formula. Under special
circumstances, the method of difference tables will allow us to find a closed formula
without ever producing a recursive description. We will use the sequences in the
following example to develop this method over the next few pages.


Example 1 For each of the following sequences,
express the general pattern as a recurrence relation:

1. 2, 5, 8, 11, 14, ...
2. 6, 11, 19, 30, 44, ...
3. 1, 5, 14, 30, 55, 91, ...





For the sequence 2, 5, 8, 11, 14, ..., it is easy to form the sequence of differences
between successive terms as follows:

    2     5     8     11    14   ...
       3     3     3     3

In this discussion, it is important that we index our original sequence beginning with
0. Hence, in the sequence above, we have $s_0 = 2$, $s_1 = 5$, and so on. Formally, we
define the sequence of differences for a given sequence $\{s_n\}$ as follows.





Definition For a given sequence of numbers $\{s_n\}$ beginning with $s_0$, we
define the sequence $\{\Delta_n\}$ of first differences by the rule

    $\Delta_n = s_{n+1} - s_n$

for all $n \ge 0$. Hence, the first term in the sequence of first differences is
$\Delta_0 = s_1 - s_0$.
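
The definition translates directly into code. The following is a minimal Python sketch, assuming the sequence is given as a finite list indexed from 0; the function name first_differences is chosen here for illustration only.

```python
def first_differences(s):
    """Return [s[1] - s[0], s[2] - s[1], ...] for a finite list s.

    first_differences is an illustrative name; the list s is assumed
    to hold s_0, s_1, ... in order.
    """
    return [s[n + 1] - s[n] for n in range(len(s) - 1)]


if __name__ == "__main__":
    # The sequence 2, 5, 8, 11, 14 from the discussion above.
    print(first_differences([2, 5, 8, 11, 14]))
```

A list of $m$ terms yields $m - 1$ first differences, which is why each row of a difference table is one entry shorter than the row above it.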











Table 5-30 shows the original sequence and the sequence of first differences in
convenient table form. The examination of differences is of central importance to
mathematics. Indeed, the entire field of differential calculus is a close examination of
certain differences. The connection between sums and differences in "continuous"
mathematics is reflected by the fundamental theorem of calculus itself. In the context
of "finite differences," this relationship is slightly more transparent. Consider the sum
of the numbers in the $\Delta_n$ row of Table 5-30:





    $n$          0    1    2    3    4   ...
    $s_n$        2    5    8   11   14   ...
    $\Delta_n$   3    3    3    3   ...

Table 5-30 First Differences of the Sequence $s_n$













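    $\Delta_{m-1} + \Delta_{m-2} + \cdots + \Delta_1 + \Delta_0$
        $= (s_m - s_{m-1}) + (s_{m-1} - s_{m-2}) + \cdots + (s_2 - s_1) + (s_1 - s_0)$
        $= s_m - s_0$

Since the first differences are all three in the example, we know that
$\Delta_{m-1} + \Delta_{m-2} + \cdots + \Delta_1 + \Delta_0 = 3m$, and so

    $3m = s_m - s_0$

which means that $s_m = 3m + 2$.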

This is certainly more complicated than it needs to be for a sequence as simple as
this, but we have laid a foundation of tools that can be applied to more interesting
sequences. The primary one (which you are asked to prove formally in the exercises)
is the relationship between sums and differences.





Theorem 1 (Fundamental Theorem of Sums and Differences) For any sequence
$\{s_n\}$ with first differences $\Delta_k = s_{k+1} - s_k$ and any $n \ge 1$,

    $s_n - s_0 = \sum_{k=0}^{n-1} \Delta_k$
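
Theorem 1 can be spot-checked numerically on any finite list of terms. The sketch below is only an illustration and not a proof; the helper name telescoping_holds is an assumption made for this example.

```python
def telescoping_holds(s):
    """Check s_n - s_0 == Delta_0 + ... + Delta_(n-1) for every n >= 1 in the list.

    telescoping_holds is an illustrative name, not notation from the text.
    """
    deltas = [s[k + 1] - s[k] for k in range(len(s) - 1)]
    return all(sum(deltas[:n]) == s[n] - s[0] for n in range(1, len(s)))


if __name__ == "__main__":
    print(telescoping_holds([2, 5, 8, 11, 14]))
    print(telescoping_holds([6, 11, 19, 30, 44]))
```

The check passes for every list of numbers, because the intermediate terms cancel in pairs exactly as in the sum written out before the theorem.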


Example 2 Use the procedure above to find a closed formula
for the sequence

    6, 11, 19, 30, 44, ...





When we form the difference table as we did before, the sequence of first differ- 
ences is not constant, so we need to extend our definition of differences so that it can 
be repeated. We define the sequence of second differences by forming the differences 
of the sequence of first differences. (Likewise, we can define the sequence of third 
differences by forming the differences of the sequence of second differences, and so 
on, but that will not be necessary for this example.) The general definition is recursive 
in nature. 





Definition For a given sequence of numbers $\{s_n\}$ beginning with $s_0$ and an
integer $k \ge 2$, we define the sequence $\{\Delta_n^k\}$ of $k$th differences by the rule

    $\Delta_n^k = \Delta_{n+1}^{k-1} - \Delta_n^{k-1}$

for all $n \ge 0$. Hence, the first term in the sequence of $k$th differences is
$\Delta_0^k = \Delta_1^{k-1} - \Delta_0^{k-1}$. For consistency, we refer to the sequence of first
differences as $\{\Delta_n^1\}$ instead of $\{\Delta_n\}$ in this context.















Now the complete difference table for the given sequence 6, 11, 19, 30, 44, ...
from Example 2 is given in Table 5-31. We can easily find a closed formula for $\Delta_n^1$,
but let's do it as an application of Theorem 1 just for fun. According to the theorem,































we should have

    $\Delta_m^1 - \Delta_0^1 = \sum_{i=0}^{m-1} \Delta_i^2 = 3m$





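Since $\Delta_0^1 = 5$, we conclude that $\Delta_m^1 = 3m + 5$, and this formula clearly matches the
terms shown. Now applying Theorem 1 again, we have

    $s_m - s_0 = \sum_{i=0}^{m-1} \Delta_i^1$
              $= \sum_{i=0}^{m-1} (3i + 5)$
              $= 3 \cdot \frac{(m-1)m}{2} + 5m$        by Proposition 1 from Section 2.3
              $= \frac{3}{2}m^2 + \frac{7}{2}m$

Therefore, since $s_0 = 6$, we can conclude that $s_m = \frac{3}{2}m^2 + \frac{7}{2}m + 6$.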

This process is straightforward but rather tedious. There is a remarkable simpli- 
fication of this process whenever the difference table eventually has a constant row, 
as it does in these previous examples. 


Theorem 2 For a sequence $\{s_n\}$ whose $k$th differences are constant, for all $n \ge 0$,

    $s_n = s_0 + \sum_{i=1}^{k} \Delta_0^i \cdot C(n, i)$


Example 3 Use Theorem 2 to find a closed formula for the sequence
1, 5, 14, 30, 55, 91, ...


































    $n$            0    1    2    3    4    5
    $s_n$          1    5   14   30   55   91
    $\Delta_n^1$   4    9   16   25   36
    $\Delta_n^2$   5    7    9   11
    $\Delta_n^3$   2    2    2























Table 5-32 Difference Table for Example 3 


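SOLUTION Table 5-32 shows the complete difference table for this sequence.
The leading entries of the rows are $s_0 = 1$, $\Delta_0^1 = 4$, $\Delta_0^2 = 5$, and
$\Delta_0^3 = 2$, so according to Theorem 2,

    $s_n = s_0 + \Delta_0^1 \cdot C(n, 1) + \Delta_0^2 \cdot C(n, 2) + \Delta_0^3 \cdot C(n, 3)$
        $= 1 + 4n + 5 \cdot \frac{n(n-1)}{2} + 2 \cdot \frac{n(n-1)(n-2)}{6}$
        $= 1 + 4n + \frac{5}{2}(n^2 - n) + \frac{1}{3}(n^3 - 3n^2 + 2n)$
        $= 1 + \frac{13}{6}n + \frac{3}{2}n^2 + \frac{1}{3}n^3$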
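
The method of Theorem 2 is mechanical enough to sketch in Python: build the difference table until a row is constant, read off the leading entry of each row, and combine them with binomial coefficients. The names difference_table and closed_form_from_table are hypothetical helpers introduced here, and the sketch assumes enough terms are supplied for a constant row to appear.

```python
from math import comb


def difference_table(s):
    """Return the rows s, first differences, second differences, ...

    Stops at the first constant row (or when a row has a single entry).
    Illustrative helper; assumes s lists s_0, s_1, ... in order.
    """
    rows = [list(s)]
    while len(rows[-1]) > 1 and any(x != rows[-1][0] for x in rows[-1]):
        prev = rows[-1]
        rows.append([prev[i + 1] - prev[i] for i in range(len(prev) - 1)])
    return rows


def closed_form_from_table(s):
    """Return n -> s_0 + sum of (leading entry of row i) * C(n, i), as in Theorem 2."""
    leading = [row[0] for row in difference_table(s)]
    # Row 0 contributes s_0 * C(n, 0) = s_0.
    return lambda n: sum(d * comb(n, i) for i, d in enumerate(leading))


if __name__ == "__main__":
    f = closed_form_from_table([1, 5, 14, 30, 55, 91])
    print([f(n) for n in range(8)])
```

Working with the binomial form directly avoids the algebra of expanding into powers of $n$; the expansion is only needed if a polynomial in standard form is wanted.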


Practice Problem 1 Find a formula for the sum of the first $n$ fourth powers

    $\sum_{k=1}^{n} k^4$


The previous section showed how sometimes we can easily find a recursive
description of the answer to a counting problem. Difference tables can also be used
to find closed formulas for some of these descriptions.


Example 4 Let $r_n$ be the number of regions of a plane created by $n$ lines,
no two of which are parallel and no three of which intersect in any single point. Find
a recursive description of $r_n$ and use this to find a closed formula utilizing Theorem 2.











SOLUTION In Figure 5-9, we draw some pictures to see the first few values
of the sequence $\{r_n\}$.

Because of the rules for placing the lines, when the $n$th line is drawn,
it must cross each of the previously drawn $n - 1$ lines. This will create $n$ new
regions. Hence, the recurrence relation $r_n = r_{n-1} + n$ holds for this sequence.
We can use this pattern to extend the pattern in the picture as we create the
difference table for this sequence in Table 5-33. According to Theorem 2, the
closed formula for this sequence is

    $r_n = 1 + 1 \cdot C(n, 1) + 1 \cdot C(n, 2)$

which can be simplified to

    $r_n = \frac{n(n+1)}{2} + 1$
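
The agreement between the recurrence and the closed formula can be checked for small cases with a short Python sketch. The function names regions_by_recurrence and regions_closed are assumptions made for this illustration.

```python
def regions_by_recurrence(n: int) -> int:
    """Iterate r_k = r_(k-1) + k starting from r_0 = 1 (illustrative helper)."""
    r = 1
    for k in range(1, n + 1):
        r += k  # the k-th line adds k new regions
    return r


def regions_closed(n: int) -> int:
    """Closed formula n(n+1)/2 + 1 obtained from Theorem 2."""
    return n * (n + 1) // 2 + 1


if __name__ == "__main__":
    print([regions_by_recurrence(n) for n in range(6)])
    print([regions_closed(n) for n in range(6)])
```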































[Figure: no lines, $r_0 = 1$ region; 1 line, $r_1 = 2$ regions; 2 lines, $r_2 = 4$ regions; 3 lines, $r_3 = 7$ regions]

Figure 5-9 First few cases for Example 4.

Figure 5-10 The hockey stick identity with m and n = 3.


Proving Theorem 2 


The key to proving Theorem 2 is the following fact about Pascal's arithmetic triangle.
The proof of this fact is addressed in Exercise 12 at the end of this section. The format
of the proof of Theorem 2 is induction on $k \ge 1$, although it is one of the more difficult
induction proofs in this book.


Theorem 3 (Hockey Stick Identity) For all $m \ge 0$ and $n \ge 0$,

    $C(m, m) + C(m + 1, m) + \cdots + C(m + n, m) = C(m + n + 1, m + 1)$        (5.3)


This fact obtains its name from the picture that you get when you illustrate it in Pascal's
arithmetic triangle as shown in Figure 5-10.
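
Before relying on the identity, it can be reassuring to test it for small values. The following Python sketch uses math.comb from the standard library for $C(n, k)$; the function names are hypothetical and the check is numerical evidence only, not a proof.

```python
from math import comb


def hockey_stick_lhs(m: int, n: int) -> int:
    """C(m, m) + C(m+1, m) + ... + C(m+n, m), the 'shaft' of the stick."""
    return sum(comb(m + j, m) for j in range(n + 1))


def hockey_stick_rhs(m: int, n: int) -> int:
    """C(m+n+1, m+1), the 'blade' of the stick."""
    return comb(m + n + 1, m + 1)


if __name__ == "__main__":
    # One sample case: m = 2, n = 3 gives 1 + 3 + 6 + 10 on the left.
    print(hockey_stick_lhs(2, 3), hockey_stick_rhs(2, 3))
```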


PROOF (of Theorem 2) Let $P(k)$ be the statement "For any sequence $\{s_n\}$ whose
$k$th differences are constant, $s_n = s_0 + \sum_{i=1}^{k} \Delta_0^i \cdot C(n, i)$ for all $n \ge 0$." Statement
$P(1)$ simply says, "For any sequence $\{s_n\}$ whose first differences are constant,
$s_n = s_0 + \Delta_0^1 \cdot C(n, 1)$ for all $n \ge 0$." This is established by Exercise 13 at the end
of this section.

Now let $m \ge 2$ be given such that statements $P(1), \ldots, P(m - 1)$ have all
been checked to be true. In particular, $P(m - 1)$ tells us about sequences with
constant $(m - 1)$st differences. To prove the next statement, we let any sequence
$\{s_n\}$ be given whose $m$th differences are constant. Consider the sequence $\{\Delta_n\}$ of
first differences of $\{s_n\}$. This new sequence $\{\Delta_n\}$ has constant $(m - 1)$st differences.
More important, the $i$th differences for the sequence $\{\Delta_n\}$ are precisely the $(i + 1)$st
differences for the original sequence $\{s_n\}$. So by statement $P(m - 1)$, we know that











    $n$            0    1    2    3    4
    $r_n$          1    2    4    7   11
    $\Delta_n^1$   1    2    3    4
    $\Delta_n^2$   1    1    1

Table 5-33 Difference Table for Example 4










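for all $n \ge 0$,

    $\Delta_n = \Delta_0 + \sum_{i=1}^{m-1} \Delta_0^{i+1} \cdot C(n, i)$

We know that (writing $\Delta_0^1$ for $\Delta_0$)

    $s_n = s_0 + \sum_{j=0}^{n-1} \Delta_j$                                                      by Theorem 1
        $= s_0 + \sum_{j=0}^{n-1} \left( \Delta_0^1 + \sum_{i=1}^{m-1} \Delta_0^{i+1} \cdot C(j, i) \right)$        by statement $P(m - 1)$
        $= s_0 + n \cdot \Delta_0^1 + \sum_{j=0}^{n-1} \sum_{i=1}^{m-1} \Delta_0^{i+1} \cdot C(j, i)$
        $= s_0 + \Delta_0^1 \cdot C(n, 1) + \sum_{i=1}^{m-1} \left( \sum_{j=0}^{n-1} C(j, i) \right) \cdot \Delta_0^{i+1}$     from rearranging the sum
        $= s_0 + \Delta_0^1 \cdot C(n, 1) + \sum_{i=1}^{m-1} C(n, i + 1) \cdot \Delta_0^{i+1}$                     by Theorem 3
        $= s_0 + \Delta_0^1 \cdot C(n, 1) + \sum_{i=2}^{m} \Delta_0^i \cdot C(n, i)$
        $= s_0 + \sum_{i=1}^{m} \Delta_0^i \cdot C(n, i)$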





The induction proof is complete at this point, but it is highly recommended
that you trace through it with a particular sequence in mind to fully understand
the steps.


Other Types of Recurrence Relations 


When we study solutions to recurrence relations, we look at different types of
recurrences and try to find a general solution for each type. There are some common
techniques that can be used to find these general solutions, but these are better
suited to further study in combinatorics. In the remainder of this section,
we will look at two particular types of recurrence relations, and use induction to
prove what we can about them. We will learn more about some of the recurrence
relations we saw earlier in the book, and we will see some common applications of
recurrence relations. Let's first see why the method of difference tables is not enough
for understanding recurrence relations in general.








Example 5 Complete the difference table for the sequence
5, 13, 37, 109, 325, 973, ...








SOLUTION Table 5-34 shows the first two rows of the difference table. The
first differences change by a factor of 3 as you look left to right, as do the
second differences. Hence, the differences will not eventually be
constant, but they have settled down into the same simple pattern.








    $n$            0    1    2     3     4     5
    $s_n$          5   13   37   109   325   973
    $\Delta_n^1$   8   24   72   216   648
    $\Delta_n^2$  16   48  144   432

Table 5-34 Partial Difference Table for Example 5


Because the differences are never constant, we cannot find a closed formula
for the sequence in Example 5 using our method of difference tables. However, the
difference table above is still useful in determining that each term in the original
sequence is roughly three times the previous term. A sequence based on this kind
of pattern is the first of two special types of recurrence relation that we will study in
the remainder of this section.


Linear First-Order Recurrences 








Definition A linear first-order recurrence relation is one of the form

    $a_n = C \cdot a_{n-1} + f(n)$

where $C$ is any constant and $f$ is any (nonrecursive) function.





Sequences with this sort of recursive pattern are easy to understand when the
function $f$ in the definition is simple. We will be able to recognize these cases when
we see a row in the difference table in which the values increase by a constant factor,
as they do in Example 5 above. To illustrate how this is helpful, we will find a closed
formula for the sequence in Example 5.


Example 6 Find a closed formula for the sequence 5, 13, 37, 109, 325, 973, ...
whose difference table is given in Example 5.
SOLUTION We break down the solution into a sequence of observations:
1. $\Delta_n = 8 \cdot 3^n$ for all $n \ge 0$.
2. By Theorem 1, $s_n - s_0 = \sum_{k=0}^{n-1} \Delta_k = 8 \cdot \sum_{k=0}^{n-1} 3^k$.
3. From Exercise 13 of Section 2.3 (concerning the sum of a geometric
   sequence), we know that

       $\sum_{k=0}^{n-1} 3^k = \frac{3^n - 1}{2}$

4. Therefore,

       $s_n = 5 + 8 \cdot \frac{3^n - 1}{2} = 4 \cdot 3^n + 1$
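
The observations above can be checked against the listed terms with a few lines of Python. This is only a sketch under the assumption that the sequence is indexed from 0; the name closed_term is introduced here for illustration, and the formula $4 \cdot 3^n + 1$ is what the four steps yield when $s_0 = 5$.

```python
terms = [5, 13, 37, 109, 325, 973]  # s_0 through s_5 from Example 5

# Step 1: the first differences are 8 * 3^n.
first_diffs = [terms[n + 1] - terms[n] for n in range(len(terms) - 1)]


def closed_term(n: int) -> int:
    """s_n = s_0 + 8 * (3^n - 1) / 2 = 4 * 3^n + 1 (illustrative helper)."""
    return 5 + 8 * (3**n - 1) // 2


if __name__ == "__main__":
    print(first_diffs)
    print([closed_term(n) for n in range(len(terms))])
```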










While it is possible to find a closed formula for many linear first-order recurrence
relations in this way, it is more efficient to prove a general result that can be used for
a large class of sequences.





Theorem 4 Given the recurrence relation $a_n = b \cdot a_{n-1} + c$, with $b \ne 1$, the closed
formula $a_n = L \cdot b^n + \frac{c}{1-b}$ satisfies the recurrence relation for any value of $L$.


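PROOF We show that any closed formula of the given form satisfies the given
recurrence. If $a_n = L \cdot b^n + \frac{c}{1-b}$, then $a_{n-1} = L \cdot b^{n-1} + \frac{c}{1-b}$, and so

    $b \cdot a_{n-1} + c = L \cdot b^n + \frac{bc}{1-b} + c$
                      $= L \cdot b^n + \frac{bc + c - bc}{1-b}$
                      $= L \cdot b^n + \frac{c}{1-b}$
                      $= a_n$

which means the recurrence relation is satisfied by this closed formula.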


In practice, we will typically know a starting point for the sequence in addition
to a recurrence relation, so we can use the starting value to determine an appropriate
value of $L$ in the general closed formula given above.





Example 7 Find a recurrence relation for the following sequence, and use
Theorem 4 to find a closed formula:

    1, 3, 11, 43, 171, ...

SOLUTION The recurrence relation for this sequence is $a_n = 4a_{n-1} - 1$, so
$b = 4$ and $c = -1$ above, giving us a solution of the form $a_n = L \cdot 4^n + \frac{1}{3}$. In
order for $a_1 = 1$ to be true, we solve $L \cdot 4^1 + \frac{1}{3} = 1$, and get $L = \frac{1}{6}$. Therefore,

    $a_n = \frac{4^n}{6} + \frac{1}{3}$

is the closed formula for the given sequence.
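
Theorem 4 together with a starting value gives a small recipe: compute the constant $\frac{c}{1-b}$, then solve for $L$. The Python sketch below follows that recipe using exact fractions from the standard library. The function name first_order_closed_form and its argument a1 (the value of $a_1$) are assumptions made for this illustration.

```python
from fractions import Fraction


def first_order_closed_form(b, c, a1):
    """Return n -> a_n for a_n = b * a_(n-1) + c with a_1 = a1 and b != 1.

    Uses the form a_n = L * b^n + c / (1 - b) from Theorem 4.
    Illustrative helper, not notation from the text.
    """
    b, c = Fraction(b), Fraction(c)
    if b == 1:
        raise ValueError("Theorem 4 requires b != 1")
    offset = c / (1 - b)
    # Solve L * b^1 + offset = a1 for L.
    L = (Fraction(a1) - offset) / b
    return lambda n: L * b**n + offset


if __name__ == "__main__":
    a = first_order_closed_form(4, -1, 1)  # the sequence of Example 7
    print([a(n) for n in range(1, 6)])
```

Exact fractions are used so that values such as $\frac{1}{6}$ and $\frac{1}{3}$ do not introduce rounding error.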


Example 8 Suppose you save money at a bank that pays 5% per year in simple
interest. For example, if you have $1,000 in the bank all year, then at the end of the year,
the bank adds $50 to your account. If you open the account with $100, and in addition
to the interest that the bank pays, on the last day of each year, you deposit $100 more
into the account, how much money will you have at the end of 10 years? $n$ years?


SOLUTION Let $M_n$ be the amount of money you have at the beginning of
the $n$th year. We can figure out the value of your account for the first few years:

• $M_1 = 100$
• $M_2 = 100 + 0.05 \cdot (100) + 100 = 205$
• $M_3 = 205 + 0.05 \cdot (205) + 100 = 315.25$


In general, we see the recursive pattern

    $M_n = M_{n-1} + 0.05 \cdot M_{n-1} + 100$
        $= 1.05 \cdot M_{n-1} + 100$
We can use Theorem 4 with $b = 1.05$ and $c = 100$ to conclude that the closed
formula

    $M_n = L \cdot (1.05)^n + \frac{100}{1 - 1.05}$
        $= L \cdot (1.05)^n - 2{,}000$

will describe the same sequence, for an appropriate value of $L$. To find the
value of $L$, we simply take a known value like $M_1 = 100$ and solve the equation

    $100 = L \cdot (1.05)^1 - 2{,}000$

which tells us $L = 2{,}000$. Therefore, the closed formula for this sequence is

    $M_n = 2{,}000 \cdot (1.05)^n - 2{,}000$
        $= 2{,}000 \cdot ((1.05)^n - 1)$
We can check a few values to make sure this agrees with our work above:

• $M_1 = 2{,}000 \cdot ((1.05)^1 - 1) = 100$
• $M_2 = 2{,}000 \cdot ((1.05)^2 - 1) = 205$
• $M_3 = 2{,}000 \cdot ((1.05)^3 - 1) = 315.25$





The given question about the end of the tenth year is answered by the value
of $M_{11} = 2{,}000 \cdot ((1.05)^{11} - 1) \approx 1{,}420.68$, which is the amount of money in the
bank at the beginning of the eleventh year. Similarly, the amount of money in
the bank at the end of $n$ years is given by $M_{n+1} = 2{,}000 \cdot ((1.05)^{n+1} - 1)$.
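
The closed formula can be compared with a direct year-by-year simulation. The Python sketch below does this with floating-point arithmetic, so the comparison uses a tolerance; the function names balance_by_recurrence and balance_closed are hypothetical.

```python
import math


def balance_by_recurrence(n: int) -> float:
    """M_n from M_1 = 100 and M_k = 1.05 * M_(k-1) + 100 (illustrative helper)."""
    balance = 100.0
    for _ in range(n - 1):
        balance = 1.05 * balance + 100  # add 5% interest, then deposit $100
    return balance


def balance_closed(n: int) -> float:
    """M_n = 2000 * (1.05^n - 1), the closed formula from Theorem 4."""
    return 2000 * (1.05**n - 1)


if __name__ == "__main__":
    # Balance at the start of year 11, i.e. at the end of 10 years.
    print(round(balance_by_recurrence(11), 2), round(balance_closed(11), 2))
```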





Practice Problem 2 Every spring, a nearby lake is stocked with 5,000 catfish.
Due to fishing and environmental conditions, the population of catfish $n$ weeks after
the stock date is given by

    $p_n = p_{n-1} + 0.05 \cdot p_{n-1} - 500$

starting with the first week and continuing until there are no catfish left. Find a closed
formula for the number of catfish $n$ weeks after the stock date, and use your answer to
project the number of catfish there will be in the lake.


Second-Order Recurrence Relations 


A second-order recurrence relation is one where each term in a sequence of numbers
can be related to the two preceding terms in the sequence. The most famous example
of this is the sequence of Fibonacci numbers, defined by $F_1 = F_2 = 1$ with
$F_n = F_{n-1} + F_{n-2}$ for $n > 2$. The remainder of this section explores the general appearance
of closed formulas for this type of recursive description.


Example 9

1. Show that no matter what numbers are used for $C$ and $K$, the closed formula
   $a_n = C \cdot 2^n + K \cdot 3^n$ satisfies the recurrence relation

       $a_n = 5a_{n-1} - 6a_{n-2}$

2. Find appropriate values of $C$ and $K$ in the previous part to determine the closed
   formula for the following sequence with recurrence relation $a_n = 5a_{n-1} - 6a_{n-2}$:

       3, 7, 17, 43, 113, 307, ...


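SOLUTION

1. If $a_n = C \cdot 2^n + K \cdot 3^n$, then

       $a_{n-1} = C \cdot 2^{n-1} + K \cdot 3^{n-1}$   and   $a_{n-2} = C \cdot 2^{n-2} + K \cdot 3^{n-2}$

   and so

       $5a_{n-1} - 6a_{n-2} = 5(C \cdot 2^{n-1} + K \cdot 3^{n-1}) - 6(C \cdot 2^{n-2} + K \cdot 3^{n-2})$
                          $= (10C - 6C) \cdot 2^{n-2} + (15K - 6K) \cdot 3^{n-2}$
                          $= 4C \cdot 2^{n-2} + 9K \cdot 3^{n-2}$
                          $= C \cdot 2^n + K \cdot 3^n$
                          $= a_n$

2. Since $a_1 = 3$ and $a_1 = C \cdot 2 + K \cdot 3$, we know that $2C + 3K = 3$. Since $a_2 = 7$
   and $a_2 = C \cdot 4 + K \cdot 9$, we know that $4C + 9K = 7$. Combining these two
   equations tells us that $C = 1$ and $K = \frac{1}{3}$. Hence, the closed formula for this
   sequence is

       $a_n = 2^n + \frac{1}{3} \cdot 3^n$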
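
Finding the two constants amounts to solving a pair of linear equations, which can be sketched in Python with exact fractions. The helper name second_order_closed_form, its parameters, and the use of Cramer's rule are choices made for this illustration; the sketch assumes the two roots are distinct and nonzero and that the known terms are $a_1$ and $a_2$.

```python
from fractions import Fraction


def second_order_closed_form(r, s, a1, a2):
    """Return (C, K, formula) with formula(n) = C * r^n + K * s^n.

    Solves C*r + K*s = a1 and C*r^2 + K*s^2 = a2 by Cramer's rule.
    Illustrative helper; assumes r != s and both are nonzero.
    """
    r, s, a1, a2 = (Fraction(v) for v in (r, s, a1, a2))
    det = r * s**2 - s * r**2
    if det == 0:
        raise ValueError("roots must be distinct and nonzero")
    C = (a1 * s**2 - a2 * s) / det
    K = (r * a2 - r**2 * a1) / det
    return C, K, (lambda n: C * r**n + K * s**n)


if __name__ == "__main__":
    # Roots 2 and 3 of x^2 = 5x - 6, with a_1 = 3 and a_2 = 7.
    C, K, a = second_order_closed_form(2, 3, 3, 7)
    print(C, K, [a(n) for n in range(1, 7)])
```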
Theorem 5 If $r$ and $s$ are two different solutions to the equation $x^2 = cx + d$, then
no matter what numbers are used for $C$ and $K$, the closed formula $a_n = C \cdot r^n + K \cdot s^n$
satisfies the recurrence relation

    $a_n = c \cdot a_{n-1} + d \cdot a_{n-2}$

PROOF See Exercise 21 at the end of this section.


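Example 10 Find the closed formula for the Fibonacci numbers, which satisfy
$F_n = F_{n-1} + F_{n-2}$ with $F_1 = F_2 = 1$.


SOLUTION Since the quadratic equation $x^2 = x + 1$ has solutions (derived
from the quadratic formula)

    $x = \frac{1 + \sqrt{5}}{2}$   and   $x = \frac{1 - \sqrt{5}}{2}$

we know from Theorem 5 that a sequence with closed formula

    $F_n = C \cdot \left( \frac{1 + \sqrt{5}}{2} \right)^n + K \cdot \left( \frac{1 - \sqrt{5}}{2} \right)^n$

will satisfy the Fibonacci numbers' recurrence relation. In addition, the specific
values $F_1 = 1$ and $F_2 = 1$ give us the following two equations in terms of the
constants $C$ and $K$:

    $1 = C \cdot \left( \frac{1 + \sqrt{5}}{2} \right) + K \cdot \left( \frac{1 - \sqrt{5}}{2} \right)$
    $1 = C \cdot \left( \frac{1 + \sqrt{5}}{2} \right)^2 + K \cdot \left( \frac{1 - \sqrt{5}}{2} \right)^2$

The solution to this system of equations is $C = \frac{1}{\sqrt{5}}$ and $K = -\frac{1}{\sqrt{5}}$, so the Fibonacci
numbers have the closed formula

    $F_n = \frac{1}{\sqrt{5}} \left( \frac{1 + \sqrt{5}}{2} \right)^n - \frac{1}{\sqrt{5}} \left( \frac{1 - \sqrt{5}}{2} \right)^n$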
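
A closed formula involving $\sqrt{5}$ can look surprising for a sequence of whole numbers, so a numerical comparison is worthwhile. The following Python sketch evaluates the formula in floating point and rounds to the nearest integer; the function names are hypothetical, and the rounding step is an assumption that is only reliable for moderately small $n$ because of floating-point precision.

```python
from math import sqrt


def fib_by_recurrence(n: int) -> int:
    """F_n from F_1 = F_2 = 1 and F_n = F_(n-1) + F_(n-2) (illustrative helper)."""
    current, following = 1, 1
    for _ in range(n - 1):
        current, following = following, current + following
    return current


def fib_closed(n: int) -> float:
    """Evaluate the closed formula with the two roots of x^2 = x + 1."""
    root_plus = (1 + sqrt(5)) / 2
    root_minus = (1 - sqrt(5)) / 2
    return (root_plus**n - root_minus**n) / sqrt(5)


if __name__ == "__main__":
    print([fib_by_recurrence(n) for n in range(1, 11)])
    print([round(fib_closed(n)) for n in range(1, 11)])
```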
Practice Problem 3 Find the closed formula for the sequence 1, 5, 13, 41, 121, ...
satisfying the recurrence relation $a_n = 2a_{n-1} + 3a_{n-2}$.
Solutions to Practice Problems

1. Table 5-35 shows the table of differences for the sequence
   0, 1, 17, 98, 354, .... So we have

       $\sum_{k=1}^{n} k^4 = 0 + 1 \cdot C(n, 1) + 15 \cdot C(n, 2) + 50 \cdot C(n, 3) + 60 \cdot C(n, 4) + 24 \cdot C(n, 5)$

   With some work (or some help from a computer algebra
   system), this can be simplified to

       $\sum_{k=1}^{n} k^4 = \frac{n(n+1)(2n+1)(3n^2 + 3n - 1)}{30}$

       $n$             0     1     2     3     4      5      6
       $s_n$           0     1    17    98   354    979   2275
       $\Delta_n^1$    1    16    81   256   625   1296
       $\Delta_n^2$   15    65   175   369   671
       $\Delta_n^3$   50   110   194   302
       $\Delta_n^4$   60    84   108
       $\Delta_n^5$   24    24

   Table 5-35 Difference Table for Practice Problem 1

2. We can restate the recurrence relation for this sequence
   as

       $p_n = 1.05 \cdot p_{n-1} - 500$

   The first few values of this sequence are given in Table 5-36.
   According to Theorem 4 with $b = 1.05$ and $c = -500$,
   the closed formula for this sequence is

       $p_n = L \cdot (1.05)^n + \frac{-500}{1 - 1.05}$
           $= L \cdot (1.05)^n + 10{,}000$

   In order for $p_0 = 5{,}000$, we must have $L = -5{,}000$, so
   we have the closed formula

       $p_n = -5{,}000 \cdot (1.05)^n + 10{,}000$

   If we solve when this is equal to 0 (using logarithms), we
   find that $p_{14} \approx 100$ and $p_{15} \approx -395$, so the last time there
   are catfish in the lake is after 14 weeks of the season.

       $n$      0      1      2      3      4      5      6
       $p_n$   5000   5150   5208   5168   5026   4778   4416

   Table 5-36 Difference Table for Practice Problem 2

3. Since the quadratic equation $x^2 = 2x + 3$ has solutions
   $x = 3$ and $x = -1$, we know from Theorem 5 that a
   sequence with closed formula

       $a_n = C \cdot 3^n + K \cdot (-1)^n$

   will satisfy the given recurrence relation. In addition,
   the specific values $a_1 = 1$ and $a_2 = 5$ give us the
   following two equations in terms of the constants $C$ and $K$:

       $1 = C \cdot 3^1 + K \cdot (-1)^1$
       $5 = C \cdot 3^2 + K \cdot (-1)^2$

   The solution to this system of equations is $C = \frac{1}{2}$ and
   $K = \frac{1}{2}$, so this sequence has the closed formula

       $a_n = \frac{1}{2} \cdot 3^n + \frac{1}{2} \cdot (-1)^n$


Exercises for Section 5.6


1. Find closed formulas for each of the following sequences:
   (a) 3, 10, 17, 24, 31, ...
   (b) 7, 13, 25, 43, 67, ...
   (c) 1, 3, 8, 16, 27, 41, ...
   (d) 1, 10, 35, 84, 165, 286, ...

2. Find closed formulas for each of the following sequences:
   ... $s_n = s_{n-1} + 2n$ for all $n \ge 1$
   ... $s_n = s_{n-1} + 3n - 5$ for all $n \ge 1$
   (d) $s_0 = 7$, $s_n = s_{n-1}$ ... for all $n \ge 1$

3. Find a closed formula for each of the following sums:
   (a) $\sum (4k - 3)$
   (b) $\sum (\ldots k + 1)$
   (c) $\sum (\ldots + 3k - 1)$
   (d) $\sum \ldots$

4. In a certain round robin basketball tournament, every
   team must play every other team exactly twice. Let $g_n$
   be the number of games that must be played if $n$ teams
   participate in the tournament.

   (a) By imagining what happens to a tournament with
       $n - 1$ teams if it is joined by one more team, find a
       recursive description for the sequence $\{g_n\}$.

   (b) Use Theorem 2 to determine a closed formula for
       $g_n$.


5. There is a slightly shorter way to state Theorem 2 for a
   sequence $\{s_n\}$ with constant $k$th differences. For all $n \ge 0$,

       $s_n = \sum_{i=0}^{k} \Delta_0^i \cdot C(n, i)$

   How must we interpret the notation $\Delta_0^0$ for this to be
   consistent with the theorem?
6. Find closed formulas for each of the sequences whose
   recursive description is given.












(2) aj =2, dy 22, +3 for all n> 1 
(b) a; = 1,2, = 34,., +2 for alln > 1 
(e) ai = 0,44 =$ — 22, foralln > 1 
(@) a —3,2, 73 ayy for all n> 1 


7. Suppose your house mortgage (the amount you owe
   on your house) grows at a rate of 0.5% per month.
   You originally borrow $100,000 for the house, and every
   month you make a payment of $1,000. How many
   months will it take before you owe nothing on the
   house?

& A bacteria culture grows at a rate of 10% per day. 

(a) If this morning the culture has 1,000,000 bacteria,
    how many days will it take for this number to double?

(b) Suppose at the beginning of each day (excluding
    this morning), you remove a sample of 50,000 bacteria
    for testing. Now how many days will it take for
    the original 1,000,000 bacteria to double?

(c) Suppose at the beginning of each day (excluding this
    morning), you remove a sample of 200,000 bacteria
    for testing. What happens under this scenario?

9. Find a recursive description for the sequence in Example 5,
   and use Theorem 4 to derive the closed formula
   for the sequence.

10. We saw in Practice Problem 3 of Section 5.5 that the
    recurrence $a_n = 2 \cdot a_{n-1} + 1$ (with $a_0 = 0$) is satisfied by
    the closed formula $a_n = 2^n - 1$. If instead we are given
    the condition $a_0 = k$, what is the closed formula for $a_n$?

11. If $a_n = b \cdot a_{n-1} + c$, with $b \ne 1$, and we are given $a_0 = 1$,
    what is the value of the constant $L$ in Theorem 4 in
    terms of $b$ and $c$?

12. Prove Theorem 1 by induction on $n$.

13. For any sequence $\{s_n\}$ whose first differences are the
    constant $K$, show by induction on $n$ that $s_n = s_0 + Kn$
    for all $n \ge 0$.

14. Prove by induction on $n \ge 1$ that if $p(x)$ is an $n$th degree
    polynomial, then the $n$th differences for the sequence
    $p(0), p(1), p(2), \ldots$ are constant.

15. Use the procedure in Example 6 to find a closed formula
    for each of the following sequences:

(2) ay = 1,2, = 21 +m forall n> 1 

(b) a, = 1,4 = 3aq-1 2n 1 for alln > 1 

(6) ay = 1, dy = 22, f for. 

16. Prove Theorem 4 by applying the method of difference
    equations to the generic recurrence relation of the form

        $a_n = b \cdot a_{n-1} + c$

    for constants $b \ne 1$ and $c$.
17. For each of the following second-order recurrence re- 
lations, find a general closed formula (using constants 





C and K as in Theorem 5) for a sequence that satisfies 





18. For each of the recurrence relations in Exercise 17, find 
the closed formula for the sequence satisfying that re- 
‘currence along with the initial conditions ay = 0 and 
a =1. 

19. For each of the following second-order recurrence relations,
    find a general closed formula (using constants
    C and K as in Theorem 5) for a sequence that satisfies








(Nore: This problem requires 
the use of complex numbers.) 
42,—2(4.,—4,5) (Nore: This problem re- 
quires the use of complex numbers.) 

20, For each of the recurrence relations above, find the 
closed formula for the sequence satisfying that recur- 
rence along with the initial conditions ay = Landay = 1. 

21. Prove Theorem 5.

22. In Theorem 5, there is an assumption that the quadratic
    equation $x^2 = cx + d$ has two different roots. If this is
    not the case, then the closed formulas will have a different
    form. The following proposition is to be proved in
    Exercise 24.

    Proposition. If $x^2 = cx + d$ has only one real solution
    $r$, then no matter what numbers are used for $C$ and $K$,
    the closed formula $a_n = (C + K \cdot n) r^n$ satisfies the
    recurrence relation

        $a_n = c \cdot a_{n-1} + d \cdot a_{n-2}$

    Use this proposition to find a general closed formula for
    a sequence satisfying each of the following recurrence
    relations:

    (a) $a_n = 4a_{n-1} - 4a_{n-2}$
    (b) $a_n = -6a_{n-1} - 9a_{n-2}$
    (c) $a_n = 8a_{n-1} - 16a_{n-2}$


23. For each of the recurrence relations above, find the
closed formula for the sequence satisfying that recur- 
rence along with the initial conditions a) = Landa, = 1. 

24, Prove the proposition in Exercise 22. 





Chapter 5 Summary 







5.1 Introduction


Terms and concepts


• You should recognize these terms and their relationship
  to the questions concerning ordering and repetition:
  - Ordered list: order matters, repetitions are allowed
  - Unordered list or bag: order does not matter, repetitions
    are allowed
  - Permutation: order matters, repetitions are not allowed
  - Set: order does not matter, repetitions are not allowed
• You should recognize that:
  - $P(n, r)$ counts the $r$-element permutations drawn from
    a set of size $n$.
  - $C(n, r)$ counts the $r$-element subsets of a set of size $n$
    (also referred to as $r$-combinations from the set).
• You should realize that some textbooks, calculators, and
  instructors use alternative notation for $P(n, r)$ and for
  $C(n, r)$.
• You should know that combinatorial equivalence refers
  to the use of a 1-1 correspondence to demonstrate that
  two counting problems have the same solution.





Strategies for counting 


• You should understand the utility of skillful organization
  as an aid in counting.

• You should be able to apply a variety of organizational
  schemes in various applications including at least the
  following:
  - Tables where each row/column is completely filled
  - Tables where some rows/columns have more entries
    than others
  - Trees

• As specific examples of this organization, you should
  thoroughly understand the table used to count outcomes
  of rolling two dice, and the tree used to count outcomes
  of multiple coin tosses.

• You should be able to demonstrate that two sets are the
  same size, without knowing the size of either set, by
  describing a 1-1 correspondence between the two sets (that
  is, by using combinatorial equivalence).


5.2 Basic Rules for Counting 
Terms and concepts 


• You should be able to state and apply these rules:

  - The rule of products (or product rule): in set symbols,
    $n(A \times B) = n(A) \cdot n(B)$

  - The rule of sums (or sum rule): in set symbols,
    $n(A \cup B) = n(A) + n(B)$, provided $A \cap B$ is empty

  - The rule of sums with overlap: in set symbols,
    $n(A \cup B) = n(A) + n(B) - n(A \cap B)$

  - The rule of complements (or complement rule): in set
    symbols, $n(A') = n(U) - n(A)$ where $U$ is the universal
    set

• You should recall that a permutation is a list in which
  order matters and repetitions are not allowed, and that
  $P(n, r)$ counts the $r$-element permutations drawn from
  a set of size $n$.

• You should recognize that $n!$ ("$n$ factorial") is the same
  as $P(n, n)$.








Formulas

• $n! = (n)(n - 1) \cdots (2)(1)$ and $0! = 1$
• $P(n, r) = n \cdot (n - 1) \cdot (n - 2) \cdots (n - r + 1)$
• $P(n, r) = \frac{n!}{(n - r)!}$, provided that $n \ge r$
• The number of ordered lists of length $r$ (repetitions
  allowed), drawn from a set of size $n$, is $n^r$





• Concerning the rule of products:

  - You should be able to apply the rule to situations where
    the list to be counted can be considered to be
    constructed in multiple steps. (We described algorithms
    that carried out these constructions.)

  - It is necessary that the number of ways to complete
    each step is independent of how the earlier steps were
    completed.

• Concerning the rule of sums:

  - You should be able to apply the rule to situations where
    the list to be counted can be split into pieces.

  - It is necessary that every pair of pieces must be disjoint.

  - In the case of two pieces, the inclusion-exclusion
    principle allows us to apply the rule by subtracting the
    overlap of the two pieces. (There are corresponding
    methods for three or more pieces, but they are more
    complex.)

• Concerning the rule of complements:

  - You should recognize that it is frequently easier to
    count the complement of a set.

  - Once you have counted the complement, you should
    be able to apply the complement rule to solve the
    original problem.

• You should be able to solve problems involving
  combinations of these three rules.

• You should realize that the formulas for counting
  ordered lists and permutations come from applying the
  product rule. Thus, many problems have alternative
  solutions:

  - as an application of the product rule; or

  - as an application of ordered lists or permutations


5.3 Combinations and the Binomial 
Theorem 

Terms and concepts 

• You should recall that C(n, r) counts the r-element
subsets drawn from a set of size n, and that an r-element
subset is also called an r-combination.

• You should know that the binomial theorem tells how to
expand $(1 + x)^n$, and that the result can be phrased in
terms of C(n, r).

• You should know that the arithmetic triangle (frequently
called Pascal's triangle) provides a visual tool for
constructing the coefficients in the binomial theorem.


Formulas

• $C(n, r) = \frac{P(n, r)}{r!}$
• $C(n, r) = \frac{n!}{r!\,(n-r)!}$, provided that $n \ge r$

• The binomial theorem:
$$(1 + x)^n = C(n, 0) + C(n, 1)x + C(n, 2)x^2 + \cdots + C(n, n)x^n = \sum_{k=0}^{n} C(n, k)\,x^k$$
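
As a quick check of these formulas, the following sketch expands (1 + x)^n by repeated multiplication and compares the coefficients with C(n, r). The helper name expand_one_plus_x and the choice n = 6 are illustrative assumptions, not part of the text.

```python
from math import comb, factorial, perm


def expand_one_plus_x(n):
    """Coefficients of (1 + x)^n, lowest power first, by repeated multiplication."""
    coeffs = [1]
    for _ in range(n):
        # Multiplying by (1 + x) adds each coefficient to its left neighbor,
        # which is the same rule that builds the arithmetic triangle.
        coeffs = [a + b for a, b in zip(coeffs + [0], [0] + coeffs)]
    return coeffs


n = 6  # hypothetical exponent chosen for illustration
coeffs = expand_one_plus_x(n)

# The coefficient of x^k is C(n, k).
assert coeffs == [comb(n, k) for k in range(n + 1)]

# Both formulas for C(n, r) agree.
for r in range(n + 1):
    by_permutations = perm(n, r) // factorial(r)
    by_factorials = factorial(n) // (factorial(r) * factorial(n - r))
    assert comb(n, r) == by_permutations == by_factorials

print(coeffs)  # [1, 6, 15, 20, 15, 6, 1]
```

The list printed is row 6 of the arithmetic triangle.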


Counting with equivalence classes

• You should understand the use of equivalence classes to
derive the formula for C(n, r).

• You should be able to solve problems by taking a known
set and splitting it into equivalence classes each of the
same size (for example, counting circular arrangements).


Using C(n, r) to count

• You should recognize that many different types of
problems can be interpreted as counting r-element subsets of
a set with n elements.

• You should be able to combine the rule for counting
subsets with the sum, product, and complement rules from
the previous section.


The binomial theorem

• You should be able to use the theorem to determine the
coefficient of $x^k$ in various expansions.

• You should be able to create the first several rows of the
arithmetic triangle, and relate it to the binomial theorem.

• You should be able to explore various patterns in the
arithmetic triangle, using tools such as induction and
substituting particular values for x in the binomial theorem.





5.4 Binary Sequences 

Terms and concepts 

• You should recall that a binary sequence is an ordered
list of 1s and 0s.

• You should recall that an unordered list with repetitions
allowed is also called a bag.


Counting binary sequences

• You should be able to use r-combinations to count
binary sequences of length n with r zeros.

• You should be able to generalize this algorithm to count
sequences drawn from sets other than {0, 1}: for example,
ordered lists of length 10 drawn from {m, a, t}
containing exactly 3 m's and 2 a's.

• You should be able to use this method to count
rearrangements for sets containing repetitions (for
example, the distinct arrangements of the letters in the word
MISSISSIPPI).


Unordered lists with repetitions (bags)

• You should be able to relate these problem types, using
1-1 correspondences:
- Counting unordered lists with repetitions
- Counting non-negative integer solutions to equations
such as a + b + c = 10.

• You should be able to use binary sequences to solve both
these types of problems.

• You should be able to generalize the algorithm to other
situations (for example, adding the restriction that a and
b must be positive and c must be at least 3).
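
The equation-counting problems just mentioned can be checked by brute force. This is a minimal sketch; the function names are hypothetical, and the second function assumes the usual approach of first giving each variable its required minimum and then counting bags for what is left.

```python
from itertools import product
from math import comb


def count_by_listing(total, minimums):
    """Count integer solutions x1 + ... + xk = total with xi >= minimums[i]
    by trying every candidate list."""
    ranges = [range(m, total + 1) for m in minimums]
    return sum(1 for xs in product(*ranges) if sum(xs) == total)


def count_by_formula(total, minimums):
    """Same count using the bag formula C(r + n - 1, r), where r is what
    remains after each of the n variables receives its minimum."""
    n = len(minimums)
    r = total - sum(minimums)
    return comb(r + n - 1, r) if r >= 0 else 0


# a + b + c = 10 with a, b, c non-negative
unrestricted = count_by_listing(10, [0, 0, 0])
assert unrestricted == count_by_formula(10, [0, 0, 0])

# a + b + c = 10 with a, b positive and c at least 3
restricted = count_by_listing(10, [1, 1, 3])
assert restricted == count_by_formula(10, [1, 1, 3])

print(unrestricted, restricted)  # 66 21
```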


Formulas 


• You now have formulas for each of the four types of list:

What?                          How many?
Ordered lists of length r      n^r
Permutations of length r       P(n, r)
Unordered lists of size r      C(r + n - 1, r)
Sets of size r                 C(n, r)

However, more important than memorizing these formulas
is understanding, and being able to apply, the
reasoning that leads to the formulas.
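
Each of the four list types corresponds to a function in Python's itertools module, so the table can be checked by actually listing the objects. The four-letter set below is a hypothetical example; this is a sketch for small n and r only, since the lists grow quickly.

```python
from itertools import (combinations, combinations_with_replacement,
                       permutations, product)
from math import comb, perm

items = "abcd"            # hypothetical set of size n = 4
n, r = len(items), 2

ordered_lists = list(product(items, repeat=r))                # order matters, repeats allowed
perms = list(permutations(items, r))                          # order matters, no repeats
bags = list(combinations_with_replacement(items, r))          # unordered, repeats allowed
subsets = list(combinations(items, r))                        # unordered, no repeats

assert len(ordered_lists) == n ** r             # 16
assert len(perms) == perm(n, r)                 # 12
assert len(bags) == comb(r + n - 1, r)          # 10
assert len(subsets) == comb(n, r)               # 6
```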


5.5 Recursive Counting 
Terms, concepts, and skills 


• You should recall that a recursive formula defines each
term of a sequence using an expression involving one or
more earlier terms of the sequence. You should also
recall that one or more initial terms of the sequence must
be given as numbers.
• When we use recursive formulas to solve counting
problems, we refer to the result as a recursive model of the
situation.
• You should be able to create a recursive model for a
variety of counting problems. Each such recursive model
involves two components:
- Determine the count for one or more small versions
of the problem.
- For the general version of the problem, imagine that
all smaller versions of the problem have been solved.
Use the solutions for the smaller versions to develop
a recursive formula for the general situation.


5.6 Excursion: Solving Recurrence 
Relations 


Terms and concepts 


• You should know that the terminology recurrence
relation refers to recursively defined sequences, and that
solving a recurrence relation refers to finding a closed
formula for the sequence.

• You should be able to build a difference table for a
recursively defined sequence, or for a sequence whose first
few terms are given explicitly.

• You should understand the related concepts: first
differences, second differences, third differences, and so on.

• You should know that a first order recurrence relation
defines each item of the sequence in terms of the previous
item. Similarly, for a second order recurrence relation,
each term in the sequence can be related to the two
preceding terms in the sequence.





Solving recurrence relations

• You should be able to use one or more of these
techniques to solve particular recurrence relations:
- Apply Theorem 1 when the first differences are constant.
- Apply Theorem 2 whenever the difference table
eventually has a constant row.
- Apply Theorem 3 when the first differences have the
form $\Delta_n = c \cdot d^n$.
- Apply Theorem 4 to first order recurrence relations of
the form $a_n = b \cdot a_{n-1} + c$, with $b \ne 1$.
- Apply Theorem 5 to second order recurrence relations
of the form $a_n = c \cdot a_{n-1} + d \cdot a_{n-2}$ when $x^2 = cx + d$
has two different solutions.
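
Since several of these techniques start from a difference table, here is a minimal sketch of building one. The function names and the sample recurrence (a_1 = 1, a_n = a_{n-1} + 2n - 1) are hypothetical choices for illustration; the theorems themselves are not restated in this summary, so the code only builds the table.

```python
def recursive_terms(first, step, count):
    """Return the first `count` terms, where step(n, previous) gives term n."""
    terms = [first]
    for n in range(2, count + 1):
        terms.append(step(n, terms[-1]))
    return terms


def difference_table(terms):
    """Rows of successive differences, stopping at a constant (or one-entry) row."""
    rows = [list(terms)]
    while len(rows[-1]) > 1 and len(set(rows[-1])) > 1:
        prev = rows[-1]
        rows.append([b - a for a, b in zip(prev, prev[1:])])
    return rows


# Hypothetical example: a_1 = 1 and a_n = a_(n-1) + 2n - 1
sequence = recursive_terms(1, lambda n, prev: prev + 2 * n - 1, 6)
table = difference_table(sequence)
for row in table:
    print(row)
# [1, 4, 9, 16, 25, 36]
# [3, 5, 7, 9, 11]
# [2, 2, 2, 2]
```

The second differences are constant, which points to a quadratic closed formula; for this sample sequence it is a_n = n^2.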














Probability 


Probability is an important topic in the study of games. Most games people 
play involve some element of chance. Many times the element of chance is 
entirely based on some simple random device like a pair of dice or a deck of 
cards, while the game itself consists of countless uses of the simple device 
along with many decisions and hidden strategies. The issues of decisions 
and strategies are the subject of what is formally known as game theory
in mathematics. In this chapter, we strive to understand how probabilities 
connected with some large event (like a game) are related to those connected 
with the simple events (like a dice roll) that drive them. 

We will start off by simply seeing how the ideas from combinatorics are 
related to questions of probability. We will see that the standard rules in 
counting and even the recursive model of thinking readily spill over into the 
study of probability. In some sense, this chapter will reinforce those skills of 
organized algorithmic and recursive thinking that we had built in the previous 
chapter. When things begin to get complicated, we will find that matrices are 
helpful for keeping track of an abundance of information, so we introduce 
matrix arithmetic and see what kind of problems matrices can help us solve. 


6.1 Introduction





In An Essay on Probability, Augustus De Morgan (1806-1871) wrote, "I consider
probability as meaning the state of mind with respect to an assertion, a coming
event, or any other matter on which absolute knowledge does not exist." This will be
our starting point as we seek the origins of the mathematics of probability.

In the year 1654, 12 years after the birth of Isaac Newton and the death of
Galileo Galilei, a French gambler named Chevalier De Méré (1607-1684) had a
problem. It concerned a fair method of splitting a "pot" in a game of chance that is
interrupted partway through. The heart of the matter can be illustrated by comparing
the following two questions:


1. What is the probability that four rolls of a single die result in at least one roll of
6?

2. What is the probability that 24 rolls of a pair of dice result in at least one roll of
"double 6"?


De Méré would have reasoned that in the first problem there are six possible
outcomes for each roll and we are performing 4 = (2/3)(6) rolls, while in the
second problem there are 36 possible outcomes for each roll and we are performing
24 = (2/3)(36) rolls. Since in both cases we perform 2/3 as many rolls as there are
total outcomes per roll, the answer should be the same. The problem facing De Méré
was that he had observed in practice that the first probability was greater than 1/2, while
the second was less than 1/2. His conclusion was that there must be a basic
inconsistency in mathematics. He communicated the problem to his friend, the prominent
mathematician and infamous neurotic, Blaise Pascal (1623-1662). Pascal wrote to
the great amateur mathematician Pierre Fermat (1601-1665) about the question,
initiating a series of letters between Pascal and Fermat that effectively resulted in the
birth of the new mathematical field of probability.*

You have surely already guessed that De Méré had discovered no basic
inconsistency in mathematics. As we shall see in this chapter, the answer to the first question
is about 0.52, while the answer to the second one is roughly 0.49. The fact that De
Méré was able to "observe" this difference in practice explains why he made his
living gambling. The fact that he did not understand it perhaps explains why he did
not make his living doing mathematics.
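
The two values quoted above can be confirmed by counting. The sketch below treats each full sequence of rolls as one equally likely outcome and uses the product and complement rules from Chapter 5 to count the sequences that contain the target result at least once. The helper name at_least_once is an illustrative choice, not something from the text.

```python
from fractions import Fraction


def at_least_once(outcomes_per_roll, rolls):
    """Probability that one particular result shows up at least once
    in a sequence of `rolls` rolls, each with `outcomes_per_roll` results."""
    total = outcomes_per_roll ** rolls              # all sequences of rolls
    never = (outcomes_per_roll - 1) ** rolls        # sequences avoiding the result
    return Fraction(total - never, total)


p_single_six = at_least_once(6, 4)      # at least one 6 in four rolls of one die
p_double_six = at_least_once(36, 24)    # at least one double 6 in 24 rolls of a pair

print(round(float(p_single_six), 4))  # 0.5177
print(round(float(p_double_six), 4))  # 0.4914
```

The two probabilities sit on opposite sides of 1/2, which is the difference De Méré noticed at the gaming table.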

The first thing to do is to establish the language of probability and some of
the basic facts that we will use. In probability, we are interested in the outcomes
of certain experiments. By experiment we can mean anything from rolling dice to
buying a lottery ticket to, well, performing a laboratory experiment. We will assume
at first that we only perform experiments under which every possible outcome is
equally likely. We will see later some easy ways to deal with situations where the
outcomes are not equally likely, but in general this is one of the complexities that
makes the study of probability interesting. We will refer to the set of outcomes of a
given experiment as its sample space.

Defining what is meant by "equally likely" is a rather sticky point primarily
appealing to one's intuition. We might say that two outcomes are equally likely if a
rational person would not favor one outcome over the other in many repetitions of
the experiment. For example, if the experiment is the toss of a fair coin, then there
is no reason to bet that heads will come up more often than tails in arbitrarily many
tosses, so we will consider the outcomes of heads and tails to be equally likely in this
experiment. On the other hand, if the experiment is the roll of a fair (six-sided) die
in a game, then one might argue that there are only two outcomes: either I get a 6
or I do not get a 6. No one who has played games with dice would believe that these
two outcomes are equally likely, however.

* Actually, there was an Italian mathematician named Gerolamo Cardano (1501-1576) who published a
gambler's handbook, the Liber de Ludo Aleae, in which many of the basic problems of probability were
discussed. For some reason, though, he is only mentioned in footnotes when this subject is introduced.





Example 1 Suppose our experiment is to roll a pair of (six-sided) dice and
record the sum of the face-up numbers we see. How should we represent rolls of a pair
of dice so that the outcomes are equally likely?


SOLUTION There are three reasonable ways to represent the result of rolling
two dice.


• We could describe all outcomes by giving the set of all possible sums:
{2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12}

In this case, there are 11 possible outcomes, but anyone who has played
dice games will tell you that getting a sum of 2 is not as likely as getting a
sum of 7.

• We could describe all outcomes by giving the set of all possible rolls as
follows:

double 1, double 2, double 3, double 4, double 5, double 6,
{1, 2}, {1, 3}, {1, 4}, {1, 5}, {1, 6}, {2, 3}, {2, 4}, {2, 5}, {2, 6},
{3, 4}, {3, 5}, {3, 6}, {4, 5}, {4, 6}, {5, 6}

It is more subtle, but a fact, that in this list, the outcome {1, 2} is twice as
likely to occur as the outcome "double 1." To see this, consider the dice as
being colored, one green and one red. In this case, getting "red 1, green 2"
and "red 2, green 1" would both be considered the outcome {1, 2}, while
only the roll "red 1, green 1" would be considered the outcome "double 1."

• Using the above analogy, we could describe all outcomes by describing the
rolls as ordered pairs, with the first number representing the red die and
the second number representing the green die. Only in this representation
are the outcomes equally likely. The outcomes are shown in Table 6-1.

















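       | Green 1  Green 2  Green 3  Green 4  Green 5  Green 6
Red 1  | (1, 1)   (1, 2)   (1, 3)   (1, 4)   (1, 5)   (1, 6)
Red 2  | (2, 1)   (2, 2)   (2, 3)   (2, 4)   (2, 5)   (2, 6)
Red 3  | (3, 1)   (3, 2)   (3, 3)   (3, 4)   (3, 5)   (3, 6)
Red 4  | (4, 1)   (4, 2)   (4, 3)   (4, 4)   (4, 5)   (4, 6)
Red 5  | (5, 1)   (5, 2)   (5, 3)   (5, 4)   (5, 5)   (5, 6)
Red 6  | (6, 1)   (6, 2)   (6, 3)   (6, 4)   (6, 5)   (6, 6)


Table 6-1. Equally Likely Outcomes for Example 1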
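
The comparison of the three representations can be made concrete by generating the 36 ordered pairs and tallying how many of them fall into each sum and each unordered roll. This is a small illustrative sketch; the variable names are our own.

```python
from collections import Counter
from itertools import product

faces = range(1, 7)

# Third representation: ordered pairs (red, green), 36 in all.
sample_space = list(product(faces, repeat=2))

# First representation: how many ordered pairs give each sum?
sum_counts = Counter(red + green for red, green in sample_space)

# Second representation: how many ordered pairs give each unordered roll?
unordered_counts = Counter(frozenset(pair) for pair in sample_space)

print(len(sample_space))                    # 36
print(sum_counts[2], sum_counts[7])         # 1 6
print(unordered_counts[frozenset({1, 2})])  # 2
print(unordered_counts[frozenset({1})])     # 1  ("double 1")
```

A sum of 7 arises from six ordered pairs but a sum of 2 from only one, and the roll {1, 2} arises from two ordered pairs but "double 1" from only one, so neither of the first two representations has equally likely outcomes.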










With any particular experiment we will associate a set S of outcomes that will be 
called the sample space. For an experiment, it will interest us to know the proportion 
of outcomes in which a certain event takes place—we will call such outcomes success- 
ful. Technically, an event is defined to be any set of an experiment’s outcomes. For 
example, having a winning lottery ticket is an event in the experiment of buying a 
lottery ticket. Of course, having a losing ticket is an event, too, even though it might
feel weird to call this outcome "successful." We can now define what we mean by 
probability. 





Definition Given an experiment with a sample space S of equally likely
outcomes and an event E, the probability of the event (denoted by Prob(E)) is
the ratio of the number of successful outcomes to the total number of outcomes.
That is,

$$\text{Prob}(E) = \frac{n(E)}{n(S)}$$








Practice Problem 1 How many of the 36 equally likely outcomes listed in Example
1 have a sum of 10? Using the definition above, what is the probability of getting a sum
of 10 when a pair of dice is rolled?


Example 2 Consider the experiment of drawing two cards from the top of a
standard deck of 52 cards, and the event E of the two cards having the same value.


1. Describe the set S of all outcomes, represented so they are equally likely.
2. Describe the event E in terms of your representation.
3. Compute Prob(E) = n(E)/n(S).


SOLUTION


1. We can describe the sample space S as the set of all permutations of length
2 with entries from the deck of 52 cards. There are (52)(51) of these.

2. The outcomes in E can be formed with a two-step process: Choose any card
for the first card and then choose any card of the same value for the second
card. There are (52)(3) of these.

3. Now the probability calculation is easy:

$$\text{Prob}(E) = \frac{n(E)}{n(S)} = \frac{(52)(3)}{(52)(51)} = \frac{1}{17}$$
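
The sample space in Example 2 is small enough to list in full, so the counts (52)(51) and (52)(3) can be verified directly. The sketch below is one way to do it; the value and suit labels are arbitrary names chosen for illustration.

```python
from fractions import Fraction
from itertools import permutations, product

values = ["A", "2", "3", "4", "5", "6", "7", "8", "9", "10", "J", "Q", "K"]
suits = ["clubs", "diamonds", "hearts", "spades"]
deck = list(product(values, suits))              # 52 (value, suit) cards

# Sample space S: permutations of length 2 drawn from the deck.
sample_space = list(permutations(deck, 2))

# Event E: the two cards have the same value.
event = [(first, second) for first, second in sample_space
         if first[0] == second[0]]

prob_same_value = Fraction(len(event), len(sample_space))

print(len(sample_space), len(event))  # 2652 156
print(prob_same_value)                # 1/17
```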







Practice Problem 2 Consider the experiment of tossing a coin five successive 
times, and the event E that the last two tosses have the same result. 


1. Describe the set of all outcomes, represented so they are equally likely. 
2. Describe the event E in terms of your representation. 
3. Compute Prob(E) = n(E)/n(S).





Sometimes in the study of probability, it is easier to find the probability that
a given event does not happen. For this reason, we define the complement of an
event E as those outcomes in the sample space that are not in E. This definition is
consistent with the Chapter 3 notion of a set's complement relative to an understood
universe. In the context of probability, it is traditional to use the notation $\overline{E}$ for the
complement of E.








Example 3 What is the probability that for a six-sided die rolled three times
the same result comes up more than once?


SOLUTION We will describe the outcomes in the sample space S as lists of
length 3 with entries from {1, 2, 3, 4, 5, 6}. If E denotes the set of outcomes that
use the same number more than once, then the complement $\overline{E}$ is the set of those
outcomes that use all different numbers. That is, $\overline{E}$ is the set of permutations of
length 3 with entries from {1, 2, 3, 4, 5, 6}, so in this case, it is easy to compute
$n(\overline{E}) = P(6, 3) = 120$. Since $n(S) = 6^3 = 216$, we have

$$\text{Prob}(E) = \frac{n(E)}{n(S)} = \frac{n(S) - n(\overline{E})}{n(S)} = \frac{216 - 120}{216} = \frac{4}{9}$$





This solution gives us the general relationship between the probabilities of an
event and its complement.


Proposition 1 Given an event E,
$$\text{Prob}(E) + \text{Prob}(\overline{E}) = 1$$


This means that once we know one of these quantities, we can easily find the
other.


Example 4 What is the probability that in a group of six people two will have
birthdays in the same month, assuming that all months are equally likely?


SOLUTION We will let an outcome be represented as an ordered list of
length 6 from {1, ..., 12} (for the 12 months). For example, if the first person
has a birthday in January, the second person in February, and everyone else
in August, we will represent this as 1, 2, 8, 8, 8, 8. The question asks for the
probability of the event E, which is the set of all outcomes in which some
number occurs more than once. To illustrate Proposition 1, we will instead find
the probability of the complementary event $\overline{E}$, which is the set of all outcomes
in which no number occurs twice.

There are a total of $12^6 = 2{,}985{,}984$ outcomes, which we assume are equally
likely. An outcome is in $\overline{E}$ if it does not contain a repeated number, which
actually means that the outcome is a permutation of length 6 from {1, ..., 12}.
Hence, there are P(12, 6) = 665,280 outcomes in $\overline{E}$, so we have

$$\text{Prob}(\overline{E}) = \frac{P(12, 6)}{12^6} = \frac{665{,}280}{2{,}985{,}984} \approx 0.22$$

We conclude that $\text{Prob}(E) = 1 - \text{Prob}(\overline{E}) \approx 0.78$.
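
Examples 3 and 4 follow the same pattern: count the permutations, divide by the number of ordered lists, and subtract from 1. The following sketch packages that pattern as a function; the name prob_some_repeat is a hypothetical label of our own.

```python
from fractions import Fraction
from math import perm


def prob_some_repeat(n, r):
    """Probability that an ordered list of length r, drawn from n equally
    likely values, uses some value more than once (via Proposition 1)."""
    prob_all_different = Fraction(perm(n, r), n ** r)   # P(n, r) / n^r
    return 1 - prob_all_different


three_rolls = prob_some_repeat(6, 3)      # Example 3
six_people = prob_some_repeat(12, 6)      # Example 4

print(three_rolls)                  # 4/9
print(round(float(six_people), 4))  # 0.7772
```

The same function applies to any situation in which r independent choices are made from n equally likely values.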





Practice Problem 3 If everyone in your class writes down his or her favorite
two-digit number, what is the probability that two people will have written down the same
number?


We conclude this section with some experiments involving cards that we will
investigate further in the exercises.


Example 5 We have a packet of three cards in which one is blue on both sides,
one is red on both sides, and one is red on one side and blue on the other. One of the three
cards is chosen and placed on the table in full view of both of us. We now wager on the
color of the face-down side of the card.

You will give me $5 if the face-down side is the same color as the (visible) face-up
side, and I will give you $6 otherwise. For example, if the chosen card has a red side
up, then the face-down side is either red or blue, and you win 50% of the time. So you
should definitely play this game, right?











SOLUTION Label the faces of the cards (first card) $B_1$ and $B_2$, (second card)
$R_1$ and $R_2$, and (third card) $R_3$ and $B_3$. The six possible (equally likely)
outcomes of the "experiment" of drawing a card and placing it on the table are
$(B_1, B_2)$, $(B_2, B_1)$, $(R_1, R_2)$, $(R_2, R_1)$, $(B_3, R_3)$, and $(R_3, B_3)$, where in each pair
the first entry denotes the face that is up and the second entry denotes the face
that is down. In only the last two of these would you win, giving you an actual
probability of 1/3 of winning.
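
The six outcomes can also be generated mechanically, which makes it easy to see where the "50%" intuition goes wrong. In this sketch each card is written as a pair of face colors, and each card contributes two outcomes (either face may land up); the variable names are illustrative.

```python
from fractions import Fraction

# Each card is a pair of face colors.
cards = [("B", "B"), ("R", "R"), ("R", "B")]

# Each outcome is (face up, face down); every card can land either way.
outcomes = []
for side_a, side_b in cards:
    outcomes.append((side_a, side_b))
    outcomes.append((side_b, side_a))

# You win when the face-down color differs from the face-up color.
you_win = [o for o in outcomes if o[0] != o[1]]
p_you_win = Fraction(len(you_win), len(outcomes))

# Restricting attention to outcomes with red showing does not help.
red_up = [o for o in outcomes if o[0] == "R"]
red_up_blue_down = [o for o in red_up if o[1] == "B"]

print(p_you_win)                          # 1/3
print(len(red_up), len(red_up_blue_down)) # 3 1
```

Of the three equally likely outcomes with red showing, two come from the red-red card, so the hidden face is blue only one time in three.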
















Example 6 Try playing the following game of solitaire with a normal deck of
52 cards. Shuffle the deck and then go through it removing any two adjacent cards that
are the same color (red or black). Do this until there are no instances of adjacent cards
of the same color in the deck. If you exhaust the deck, you are a winner. What is the
probability of winning?


SOLUTION Since the values of the cards do not matter, we will think of
a deck as having just 26 indistinguishable red cards and 26 indistinguishable
black cards. So the total number of arrangements of the deck is C(52, 26) (like
a binary sequence with twenty-six 1's and twenty-six 0's). An arrangement
that results in a win seems very hard to analyze, but there is actually a simple
argument. It is left to you to verify that the following procedure always results
in an arrangement that will win.

Take any arrangement of 13 red and 13 black cards (there are C(26, 13)
of these) and any arrangement of the other 13 reds and 13 blacks. Now weave
together these two arrangements, alternating taking a card from the first
arrangement and a card from the second arrangement. Believe it or not, this
will always result in a winning arrangement and there are no other winning
arrangements except for those you get this way!

Therefore, there are C(26, 13)C(26, 13) winning arrangements, so the
probability of winning this game is

$$C(26, 13)^2 / C(52, 26) \approx 22\%$$
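
The counting claim in this solution can be tested on a small deck before trusting it for 52 cards. The sketch below plays the game with a stack, which assumes that the order in which adjacent pairs are removed does not affect whether the deck is exhausted. It then checks every arrangement of a hypothetical 8-card deck (4 red, 4 black) against the analogous count C(4, 2)^2.

```python
from itertools import combinations
from math import comb


def wins(deck):
    """Play the game on a sequence of colors; True if every card is removed."""
    stack = []
    for card in deck:
        if stack and stack[-1] == card:
            stack.pop()            # remove an adjacent same-color pair
        else:
            stack.append(card)
    return not stack


def count_wins(reds, blacks):
    """Count winning arrangements by trying every placement of the red cards."""
    size = reds + blacks
    total = 0
    for red_positions in combinations(range(size), reds):
        red_set = set(red_positions)
        deck = ["R" if i in red_set else "B" for i in range(size)]
        total += wins(deck)
    return total


small_deck_wins = count_wins(4, 4)
assert small_deck_wins == comb(4, 2) ** 2      # 36 of the 70 arrangements

# The formula from the solution, applied to the full deck.
p_full_deck = comb(26, 13) ** 2 / comb(52, 26)
print(small_deck_wins, round(p_full_deck, 3))  # 36 0.218
```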





Practice Problem 4 What is the probability of winning the solitaire game in the
previous example if we use two standard decks of cards shuffled together?


Solutions to Practice Problems 


1. From the table in Example 1, we can simply list the outcomes
(4, 6), (6, 4), and (5, 5) that have a sum of 10.
Since there are three of these, the probability that two
rolled dice will sum to 10 is 3/36 = 1/12.

2. (a) The outcomes can be represented as ordered lists
of length 5 with entries from {H, T}, so there are
$2^5 = 32$ outcomes in the sample space S.

(b) The event E consists of those outcomes that look
like _ _ _ H H or _ _ _ T T. In either of the two cases,
there are $2^3 = 8$ outcomes, so n(E) = 16.

(c) It is now easy to compute

$$\text{Prob}(E) = \frac{n(E)}{n(S)} = \frac{16}{32} = \frac{1}{2}$$

So there is a 50% chance that the last two tosses will
be the same.

3. We will assume there are 32 people in the class; your
numbers might be different. We will represent the
outcomes as ordered lists of length 32 with entries from
{10, 11, ..., 98, 99}, so there are $90^{32}$ outcomes in our
sample space S. If E is the event that two people have
the same number, then the complement $\overline{E}$ is the event
that everyone has different numbers. So $\overline{E}$ is the set of
permutations of length 32 with entries from {10, 11, ...,
98, 99}, of which there are P(90, 32). Hence,

$$\text{Prob}(\overline{E}) = \frac{n(\overline{E})}{n(S)} = \frac{P(90, 32)}{90^{32}} \approx 0.00184$$

and so

$$\text{Prob}(E) = 1 - \text{Prob}(\overline{E}) \approx 1 - 0.00184 = 0.99816$$

That is, there is a better than 99.8% chance of two people
picking the same number.

4. The argument is the same, but now there are 52 red cards
and 52 black cards in the entire pack of 104 cards, so the
answer changes to

$$C(52, 26)^2 / C(104, 52) \approx 15.5\%$$


Exercises for Section 6.1


1. If one card is to be drawn from a standard deck of cards,
find the probability that
(a) The card is a spade.
(b) The card is a red ace.

2. A gumball machine contains 20 yellow gumballs, 40
green gumballs, 25 white gumballs, and 15 black
gumballs. If one gumball is purchased, find the probability
that
(a) The gumball is yellow.
(b) The gumball is green or white.
(c) The gumball is not black.

3. If two dice, one red and one green, are rolled, find the
probability that
(a) Exactly one of the dice is a 6.
(b) The sum of the dice is 5.
(c) The sum of the dice is at least 10.
(d) The value on the red die is less than the value on
the green die.
(e) The values on the two dice are different.


4. If a coin is tossed five times, find the probability that
(a) The first toss matches the second toss.
(b) We get five heads.
(c) We get at least one result of tails.

5. For each of the following experiments, describe the
sample space S so that the outcomes as described are equally
likely:
(a) Three (six-sided) dice are rolled.
(b) A club with 20 members elects a president and a
vice president.
(c) A coin is flipped six times.
(d) Three people choose cards from a deck.

6. The following probability questions refer to the
experiments in the previous exercise:
(a) What is the probability that if three dice are rolled,
all three dice have the same value?
(b) If a club's members consist of 8 men and 12 women,
and everyone is equally likely to be chosen an
officer (either president or vice president), what is the
probability that among these two officers are one
man and one woman?
(c) What is the probability that a coin tossed six times
results in three heads and three tails (in any order)?
(d) What is the probability that three people choose
cards of the same suit from a standard deck of 52
cards?

7. If a blackjack hand (consisting of two cards, one
face-down and one face-up) is dealt from a standard deck of
cards, find the probability that
(a) The face-down card is an ace.
(b) The face-down card is a spade and the face-up card
is a heart.
(c) The face-down card is an ace and the face-up card
is a spade.
(d) The two cards have the same suit.
(e) The face-up card has a value of a 10, jack, queen, or
king.
(f) One card is an ace and the other has a value of a 10,
jack, queen, or king.

8. A club with 12 female members and 11 male members is
electing a president and vice president. How many
election outcomes are possible? Assuming each outcome is
equally likely, find the probability that
(a) Both officers are male.
(b) The president is female.
(c) The two officers are of the same sex.

9. The club in the previous exercise is forming a budget
committee consisting of three club members. How many
committees can be formed? Assuming each committee
is equally likely to be appointed, find the probability
that
(a) The committee includes neither the club's president
nor vice president.
(b) The committee has at least one female.
(c) The committee is all-female.
(d) David and Jenny (a married couple) are not both
on the committee.

10. In a box of lightbulbs there are 12 good bulbs and 4
defective bulbs. If five bulbs are to be removed, find the
probability that
(a) All five bulbs are good.
(b) At most one bulb is bad.

11. A packet of five cards consists of the ace of spades, the
ace of hearts, the 2 of spades, the 3 of diamonds, and the
4 of clubs. If the packet is thoroughly shuffled, what is
the probability that the two aces end up side by side?

12. In one version of poker, each player gets a five-card
hand of cards from a standard deck.
(a) A flush is a hand consisting of all cards from the
same suit. What is the probability that a five-card
hand is a flush?

(b) A straight is a hand consisting of card values in
consecutive order, regardless of suits, where an ace can
be counted as either the highest value in the deck
or the lowest value in the deck (but not both). What
is the probability that a five-card hand is a straight?
(c) A straight flush is a hand that is both a straight and
a flush. What is the probability that a five-card hand
is a straight flush?

13. The combination to a lock consists of an ordered list
of three digits from {0, 1, 2, ..., 9}. Assume that every
such list is equally likely to be guessed by a thief trying
to guess the correct combination. Find the probability
that
(a) The combination guessed uses the digit 9.
(b) The combination guessed uses no repeated digits.
(c) The combination guessed has the first two digits
correct.

14. What is the probability that in a class of 16 students two
will have birthdays on the same day (assuming that each
day is equally likely)? You may either ignore February
29 or try the more difficult problem of accounting for it.

15. How does the answer in Exercise 14 change when the
number of students is doubled?

16. On a TI-83 calculator, the command line
prod(seq((366-I)/365,I,1,20)) will give the
probability that everyone in a group of 20 people has a different
birthday. Use this (or other technology you have at
your disposal) to answer the following questions about
the birthday problem:
(a) In a group of 40 people, what is the probability that
at least two people share a birthday?
(b) How many people must be in a group for there to
be a 0.95 probability that at least two people will
share a birthday?
(c) How many people must be in a group for there to
be a 0.99 probability that at least two people will
share a birthday?

17. Some institutions use the last four digits of a student's
Social Security number as a "student number." Assuming
that all four-digit strings of digits are equally likely
to occur, what is the probability that there will be no
duplication of student numbers in an incoming
freshman class consisting of 500 new students? (Note: You
will need a technology tool like the one discussed in the
previous problem.)

18. In a particular lottery game, a purchased ticket
consists of seven distinct numbers from {1, 2, ..., 80}. On
Wednesday nights, someone's grandmother draws 11
numbered ping pong balls from a set of balls
numbered from {1, ..., 80}. Anyone whose seven ticket
numbers all appear among the 11 drawn numbers
is a winner. If I always buy a ticket with numbers
11, 22, 33, 44, 55, 66, 77, what is my probability of
winning in a given week?

19. The following questions refer to a simple dice game:
(a) Suppose Jessica and John each roll a six-sided die.
What is the probability that John's roll is higher than
Jessica's?
(b) Suppose Jessica rolls an eight-sided die and John
rolls a six-sided die. What is the probability that
John's roll is higher than Jessica's?
(c) Generalize your answer to the previous problem for
an n-sided die versus an m-sided die. (Assume that
n ≥ m.)

20. Suppose that a nonnegative integer solution to the
equation w + x + y + z = 10 is chosen at random (each
one being equally likely to be chosen). What is the
probability that in this particular solution w is less than or
equal to 2?

21. A man goes to the store to buy seven candy bars. The
store sells five different types of candy. Assuming each
choice is equally likely to occur, what is the probability
that he gets at least one bar of each type?








22. If 10 apples are distributed among five children (so that
every possible distribution is equally likely), what is the
probability that every student gets at least one apple?

23. If n apples are distributed among 32 children (so that
every possible distribution is equally likely), what is the
probability (in terms of n) that every student gets at
least one apple?

24. Use the answer to Exercise 23 to decide how many
apples should be distributed so that there is a 0.99
probability every student gets at least one apple.

25. Use index cards to make the set of cards described in
Example 5. Conduct the experiment 10 times, recording
how many times (out of 10) the face-down color was
different from the face-up color. Combine these results
with those of others in the class. Does the 1/3 probability
given in the example seem reasonable?

26. Play the game of solitaire described in Example 6 ten
times, and record the number of times the game is won.

27. Verify the claim about winning configurations of the
deck of cards for the solitaire game described in
Example 6.








6.2 Sum and Product Rules for Probability 








Mathematics is replete with systems in which formal rules are used to build answers 
to complicated questions using an understanding of simpler problems. In the study 
of probability, when a complex event is made up of simpler events, we fully expect to 
be able to compute the probability of the complex event in terms of the probabilities 
of the simpler parts. To create such a system, we need only think about the ways 
simple events can be combined to form complex events. 





The Sum Rule 


In Chapter 5, we found that counting the elements in the union of two sets is partic- 
ularly simple when the sets have no elements in common. As we shall see, the same 
principle applies in the calculation of probability for complex events.

Definition Two events are said to be disjoint (or mutually exclusive) if they
cannot occur simultaneously.








The term “disjoint” is compatible with the meaning of disjoint as it applies to 
sets when you remember that technically an event is just a named set of outcomes, 
and sets are said to be disjoint if they share no common member. 










Example 1 For a single toss of a die, the events “getting a 3” and “getting a 4” are
disjoint. However, for two tosses of a die, the events of “getting a 3 on the first roll”
and “getting a 4 on the second roll” are not disjoint since a roll of 3 followed by a roll
of 4 is an outcome in both events.


Practice Problem 1 For each of the experiments given below, decide if the events
described are disjoint:

1. When tossing a coin four times, let $E_1$ be the event that there are exactly three heads
and $E_2$ be the event that there are exactly two heads.

2. When choosing four cards, let $E_1$ be the event that the cards have the same value
and $E_2$ be the event that the cards have the same suit.

3. When choosing a committee of three people from a club with 8 men and 12 women,
let $E_1$ be the event that the committee has a woman and let $E_2$ be the event that the
committee has a man.


Theorem 1 (The Sum Rule) If $E_1$ and $E_2$ are disjoint events in a given
experiment, then the probability that $E_1$ or $E_2$ occurs is the sum of $\text{Prob}(E_1)$ and
$\text{Prob}(E_2)$. That is,

\[ \text{Prob}(E_1 \text{ or } E_2) = \text{Prob}(E_1) + \text{Prob}(E_2) \]

for disjoint events.


Example 2 In playing backgammon, one uses rolls of two dice to generate
moves, and there are usually combinations of dice outcomes that are important. For
example, one might hope for either a 5 on at least one of the dice or a sum of 5 on
the two dice. Verify that these are two disjoint events, and find the probability of this
happening.

SOLUTION We will use $E_1$ for the event of “getting at least one 5” and $E_2$
for the event of “getting a sum of 5.” It should be clear that $E_1$ and $E_2$ are
disjoint since it is impossible for a single roll of the two dice to be counted in
both events (a sum of 5 forces both dice to show 4 or less). Using the list of 36
equally likely outcomes in Example 1, we can easily find the number of outcomes
in $E_1$ and $E_2$. In Table 6-2, we list the 11 outcomes in $E_1$ in bold and the
4 outcomes in $E_2$ as underlined.
Since $E_1$ and $E_2$ are disjoint,

\[ \text{Prob}(E_1 \text{ or } E_2) = \text{Prob}(E_1) + \text{Prob}(E_2) = \frac{11}{36} + \frac{4}{36} = \frac{15}{36} = \frac{5}{12} \]





It is not necessary for two events to be disjoint to find the probability that either 
happens, but the sum rule shows us that when two events are disjoint, the probability 
of either event happening is related to the probability of the two events in a very 
simple way. 
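The dice calculation in Example 2 can be checked by brute force. The following is only a sketch: the variable names (`outcomes`, `e1`, `e2`) and the helper `prob` are illustrative choices, not notation from the text, and the code assumes two fair six-sided dice with all 36 ordered outcomes equally likely.

```python
from fractions import Fraction
from itertools import product

# All 36 equally likely ordered outcomes for a roll of two dice.
outcomes = list(product(range(1, 7), repeat=2))

# E1: at least one die shows a 5.  E2: the two dice sum to 5.
e1 = {o for o in outcomes if 5 in o}
e2 = {o for o in outcomes if sum(o) == 5}

def prob(event):
    """Probability of an event (a set of outcomes) in this sample space."""
    return Fraction(len(event), len(outcomes))

# The events share no outcome, so the sum rule applies.
disjoint = e1.isdisjoint(e2)
p_either = prob(e1 | e2)          # counted directly from the union
p_sum_rule = prob(e1) + prob(e2)  # computed with the sum rule

print(disjoint, len(e1), len(e2), p_either, p_sum_rule)
```

Counting the union directly and adding the two separate probabilities give the same value here precisely because the two sets of outcomes do not overlap.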







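(1,1)  (1,2)  (1,3)  _(1,4)_  **(1,5)**  (1,6)

(2,1)  (2,2)  _(2,3)_  (2,4)  **(2,5)**  (2,6)

(3,1)  _(3,2)_  (3,3)  (3,4)  **(3,5)**  (3,6)

_(4,1)_  (4,2)  (4,3)  (4,4)  **(4,5)**  (4,6)

**(5,1)**  **(5,2)**  **(5,3)**  **(5,4)**  **(5,5)**  **(5,6)**

(6,1)  (6,2)  (6,3)  (6,4)  **(6,5)**  (6,6)
Table 6-2 Outcomes for Example 2 (outcomes with at least one 5 are marked **bold**; outcomes with a sum of 5 are marked _underlined_)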








Theorem 2 (The General Sum Rule) If $E_1$ and $E_2$ are any events in a
given experiment, then the probability that $E_1$ or $E_2$ occurs is given by

\[ \text{Prob}(E_1 \text{ or } E_2) = \text{Prob}(E_1) + \text{Prob}(E_2) - \text{Prob}(E_1 \text{ and } E_2) \]

If $E_1$ and $E_2$ are disjoint, then $E_1 \cap E_2 = \emptyset$, so $\text{Prob}(E_1 \text{ and } E_2) = 0$.





This general rule is more versatile but requires that we calculate the probability
of the two events occurring simultaneously. This can sometimes be rather subtle, so
we will discuss this at length after some examples and practice using simple instances
of the general sum rule.

Example 3 What is the probability that three cards chosen from a standard
deck of cards consist of either three face cards (i.e., cards having a value of jack, queen,
or king) or three cards of the same suit?






SOLUTION We will represent the outcomes as subsets of size 3 taken from the
deck of 52 cards; hence, $n(S) = C(52, 3)$. Let $E_1$ denote those subsets consisting
of three face cards, and let $E_2$ denote those subsets consisting of three cards of
the same suit. Since there are 12 face cards in a standard deck, $n(E_1) = C(12, 3)$,
and since there are four suits, each with 13 cards, $n(E_2) = 4 \cdot C(13, 3)$. Now
$E_1 \cap E_2$ is the event that the subset consists of three face cards of the same suit.
Since there are only three face cards in each suit, this means that $n(E_1 \cap E_2) = 4$.
The general sum rule says that

\[
\begin{aligned}
\text{Prob}(E_1 \text{ or } E_2) &= \text{Prob}(E_1) + \text{Prob}(E_2) - \text{Prob}(E_1 \text{ and } E_2) \\
&= \frac{C(12,3)}{C(52,3)} + \frac{4 \cdot C(13,3)}{C(52,3)} - \frac{4}{C(52,3)} \\
&= \frac{1360}{22100} \approx 0.0615
\end{aligned}
\]
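The counts in Example 3 are easy to confirm numerically. This is a minimal sketch using Python's standard `math.comb` for $C(n, k)$; the variable names are illustrative only.

```python
from fractions import Fraction
from math import comb

n_s = comb(52, 3)          # all 3-card subsets of the deck
n_e1 = comb(12, 3)         # three face cards
n_e2 = 4 * comb(13, 3)     # three cards of one suit, for each of 4 suits
n_both = 4 * comb(3, 3)    # three face cards of one suit: one subset per suit

# General sum rule: subtract the overlap so it is not counted twice.
p_or = Fraction(n_e1 + n_e2 - n_both, n_s)

print(n_e1 + n_e2 - n_both, n_s, float(p_or))
```

Working with `Fraction` keeps the arithmetic exact until the final conversion to a decimal.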

























Practice Problem 2 What is the probability that when tossing a fair coin five
times, either the first two tosses have the same result, or the last two tosses have the 
same result? 


The Product Rule 


In the preceding problems, the intersection of $E_1$ and $E_2$ was easy to count because
it was small, but we will need to have better tools available to deal with more
complicated problems. In some cases, the tool is very easy to understand.
















Example 4 Suppose a die is rolled and a card is drawn from a standard deck.
What is the probability that the die and the card both show an even number value?





SOLUTION We represent the outcomes as the set of ordered pairs $(a, b)$,
where $a \in \{1, 2, 3, 4, 5, 6\}$ and $b$ is a standard playing card. Hence, the sample
space has $6 \times 52$ outcomes. The successful outcomes are the ordered pairs $(a, b)$,
where $a \in \{2, 4, 6\}$ and $b \in \{2C, 4C, 6C, 8C, 10C, 2H, 4H, \ldots\}$. Hence, the event
$E$ consists of $3 \times 20$ outcomes.

Therefore,

\[ \text{Prob}(E) = \frac{3 \times 20}{6 \times 52} = \frac{3}{6} \cdot \frac{20}{52} \]

This is simply the probability that the die is even times the probability that the
card is even.
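As a quick check of Example 4, the 6 × 52 ordered pairs can be enumerated directly. This is a sketch under stated assumptions: the deck encoding (`values`, `suits`) and the helper `is_even` are hypothetical, and aces and face cards are treated as having no even number value, matching the count of 20 even cards used in the solution.

```python
from fractions import Fraction
from itertools import product

die = range(1, 7)
values = ["A", 2, 3, 4, 5, 6, 7, 8, 9, 10, "J", "Q", "K"]
suits = "CDHS"
deck = [(v, s) for v in values for s in suits]

def is_even(value):
    """True only for the numbered cards 2, 4, 6, 8, 10."""
    return isinstance(value, int) and value % 2 == 0

# Sample space: every (die roll, card) pair, all equally likely.
sample = list(product(die, deck))
both_even = [(a, card) for a, card in sample if a % 2 == 0 and is_even(card[0])]

p_both = Fraction(len(both_even), len(sample))
p_die = Fraction(3, 6)     # die shows 2, 4, or 6
p_card = Fraction(20, 52)  # card is one of the 20 even-valued cards

print(len(both_even), len(sample), p_both, p_die * p_card)
```

The directly counted probability agrees with the product of the two separate probabilities, which is the observation the text makes next.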













It seems like the probability of two events happening is simply the product 
of the probabilities that each event happens. This is the nicest possible rule, but 
unfortunately, it does not always work. Just as we need the assumption that two 
events are disjoint in order to apply the nicest version of the sum rule, we must 
acknowledge that an assumption on the events is also necessary in order for this nice 
version of the product rule to be correct. 








Definition Two events are said to be independent if the occurrence of one
event is not influenced by the occurrence (or nonoccurrence) of the other event.

Example 5 For two tosses of a coin, the events “getting a result of heads on the
first toss” and “getting a result of heads on the second toss” are independent. Explain
why in the experiment of dealing two cards from a standard deck, however, the events
“first card chosen is a black ace” and “second card chosen is a black ace” are not
independent.


SOLUTION The probability that the second card chosen is a black ace de- 
pends on whether the first card is, or is not, a black ace. Specifically, if the first 
event fails, then there are two successful outcomes for the second card, but 
if the first event succeeds, then there is only one successful outcome for the 
second card. 










Practice Problem 3 For each of the experiments given below, decide if the events 
described are independent: 








1. When rolling a six-sided die four times, let $E_1$ be the event that the first two rolls
sum to 7 and let $E_2$ be the event that the last two rolls sum to 10.

2. When choosing a committee of three people from a club with 8 men and 12 women,
let $E_1$ be the event that the committee has a woman and let $E_2$ be the event that the
committee has a man.

3. In a household with four children, let $E_1$ be the event that the first child is male and
let $E_2$ be the event that at least half of the children are female.


Theorem 3 (The Product Rule) If $E_1$ and $E_2$ are independent events in
a given experiment, then the probability that both $E_1$ and $E_2$ occur is the product of
$\text{Prob}(E_1)$ and $\text{Prob}(E_2)$. That is,

\[ \text{Prob}(E_1 \text{ and } E_2) = \text{Prob}(E_1) \cdot \text{Prob}(E_2) \]

for independent events.


Example 6 Suppose I have a “loaded” die for which the probability of a 6
appearing is 5, while the probability of each of the other faces appearing is +5, What 
is the probability of getting a 5 and then a 6 on two tosses of the loaded die? 


SOLUTION The events $E_1$, “getting a 5 on the first toss,” and $E_2$, “getting a
6 on the second toss,” are independent, so by the product rule,
e 1 


Probl Ei and Ej) = Prob(Ei) - Prob(E:) = 75 +5 = 25 










Practice Problem 4 For each experiment in Practice Problem 3, compute
$\text{Prob}(E_1 \text{ and } E_2)$ along with $\text{Prob}(E_1) \cdot \text{Prob}(E_2)$. Check that these values are equal
only in the cases where the events are independent.


Just as we were able to drop the assumption of “disjoint events” to create a 
general sum rule, we can likewise drop the assumption of "independent events” to 
create a general product rule. If two events are not independent, then the occurrence 
or nonoccurrence of one of them affects the likelihood of the other one happening. 
To generalize the product rule, we must first make this more precise.











Definition Given events $E_1$ and $E_2$ for some experiment, we define the
probability of $E_1$ given $E_2$, denoted by $\text{Prob}(E_1 \mid E_2)$, as the probability that $E_1$
happens given that $E_2$ occurs. Note that if $E_1$ and $E_2$ are independent, then
$\text{Prob}(E_1 \mid E_2) = \text{Prob}(E_1)$.








This sort of probability is called conditional probability since it expresses the 
probability of one event happening if something else occurs as well. 


Example 7 In Practice Problem 3, we found that when choosing a committee
of three people from a club with 8 men and 12 women, the event $E_1$ of the committee
having a woman on it and the event $E_2$ of the committee having a man on it are not
independent. Compute $\text{Prob}(E_1 \mid E_2)$.


SOLUTION To make the discussion easier to follow, we will refer to event $E_1$
as $W$ and to event $E_2$ as $M$. So we are calculating $\text{Prob}(W \mid M)$, the probability
that a committee contains a woman, given that it contains a man. The “given
that it contains a man” condition implies that our sample space of equally
likely outcomes consists of those committees that contain at least one man.
We consider the complement (those committees with no men), and calculate
that

\[ n(\overline{M}) = C(12, 3) \]

so there are $C(20, 3) - C(12, 3) = 920$ committees in our sample space. Now,
within this sample space, which committees contain a woman? These are the
ones containing one man and two women (there are $C(8, 1) \cdot C(12, 2)$ of these)
or two men and one woman (there are $C(8, 2) \cdot C(12, 1)$ of these). Hence,

\[ \text{Prob}(E_1 \mid E_2) = \text{Prob}(W \mid M) = \frac{C(8, 1) \cdot C(12, 2) + C(8, 2) \cdot C(12, 1)}{C(20, 3) - C(12, 3)} = \frac{864}{920} \approx 0.939 \]
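The restricted sample space in Example 7 can be made concrete by listing every committee. This is only a sketch: club members are encoded as hypothetical `("M", i)` and `("W", i)` tuples, and the conditional probability is computed by counting inside the reduced sample space, as in the solution above.

```python
from fractions import Fraction
from itertools import combinations

# Hypothetical encoding of the club: 8 men and 12 women.
club = [("M", i) for i in range(8)] + [("W", i) for i in range(12)]
committees = list(combinations(club, 3))

# Reduced sample space: committees that contain at least one man.
has_man = [c for c in committees if any(g == "M" for g, _ in c)]

# Within that space, the committees that also contain a woman.
man_and_woman = [c for c in has_man if any(g == "W" for g, _ in c)]

p_w_given_m = Fraction(len(man_and_woman), len(has_man))

print(len(committees), len(has_man), len(man_and_woman), float(p_w_given_m))
```

Swapping the roles of the two filters gives the conditional probability in the other direction, which is what Practice Problem 5 asks for.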










Practice Problem 5 For the preceding example, calculate $\text{Prob}(M \mid W)$, the
probability that the committee contains a man given that it must contain a woman.


In the solutions to the example and practice problem, we essentially argued that

\[ \text{Prob}(E_1 \mid E_2) = \frac{n(E_1 \cap E_2)}{n(E_2)} \]

and that

\[ \text{Prob}(E_2 \mid E_1) = \frac{n(E_1 \cap E_2)}{n(E_1)} \]

These ideas lead to our general product rule.


Theorem 4 (The General Product Rule) If $E_1$ and $E_2$ are any events in
a given experiment, then the probability that both $E_1$ and $E_2$ occur is given by

\[
\begin{aligned}
\text{Prob}(E_1 \text{ and } E_2) &= \text{Prob}(E_2) \cdot \text{Prob}(E_1 \mid E_2) \\
&= \text{Prob}(E_1) \cdot \text{Prob}(E_2 \mid E_1)
\end{aligned}
\]

Note that if $E_1$ and $E_2$ are independent, then this says the same thing as Theorem 3.


In words, this theorem says that we can choose either of the two events and 
calculate its probability, then multiply that by the conditional probability of the other 
event, given that the chosen event occurs. In many situations, it is helpful to picture the
events as taking place one after the other, but this is not necessary for the theorem
to be applied.








Example 8 Two marbles are chosen from a bag containing three red, five white,
and eight green marbles.

1. What is the probability that both are red?
2. What is the probability that one is white and one is green?





SOLUTION

1. We can solve this using the methods of the preceding section, but it will be
instructive to see how the product rule applies. We picture the marbles being
removed one at a time, and define the event $R_1$ as “the first marble is red,”
and the event $R_2$ as “the second marble is red.” The general product rule
states

\[ \text{Prob}(R_1 \text{ and } R_2) = \text{Prob}(R_1) \cdot \text{Prob}(R_2 \mid R_1) \]

Clearly, $\text{Prob}(R_1)$ is $\frac{3}{16}$. To calculate $\text{Prob}(R_2 \mid R_1)$, we imagine that the event
“first marble is red” occurs, and see that of the remaining 15 marbles there
are two red ones, giving $\frac{2}{15}$. The final answer is $\frac{3}{16} \cdot \frac{2}{15} = \frac{1}{40}$.

2. To apply the product rule, it is again convenient to imagine the marbles
being drawn out one at a time. Given this viewpoint, there are two distinct
ways we could get the desired result: white for the first marble and green
for the second, or green for the first and white for the second. Again, we
use meaningful names for the events (e.g., $G_2$ denotes the event “the second
marble is green”). In this problem, we apply the sum rule and then the
product rule, obtaining

\[
\begin{aligned}
\text{Prob}(W_1 \text{ and } G_2) + \text{Prob}(G_1 \text{ and } W_2) &= \text{Prob}(W_1) \cdot \text{Prob}(G_2 \mid W_1) + \text{Prob}(G_1) \cdot \text{Prob}(W_2 \mid G_1) \\
&= \frac{5}{16} \cdot \frac{8}{15} + \frac{8}{16} \cdot \frac{5}{15} = \frac{1}{3}
\end{aligned}
\]
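Both parts of Example 8 can be verified by listing every ordered draw of two distinct marbles. This is a minimal sketch; the list `bag` and the index-based draws are illustrative choices, and each of the 16 × 15 ordered pairs is assumed equally likely.

```python
from fractions import Fraction
from itertools import permutations

# Three red, five white, and eight green marbles.
bag = ["R"] * 3 + ["W"] * 5 + ["G"] * 8

# Ordered draws of two different marbles (by position in the bag).
draws = list(permutations(range(len(bag)), 2))

both_red = sum(1 for i, j in draws if bag[i] == "R" and bag[j] == "R")
white_green = sum(1 for i, j in draws if {bag[i], bag[j]} == {"W", "G"})

p_both_red = Fraction(both_red, len(draws))
p_white_green = Fraction(white_green, len(draws))

# The same values from the general product rule (and the sum rule in part 2).
rule_both_red = Fraction(3, 16) * Fraction(2, 15)
rule_white_green = Fraction(5, 16) * Fraction(8, 15) + Fraction(8, 16) * Fraction(5, 15)

print(p_both_red, rule_both_red, p_white_green, rule_white_green)
```

The enumeration treats the draw as ordered, which mirrors the "one at a time" picture used in the solution.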




Practice Problem 6 When you deal a blackjack hand (first card face-down and
second card face-up), let $T$ indicate the event that the face-down card has a value of 10
(i.e., a 10, jack, queen, or king), and let $A$ stand for the event that the face-up card is
an ace.


1. Calculate Prob(T|A), the probability that the face-down card has a value of 10 
given that the face-up card is an ace. Use this to calculate Prob(T and A). 


2. Calculate Prob(T and A) by using Prob(A|T) instead. Is the answer the same? 


Conditional probability is extremely important in understanding the results of 
statistical surveys, medical procedures, or quality control experiments. In these
applications, there are no theoretical probabilities; everything is determined by data
collection. The mathematical field of statistics addresses the connection between data 
sets and the inferences one can draw from them. 





Example 9 Drug testing of athletes can be a tricky business because medical
tests are not always accurate, for many different reasons. Suppose that a test for steroids
has been determined (by extensive experimentation and data analysis) to have the
following characteristics:

• When the athlete has used steroids, the test result is correct with probability 0.995
and incorrect with probability 0.005.

• When the athlete has not used steroids, the test result is correct with probability 0.98
and incorrect with probability 0.02.

An athlete at a certain event is randomly chosen for drug testing. Let $P$ be the
event that the steroid test is positive, and let $S$ be the event that the athlete has used
steroids. Assuming that 3% of all athletes at that event are using illegal steroids (i.e.,
$\text{Prob}(S) = 0.03$), what is $\text{Prob}(P \text{ and } S)$? What does this mean?


SOLUTION According to our general product rule,

\[ \text{Prob}(P \text{ and } S) = \text{Prob}(P \mid S) \cdot \text{Prob}(S) \]

$\text{Prob}(P \mid S)$ is the probability that the steroid test is positive given that the athlete
really has used steroids, and we are given that this probability is 0.995. We are
also given that $\text{Prob}(S) = 0.03$; hence,

\[ \text{Prob}(P \text{ and } S) = 0.995 \cdot 0.03 = 0.02985 \]

This means that there is a probability of 0.02985 that a tested athlete chosen at
random will be a steroid user and be caught by the test.


Bernoulli Trials 


We conclude this section by showing how to use the sum rule and product rule
together to solve a type of problem that is very common in the study of games. In this
kind of problem, there is an experiment that is based on the independent repetition of
a simpler experiment. In this case, we refer to each instance of the simpler experiment
as a “trial”* to distinguish it from the larger experiment.

We have already seen experiments like this. For example, rolling a single die once
is a simple experiment that is repeated four successive times to form the experiment
in part of Practice Problem 1.








Example 10 Suppose a baseball player gets a hit with probability $\frac{1}{3}$ every time
he steps to the plate. What is the probability that the player gets exactly one hit in four
tries?

The simple experiment here is the batter appearing at the plate to try to get a
hit, and this trial is repeated four times. We will represent outcomes for this larger
experiment as an ordered list of length 4 with entries taken from {H, N}, where H
represents getting a hit and N represents not getting a hit. For example, H N N H
represents the player getting hits on his first and fourth try, but not on his second or
third try.

It is important to notice that the outcomes for this larger experiment are not
equally likely! For example, since it is more likely to not get a hit than to get a hit
each time at the plate, we expect that the outcome N N N N is more likely than the
outcome H H H H. This is precisely why this problem differs from those using our
basic approach to probability problems in the previous section.

















* In the more general study of probability, these are called Bernoulli trials in honor of the Swiss
mathematician Jacob Bernoulli (1654-1705), who wrote one of the earliest books dedicated to probability.







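Since we are given that the batter's probability of getting a hit is always $\frac{1}{3}$, the
successive trials are independent, and so we can use the product rule. In the example
above,

\[
\begin{aligned}
\text{Prob}(H\,N\,N\,H) &= \text{Prob}(H) \cdot \text{Prob}(N) \cdot \text{Prob}(N) \cdot \text{Prob}(H) \\
&= \left(\frac{1}{3}\right)\left(\frac{2}{3}\right)\left(\frac{2}{3}\right)\left(\frac{1}{3}\right) \\
&= \left(\frac{1}{3}\right)^2 \left(\frac{2}{3}\right)^2
\end{aligned}
\]

Notice that the probability of the batter not getting a hit is $\frac{2}{3}$. We are now ready to
solve the example problem.

SOLUTION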


• Let $E_1$ be the event of getting the sequence H N N N.
• Let $E_2$ be the event of getting the sequence N H N N.
• Let $E_3$ be the event of getting the sequence N N H N.
• Let $E_4$ be the event of getting the sequence N N N H.








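These events are certainly disjoint, so by the sum rule the probability that
one of them happens is simply

\[
\begin{aligned}
\text{Prob}(E_1 \text{ or } E_2 \text{ or } E_3 \text{ or } E_4) &= \text{Prob}(E_1) + \text{Prob}(E_2) + \text{Prob}(E_3) + \text{Prob}(E_4) \\
&= \text{Prob}(H\,N\,N\,N) + \text{Prob}(N\,H\,N\,N) + \text{Prob}(N\,N\,H\,N) + \text{Prob}(N\,N\,N\,H) \\
&= 4 \cdot \left(\frac{1}{3}\right)\left(\frac{2}{3}\right)^3 = \frac{32}{81} \approx 0.395
\end{aligned}
\]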
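The reasoning in Example 10 can be sketched in code by listing all 16 hit/no-hit sequences and weighting each one with the product rule. The names `p`, `seq_prob`, and `sequences` are illustrative, and the code assumes the hit probability of 1/3 on every independent trial.

```python
from fractions import Fraction
from itertools import product

p_hit = Fraction(1, 3)
p = {"H": p_hit, "N": 1 - p_hit}

def seq_prob(seq):
    """Product rule: multiply the probabilities of the independent trials."""
    result = Fraction(1)
    for symbol in seq:
        result *= p[symbol]
    return result

# All 2**4 sequences of H and N; they are NOT equally likely.
sequences = list(product("HN", repeat=4))

# Sum rule over the disjoint sequences with exactly one hit.
p_exactly_one = sum(seq_prob(s) for s in sequences if s.count("H") == 1)

# Sanity check: the probabilities of all sequences add up to 1.
total = sum(seq_prob(s) for s in sequences)

print(p_exactly_one, float(p_exactly_one), total)
```

Changing the filter on `s.count("H")` adapts the same sketch to other questions about the number of hits.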













It is possible to visualize all the outcomes of four successive attempts at hitting
with the same kind of tree structure that we encountered in Chapter 5. At each
branch in the tree shown in Figure 6-1, the simple experiment is performed. If the
baseball player gets a hit, the upper branch is followed (labeled H), and if the player
does not get a hit, the lower branch (labeled N) is followed. Thus, all 16 possible
results of the four plate appearances are presented in a visual way.

This tree structure is already fairly large, so drawing it is not a particularly
effective way to solve problems, but visualizing it internally can be very useful. Notice
that the 16 final outcomes are precisely the four-digit binary sequences using symbols
H and N. It is this fact that we will take advantage of in the next section to solve
general problems of this type.











Practice Problem 7 What is the probability that the player from Example 10 will 
get at least one hit in four plate appearances? 














Figure 6-1 Decision tree for four tries at hitting.


Solutions to Practice Problems 


1. (a) When tossing a coin four times, no outcome can consist
of “exactly two heads” and also “exactly three
heads”; hence, these two events are disjoint.

(b) When choosing four cards, if the four cards all have
the same value, then they cannot all have the same
suit; hence, these two events are disjoint.

(c) When choosing a committee of three people from
a club with 8 men and 12 women, there are many
ways in which the committee can include a woman
and a man, so these events are not disjoint.

2. Let $E_1$ be the event of getting the same result on the
first two tosses, and let $E_2$ be the event of getting the
same result on the last two tosses. It is straightforward
to count

\[ n(S) = 2^5 = 32, \quad n(E_1) = 2^4 = 16, \quad n(E_2) = 2^4 = 16, \quad n(E_1 \cap E_2) = 2^3 = 8 \]

Therefore,

\[
\begin{aligned}
\text{Prob}(E_1 \text{ or } E_2) &= \text{Prob}(E_1) + \text{Prob}(E_2) - \text{Prob}(E_1 \text{ and } E_2) \\
&= \frac{16}{32} + \frac{16}{32} - \frac{8}{32} = \frac{3}{4}
\end{aligned}
\]



3. (a) The events are independent. The first two rolls have
no influence on the last two rolls.

(b) The events are not independent. The probability of
the committee having a man ($E_2$) is different when
$E_1$ occurs than it is when $E_1$ does not occur. Specifically,
if $E_1$ does not occur, then $E_2$ happens for sure
(i.e., its probability is 1), and if $E_1$ does occur, then
$E_2$ is not guaranteed to happen (i.e., its probability
is less than 1).

(c) The events are not independent. Four-children
households where the oldest child is a girl are more
likely to have two or more girls than four-children
households where the oldest child is a boy.

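4. The two quantities are equal only in the first part, which
is the only part that describes independent events.

(a) We have $\text{Prob}(E_1 \text{ and } E_2) = \frac{1}{72}$, while $\text{Prob}(E_1) \cdot \text{Prob}(E_2) = \frac{1}{6} \cdot \frac{1}{12} = \frac{1}{72}$.

(b) We have

\[
\begin{aligned}
\text{Prob}(E_1 \text{ and } E_2) &= 1 - \text{Prob}(\text{no men or no women}) \\
&= 1 - \left( \frac{C(12, 3)}{C(20, 3)} + \frac{C(8, 3)}{C(20, 3)} \right) \\
&\approx 0.7579
\end{aligned}
\]

while

\[ \text{Prob}(E_1) \cdot \text{Prob}(E_2) = \left(1 - \frac{C(8, 3)}{C(20, 3)}\right) \cdot \left(1 - \frac{C(12, 3)}{C(20, 3)}\right) \approx 0.7674 \]

(c) Assuming there is a $\frac{1}{2}$ probability that any given
child will be male, we have $\text{Prob}(E_1 \text{ and } E_2) = \frac{1}{2} \cdot \frac{4}{8} = \frac{1}{4}$,
while $\text{Prob}(E_1) \cdot \text{Prob}(E_2) = \frac{1}{2} \cdot \frac{11}{16} = \frac{11}{32}$.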





5. The sample space this time is the set of all committees
with at least one woman, of which there are (by using
the complement) $C(20, 3) - C(8, 3) = 1{,}084$. Of these,
the ones that contain a man are, as in Example 7, the
committees containing one man and two women or two
men and one woman. As calculated in the example, there
are 864 of these, so

\[ \text{Prob}(E_2 \mid E_1) = \text{Prob}(M \mid W) = \frac{864}{1084} = \frac{216}{271} \]


6. (a) If the face-up card is an ace, then there are 51 other
cards in the deck, and 16 of these are either a 10,
jack, queen, or king. Hence,

\[ \text{Prob}(T \mid A) = \frac{16}{51} \]

By the product rule,

\[ \text{Prob}(T \text{ and } A) = \text{Prob}(T \mid A) \cdot \text{Prob}(A) = \frac{16}{51} \cdot \frac{4}{52} \]

(b) The product rule gives

\[ \text{Prob}(T \text{ and } A) = \text{Prob}(T) \cdot \text{Prob}(A \mid T) = \frac{16}{52} \cdot \frac{4}{51} \]

which is the same result.

7. Let $E$ denote the event of getting at least one hit. The
only way he does not get at least one hit is if he ends
up at N N N N in the tree, so this is the complementary
event $\overline{E}$. The product rule tells us that the probability of
$\overline{E}$ is

\[ \text{Prob}(N\,N\,N\,N) = \text{Prob}(N) \cdot \text{Prob}(N) \cdot \text{Prob}(N) \cdot \text{Prob}(N) = \left(\frac{2}{3}\right)^4 \]

Hence, we know that $\text{Prob}(E) = 1 - (2/3)^4 \approx 0.8025$.


Exercises for Section 6.2


1. For each of the following experiments, decide whether
the given events are disjoint. For each that is not,
describe something that is in both events.

(a) Experiment: Drawing two cards. Event 1: Getting
at least one ace. Event 2: Getting at least one club.

(b) Experiment: Rolling two dice. Event 1: Getting at
least one 5. Event 2: Getting a sum of 8.

(c) Experiment: Drawing two cards. Event 1: Getting
two aces. Event 2: Getting two clubs.

2. A card is drawn from an ordinary deck of 52 cards. Show
how to use the basic sum rule to find the probability that
the card is
(a) An ace or a jack.
(b) A diamond or a black face card (jack, queen, or
king).
(c) An even number value or a red face card.

3. A card is drawn from an ordinary deck of 52 cards. Show
how to use the general sum rule to find the probability
that the card is
(a) An ace or a heart.
(b) An ace or a black card.
(c) A diamond, a club, or a king.

4. For each of the following experiments, decide whether
the given events are independent. For each that is
not, demonstrate that $\text{Prob}(E_1 \text{ and } E_2) \neq \text{Prob}(E_1) \cdot \text{Prob}(E_2)$.

(a) Experiment: Dealing two cards from a standard
deck. Event 1: The first card is an ace. Event 2: The
second card is a 10, jack, queen, or king.

(b) Experiment: Rolling a red die and a green die. Event
1: Getting a red 5. Event 2: Getting a sum of 8.

(c) Experiment: Rolling a red die and a green die. Event
1: Getting a red 5. Event 2: Getting a green 6.
5. Two cards are drawn from a deck, with replacement.
(This means that one person chooses a card, looks at it,
and returns it, and then another person chooses a card,
looks at it, and returns it.) Show how to use the product
rule to find the probability that
(a) The first card is an ace and the second card is black.
(b) Both cards are spades.
(c) Neither card has a value from {2, 3, 4, 5}.
(d) At least one card is an ace. (Hint: Consider the
complementary problem.)

6. A fair six-sided die is cast, a fair coin is tossed, and a
card is drawn from a standard deck.
(a) What is the probability that you tossed a heads or
drew an ace?
(b) What is the probability that you rolled a 4 or drew
an ace?
(c) What is the probability that you rolled a 4, tossed a
heads, and drew an ace?

7. What is the probability that when a pair of dice are
rolled, either (at least) one die shows a 5 or the dice
sum to 8?

8. What is the probability that when a pair of dice are
rolled, either (at least) one die shows a 4 or the dice
sum to 10?




















9. What is the probability of rolling three dice and obtain- 
ing a sum between 5 and 10 (inclusive)? 

10. John tosses a penny four times and Jessica tosses a nickel
four times. What is the probability that at least one of
them gets four results of heads?

11. John and Jessica each toss a coin four times. What is the
probability that at least one of them gets four results
that are the same (i.e., either four heads or four tails)?

12. John, Jessica, and Anna each toss a coin four times.
What is the probability that at least one of them gets
four results that are the same?

13. Suppose that we each roll a six-sided die. You win if
the sum of our rolls is even, and I win otherwise. What
is the probability that I win? Now suppose I win if the
sum is a multiple of 3 and you win otherwise. What is
the probability that I win now?

14. Show that it is more likely to get at least one 6 on four
rolls of a single die than at least one “double 6” on 24
rolls of a pair of dice, by computing the probabilities
exactly. (These are the questions from the beginning of
Chapter 6.)

15. What is the probability of rolling a six-sided die six times
and having all the numbers 1 through 6 result (in any
order)?

16. What is the probability of rolling a six-sided die six times
and having only odd numbers result?

17. What is the probability of getting a straight or a flush in
a five-card poker hand? (See Exercise 12 in Section 6.1
for more on the rules of poker.)

18. What is the probability of having nothing good in a
five-card poker hand? (That is, no two cards have the
same value, and the hand is neither a straight nor a
flush.)

19. In blackjack, your opponent receives a face-down card
and a face-up card. In particular, you can see your
opponent's face-up card. This situation is a natural
application of conditional probability. For each question
below, specify events $E_1$ and $E_2$ so that the given question
is about $\text{Prob}(E_2 \mid E_1)$, and then answer the probability
question.

(a) What is the probability that his face-down card is
a 10, jack, queen, or king, given that you see his
face-up card is an ace?

(b) What is the probability that his face-down card is
a 10, jack, queen, or king, given that you see his
face-up card is an ace and you know your own hand
consists of two kings?

(c) What is the probability that his face-down card is
a 10, jack, queen, or king, given that you see his
face-up card is an ace and you know your own hand
consists of a 4 and a 5?






















20. In certain poker games, five-card hands consist of three
face-down cards and two face-up cards. In particular,
you can see two of the cards in your opponent's hand.
For each question below, specify events $E_1$ and $E_2$ so
that the given question is about $\text{Prob}(E_2 \mid E_1)$, and then
answer the probability question.

(a) What is the probability that his hand is a flush, given
that you can see he has the ace and 5 of clubs?

(b) What is the probability that his hand is a flush, given
that you can see he has the ace and 5 of clubs, and
you know your own hand consists of {4H, 6D, 6S,
10S, QD}?

(c) What is the probability that his hand is a flush, given
that you can see he has the ace and 5 of clubs and
you know your own hand consists of all clubs?

21. In the poker games described in Exercise 20, five-card
hands consist of three face-down cards and two face-up
cards. In particular, you can see two of the cards in
your opponent's hand. For each question below, specify
events $E_1$ and $E_2$ so that the given question is about
$\text{Prob}(E_2 \mid E_1)$, and then answer the probability question.

(a) What is the probability that his hand is a straight,
given that you can see he has the 2 of diamonds and
the 6 of clubs?

(b) What is the probability that his hand is a straight,
given that you can see he has the 3 of diamonds and
the 6 of clubs?

(c) What is the probability that his hand is a straight,
given that you can see he has the 5 of diamonds and
the 6 of clubs?

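22. Consider the experiment and events of Example 7.

(a) Find $\text{Prob}(E_1 \mid \overline{E_2})$ (also written $\text{Prob}(W \mid \overline{M})$ in the
notation of the example).

(b) Find $\text{Prob}(E_2 \mid \overline{E_1})$ (also written $\text{Prob}(M \mid \overline{W})$ in the
notation of the example).

(c) In this example, verify that

\[ \text{Prob}(E_2 \mid E_1)\,\text{Prob}(E_1) + \text{Prob}(E_2 \mid \overline{E_1})\,\text{Prob}(\overline{E_1}) = \text{Prob}(E_2) \]

and that

\[ \text{Prob}(E_1 \mid E_2)\,\text{Prob}(E_2) + \text{Prob}(E_1 \mid \overline{E_2})\,\text{Prob}(\overline{E_2}) = \text{Prob}(E_1) \]

(d) Explain in words why the previous statement should
be true for any events $E_1$ and $E_2$.

23. Using the information and notation from Example 9,
(a) What is $\text{Prob}(P \text{ and } \overline{S})$, and what does it mean?
(b) Using the solution in Example 9 and the previous
answer, determine $\text{Prob}(P)$.
(c) What is $\text{Prob}(S \mid P)$, and what does it mean?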
(d) Whatis Prob(SIP), and what does it mean? 


















24. What is the probability that the baseball player in Example 10
gets exactly two hits in four tries?

25. What is the probability that a fair six-sided die rolled
five times comes up 6 exactly once?

26. What is the probability that in six tosses of a fair coin
an even number of heads arise? How does this answer
change if we sneak in a coin that comes up heads 75%
of the time?


6.3 Probability in Games of Chance 








The last example in Section 6.2 illustrates a common problem when analyzing
probabilities in games, so it is worthwhile to address it here in more generality.





Theorem 1 Given a simple experiment, called a Bernoulli trial*, and an event
that occurs with probability $p$, if the trial is repeated independently $n$ times, then the
probability of having exactly $k$ successes is

\[ C(n, k) \cdot p^k \cdot (1 - p)^{n-k} \]

PROOF We represent the outcomes of the experiment as ordered lists of length
$n$ with entries from {S, F}, where S denotes success and F failure of the trial. The
number of different such lists that use exactly $k$ S's is $C(n, k)$ since these are merely
binary sequences. Since $\text{Prob}(S) = p$ and hence $\text{Prob}(F) = 1 - p$, the probability
of any particular outcome with $k$ S's and $n - k$ F's is $p^k \cdot (1 - p)^{n-k}$. Hence, the
probability of having $k$ successes is this probability times the number of outcomes
of this sort, $C(n, k)$.

Example 1 What is the probability that in 10 successive rolls of a fair, six-sided
die, we get exactly five results of 6?





SOLUTION This is a direct application of Theorem 1, with n = 10, k = 5, and
p = 1/6, but we will go through this problem to be sure that the proof above is
clear.

We will represent the outcomes as ordered lists of length 10 using the
entries S and F, where S denotes successfully rolling a 6 and F denotes the
failure to roll a 6. Getting exactly five results of 6 corresponds to those lists that
use exactly five S's and five F's. Since these are effectively binary sequences, the
number of these is C(10, 5) = 252, and the probability of each one happening
can be found using the product rule to be








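$\text{Prob}(S) \cdot \text{Prob}(S) \cdot \text{Prob}(S) \cdot \text{Prob}(S) \cdot \text{Prob}(S) \cdot \text{Prob}(F) \cdot \text{Prob}(F) \cdot \text{Prob}(F) \cdot \text{Prob}(F) \cdot \text{Prob}(F) = \left(\frac{1}{6}\right)^5 \left(\frac{5}{6}\right)^5$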


* Jacob Bernoulli (1654-1705) was a pioneer in the study of probability 








Figure 6-2 Best-of-three series.


Therefore, the probability that one of these occurs is

$C(10, 5) \left(\frac{1}{6}\right)^5 \left(\frac{5}{6}\right)^5 \approx 0.013$
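As a numerical check on Theorem 1 and on the arithmetic in Example 1, the formula can be evaluated directly. The following is only a sketch: the function name binomial_probability is our own choice for illustration, and exact fractions are used so that rounding happens only when the result is converted to a decimal.

```python
from fractions import Fraction
from math import comb

def binomial_probability(n, k, p):
    """Probability of exactly k successes in n independent Bernoulli trials."""
    p = Fraction(p)
    return comb(n, k) * p**k * (1 - p)**(n - k)

# Example 1: exactly five 6's in 10 rolls of a fair die.
exact = binomial_probability(10, 5, Fraction(1, 6))
print(exact, float(exact))
```

Summing binomial_probability(n, k, p) over k = 0, ..., n should always give exactly 1, which is a convenient sanity check on any use of the formula.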


Practice Problem 1 Suppose we havea fake coin that comes up heads with prob- 
ability 3. What is the probability that in 10 successive tosses of this fake coin, we get at 
least eight results of heads? 





Many sporting events are based on a series of games that are played until one
team wins a specified number. For example, in many playoffs, teams play a "best-of-
seven" series, meaning that they play games until one team has won four games (the
majority of the seven possible games). This is somewhat confusing since the series
stops immediately once one team has won four games, so a best-of-seven series might
consist of a total of four games, five games, six games, or seven games. Analyzing these
series is related to Theorem 1, but we can already see there will be some significant
differences. One way to see the different issues is to look at a simple version of a
series from the point of view of a tree diagram.


















u Example 2 If team A wins every game it plays with probability 3, then what 
is the probability that team A wins a best-of-three series over team 








SOLUTION We can visualize all possible series using the tree structure in
Figure 6-2. Here we denote a series as an ordered list with entries from {A, B},
where A denotes a game that team A wins and B denotes a game that team
B wins. Note that the number of games in the series is either two or three, so
we will have to apply Theorem 1 with care. The three series in which team A
is the winner are AA, ABA, and BAA. These series are disjoint events, so we
can simply add the probabilities of each:








Prob(A wins the series) = Prob(AA) + Prob(ABA) + Prob(BAA)
$= p^2 + p(1 - p)p + (1 - p)p \cdot p$

where p denotes the probability that team A wins any single game.


Example 3 Suppose the local Barons team wins every game they play with a
probability of 0.75. What is the probability that they would win a best-of-seven playoff
series (i.e., they play games until one team has won four games) by a margin of four
games to two?





SOLUTION The series could be represented as a list of B's and O's (for
Barons and Opponents) that uses two O's and four B's and ends with a B.
For example, the list BOBBOB represents a series in which the Barons win
games 1, 3, 4, and 6. Since these series must end with a B, the only choices to
be made involve how to distribute the two O's and three B's among the first
five games; there are C(5, 2) = 10 of these altogether. By the product rule, the
series BOBBOB above would happen with probability

$\left(\frac{3}{4}\right)\left(\frac{1}{4}\right)\left(\frac{3}{4}\right)\left(\frac{3}{4}\right)\left(\frac{1}{4}\right)\left(\frac{3}{4}\right) = \left(\frac{3}{4}\right)^4 \left(\frac{1}{4}\right)^2$

Since each of the 10 possible series has this same
probability of occurring, the final answer (by the sum rule) is the above prob-
ability added to itself 10 times. That is, the probability of a 4-2 Barons win is

$10 \cdot \left(\frac{3}{4}\right)^4 \left(\frac{1}{4}\right)^2 \approx 0.198$
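The counting argument in Example 3 works for any margin of victory: the final game must be a win, and the losses can fall anywhere among the earlier games. The sketch below packages that reasoning into a small function; the name win_series_with_losses and its parameters are hypothetical, introduced only for this illustration.

```python
from fractions import Fraction
from math import comb

def win_series_with_losses(wins_needed, losses, p):
    """Probability that a team wins the series while losing exactly `losses` games.

    The last game must be a win, so only the earlier games can be arranged freely.
    """
    p = Fraction(p)
    arrangements = comb(wins_needed - 1 + losses, losses)
    return arrangements * p**wins_needed * (1 - p)**losses

# Example 3: the Barons (p = 3/4) win a best-of-seven series four games to two.
four_two = win_series_with_losses(4, 2, Fraction(3, 4))
print(four_two, float(four_two))
```

Calling the function with losses = 0, 1, 2, 3 and adding the results gives the probability of winning the series by any margin.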


Practice Problem 2 Using the same assumptions as in the previous example, what 
is the probability that the Barons win a best-of-seven series by a margin of four games 
to three? 


n 





Example 4 After game 1 of the World Series (a best-of-seven series) in 1992,
the announcers took great pleasure in announcing that over the previous 20 years, it
had happened 12 times that the team which won the first game went on to win the series.
They seemed to be suggesting that 60% is surprisingly high. Is it?


SOLUTION Let the teams be called A and B, so that we can denote a World
Series as we did for the series in Example 3. We will assume for the moment
that the probability of A winning any particular game is 1/2 and that the games
are independent events. Our sample space will be the set of all World Series
in which team A wins the first game. The problem will be to determine the
probability that something in our space results in A winning the series. Note,
however, that the outcomes in the sample space are not all equally likely since we
do not have a fixed series length among items in the sample space.

Therefore, we will break the problem into cases based on the total number 
of games in a series, and then we will apply the rule of sums. Assume team A 
wins the first game. Then 





• In order to have a four-game series, team A must win the next three games.
There is a $(1/2)^3 = 1/8$ probability that this happens.

• In order to have a five-game series, it must look like A _ _ _ A, where the
blanks are filled by any ordered list of two A's and one B. There are
C(3, 2) = 3 of these lists, and the probability of each of them happening is
$(1/2)^4 = 1/16$. Thus, the probability is 3/16.

• In order to have a six-game series, it must look like A _ _ _ _ A, where the
blanks are filled by any ordered list of two A's and two B's, of which there
are C(4, 2) = 6. The probability of each is 1/32, so the probability for this
case is 6/32.

• In order to have a seven-game series it must look like A _ _ _ _ _ A, where the
blanks are filled by any ordered list of two A's and three B's, of which there
are C(5, 2) = 10. The probability of each is 1/64, so the probability for this
case is 10/64.


























Figure 6-3 Probability of series win versus probability of game win.


By the sum rule, the probability that team A wins the series after winning
the first game is 1/8 + 3/16 + 6/32 + 10/64 = 21/32 ≈ 65.6%.

So in fact, 12 out of the last 20 is slightly lower than one would expect, with
each team having a 1/2 probability of winning each game.
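The case analysis in Example 4 can be cross-checked with a short recursive computation that tracks only how many more wins each team needs. This is a sketch under the same assumptions as the example (independent games, fixed probability p that A wins each game); the function name series_win_probability is our own.

```python
from fractions import Fraction
from functools import lru_cache

@lru_cache(maxsize=None)
def series_win_probability(a_needed, b_needed, p):
    """Probability that team A wins when A still needs a_needed wins and B needs b_needed."""
    if a_needed == 0:
        return Fraction(1)
    if b_needed == 0:
        return Fraction(0)
    # Condition on the next game: A wins it with probability p, B with 1 - p.
    return (p * series_win_probability(a_needed - 1, b_needed, p)
            + (1 - p) * series_win_probability(a_needed, b_needed - 1, p))

# Example 4: A has already won game 1 of a best-of-seven series, with p = 1/2.
print(series_win_probability(3, 4, Fraction(1, 2)))
```

The same function handles other situations by changing the starting counts; for instance, a team down two games to none needs 4 wins while its opponent needs 2.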





Example 5 Few teams in sports have ever come back to win a best-
of-seven series after being down two games to none. What would the probability p of
team A winning any given game over team B have to be so that team A will have at
least a 50% chance of coming back from a 2-0 deficit to claim the series?


SOLUTION If A starts off down two games to none, A can only come back
and win the series by winning four of the next five games, making the remainder
of the series look like one of the following:

• A wins the next four games in a row with probability $p^4$.

• The remainder of the series looks like _ _ _ _ A, where the four blanks
consist of three A's and one B in any order, with probability $4(1 - p) \cdot p^4$.





Hence, the probability that team A comes back to win the series is

$4(1 - p) \cdot p^4 + p^4 = p^4(5 - 4p)$

The graph of this expression as a function of p in Figure 6-3 shows us that the
probability team A wins after losing the first two games exceeds 0.5 when p is
at least 0.69.
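The threshold in Example 5 was read from a graph, but it can also be located numerically. The sketch below uses bisection, which is our own choice of method rather than anything used in the text, and it relies on the fact that $p^4(5 - 4p)$ is increasing for p between 0 and 1. The function names are hypothetical.

```python
def comeback_probability(p):
    """Probability of winning four of the next five games before losing two."""
    return p**4 * (5 - 4 * p)

def smallest_p_for(target, lo=0.0, hi=1.0, tolerance=1e-9):
    """Bisection search; relies on comeback_probability increasing on [0, 1]."""
    while hi - lo > tolerance:
        mid = (lo + hi) / 2
        if comeback_probability(mid) < target:
            lo = mid
        else:
            hi = mid
    return hi

print(round(smallest_p_for(0.5), 4))
```

The value found lies between 0.68 and 0.69, which agrees with the estimate taken from Figure 6-3.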


Tennis is an example of a game that does not have a finite set of outcomes. Any
number of points can make up a tennis game, since to win the game a player must
“win by two.” The scoring in tennis is archaic. Individual point totals take on values
of 0, 15, 30, 40, or "game" in which the convention is that once tied at 40 (such a tie
score is called "deuce"), a player must win two points in a row to win the game, but
there are no additional numerical scores used. These seem like strange conventions
if you have never played tennis before, but you can always think of tennis as being a
game where you play to a score of four but you have to win by two. For this reason,
there is no fixed number of points in a tennis game. It is possible to have a tennis
game last 4 points or 40.


Example 6 Suppose a tennis player has a 3/5 probability of winning each
point she plays. What is the probability that she wins a game by a score of "game"-30?
(This would be a score of 4-2 if it was any other kind of game.)

SOLUTION Using A for our preferred player and B for her opponent, a game
of this type must look like a list of four A's and two B's, where the rightmost
letter (i.e., the final point) is an A. There are only C(5, 2) = 10 such binary
sequences, and each one has a probability of

$\left(\frac{3}{5}\right)^4 \left(\frac{2}{5}\right)^2$

of occurring, so the probability that a game ends with this score is

$10 \left(\frac{3}{5}\right)^4 \left(\frac{2}{5}\right)^2 \approx 0.207$


The more interesting problem in tennis is how to analyze games that at some
point are tied at deuce. We will see how to deal with this in Section 6.5.


Solutions to Practice Problems 


1 There are three disjoint cases to consider: Either we get 
exactly eight heads, we get exactly nine heads, or we get 
all 10 heads. According to Theorem 1, the probabilities 
for these cases are C(10,8) ($)* (3). C (10.9) ($) G 
and C(10, 10) ($)” (3)" respectively. Hence, the proba- 
bility that one of these cases occurs is the sum 


cua seven) G) 


cava ($) G) =oaass 


2 As in Example 3, such a series (i.e., a 4-3 Barons win)
must end with a B, so it must look like a sequence of
three O's and three B's (in some order) followed by a
B. For example, BOBBOOB is the sequence BOBBOO
followed by the letter B. There are C(6, 3) = 20 such
sequences, and each represents a series that has probability

$\left(\frac{3}{4}\right)^4 \left(\frac{1}{4}\right)^3$

Therefore, the probability that the Barons will win 4-3 is

$20 \left(\frac{3}{4}\right)^4 \left(\frac{1}{4}\right)^3 \approx 0.099$


Exercises for Section 6.3


1. What is the probability that in seven rolls of a six-sided
die, the result of 1 appears exactly five times?

2. What is the probability that in seven rolls of a six-sided
die, the result of 1 appears at least five times?

3. What is the probability of getting exactly 3 heads on 10
tosses of a fair coin?

4. What is the probability that in 10 tosses of a fair coin, the
result of heads appears at least eight times? Compare
this to your answer in Practice Problem 1.








5. What is the probability of getting exactly one 6 on 10
tosses of a fair six-sided die?

6. What is the probability of getting at least three heads
on five tosses of a fair coin?

7. What is the probability of getting at least two 3's on five
tosses of a fair six-sided die?

8. What is the probability of getting more heads than tails
on four tosses of a fair coin?

9. What is the probability of getting more 6's than 5's on
four tosses of a fair six-sided die?

10. What is the probability when tossing a pair of dice of
getting either a 5 on one die, a sum of 5, or a sum of 8?
(This is the kind of problem that arises in backgammon.)

11. What is the probability that in five dealt cards, exactly
two cards will have the same value? (This is called “a
pair" in poker.)

12. In military clubs, a dice game is played with rules simi-
lar to those of draw poker. A player rolls five dice and
these five values comprise his or her hand. The player
then has the option to re-roll any number of the dice
to improve the result. Suppose I roll a pair of 3's and
re-roll the other three dice.

(a) What is the probability that my new hand contains
three of a kind?

(b) What is the probability that my new hand contains
two pair?

(c) What is the probability that my new hand contains
three of one value and two of some different value?
(This hand is called a full house in draw poker.)

13. In the dice game described in Exercise 12, suppose I
leave two dice with values 3 and 4 on the table and
re-roll the other three dice. Answer questions (a-c) in
Exercise 12 for this new situation.

14. The Bears and the Cougars play a best-of-three series in
which the probability of the Bears winning each individ-
ual game is constant. The Bears have a 0.9 probability of
winning the series. What is the probability of the Bears
winning each individual game?

15. Which is larger, the probability that a fair coin will come
up heads exactly three times in six tosses, or the proba-
bility that a fair (six-sided) die will come up as a 5 or 6
exactly twice in four tosses?

16. For each value of k from 0 to 8, find the probability of 
getting exactly k heads on eight tosses of a fair coin. 
Create a bar graph with the values 0 through 8 along 
the horizontal axis, with the height of each bar deter- 
mined by the answers to the probability questions. This 
graph illustrates what is called the binomial probability 
distribution. 

17. As a group, repeat the experiment “Toss a fair coin eight
times and record the number of heads” a total of 100
times. Create a bar graph with the values 0 through 8
along the horizontal axis, with the height of the bar la-
beled k determined by the proportion of all outcomes of
the experiment that had exactly k heads. Compare this
bar graph to the one drawn in the previous exercise.

18. Christopher's favorite numbers are 1 and 4, Jessica likes
2, 3, and 5, and John's favorite number is 6. What is the
probability that on nine rolls of a fair die, each child
sees his or her favorite numbers three times?

19. Assume that Kenny always gets a hit with probability
1/3 and a base-on-balls with probability 1/6.

(a) Determine the probability that in five plate appear-
ances, Kenny gets at least two hits.

(b) Determine the probability that in five plate appear-
ances, Kenny gets a base-on-balls at least twice.

(c) Determine the probability that in five plate appear-
ances, Kenny gets on base with a hit more times
than he gets on with a base-on-balls.

20. We assumed in Example 3 that the Barons had a prob-
ability of 0.75 of winning every game they played. What
is the probability that the Barons would win a best-of-
seven-game series? (Use the results from Example 3
and Practice Problem 2.)

21. John wins each set of tennis he plays with probability 

3. Read the following two questions, and make an in- 

tuitive guess about which answer will be bigger. Then 

answer both questions to see if you are right. 

(a) If he is behind one set to none, what is the proba-
bility that he can come back to win a best-of-five-set
tennis match?

(b) If he is behind two sets to one, what is the probabil-
ity that he can come back to win a best-of-five-set
tennis match?

22. Suppose that team A has a 60% chance of winning any
given game against team B.

(a) What is the probability that team A wins a best-of-
three series against team B?

(b) What is the probability that team A wins a best-of-
five series against team B?

(c) What is the probability that team A wins a best-of-
seven series against team B?

23. Home Field Advantage. Many best-of-seven series are 
scheduled so that the first two games are played at the 
home field of team A, the next three are played at the 
home field of team B, and the last two are played back 
at the home field of team A. (Of course, if the series 
ends early, some games will not be played.) Assume 
that both teams win on their own home field with prob- 
ability $. What is the probability that team A (the first 
home team) wins the series? 




6.4 Expected Value in Games of Chance 





Gambling games are based on chance occurrences like getting cards of all the same
suit or rolling a sum of 8 on two dice three times in a row. We do not think of a game
like baseball or tennis as being based on chance occurrences, but we often make
implicit assumptions along these lines. For example, we think that a .320 hitter is
more likely to get a hit today than a .220 hitter. In Section 6.3 we made assumptions
about the probability that one team beats another, even though the contest is surely
not random. In this section we will analyze some other aspects of games related to
the explicit or implicit probabilities involved in them.

We have already seen examples of how to compute probabilities for “large”
events given information about "smaller" events. For example, we analyzed the prob-
ability of winning a best-of-seven series in terms of the probability of winning an
individual game. Another aspect of larger events that is also of interest is the aver-
age number of the smaller events that make up the larger one. For example, what
is the average number of games that a best-of-seven series lasts? How many points
are needed to resolve a tennis game that is tied at deuce? To answer these kinds of
questions, we will need to investigate the mathematical idea of average values.














The Definition of Average Value 


Example 1 Suppose that among 100 households there are 30 with no children,
25 with one child, 30 with two children, 10 with three children, and 5 with four children.
What is the average number of children per household?


SOLUTION There are a total of (0)(30) + (1)(25) + (2)(30) + (3)(10) +
(4)(5) = 135 children among the 100 households, for an average of 1.35 children
per household.











We can arrive at the same answer by combining the idea of a “weighted average”
with our knowledge of probability. The number of children in each household is a
number from {0, 1, 2, 3, 4}, and the probability that each of these numbers occurs is
simply the proportion of households with that number of children. That is, if we think
of our probability experiment as choosing a household, we will have the following:

• Prob(there are 0 children) = 30/100
• Prob(there is 1 child) = 25/100
• Prob(there are 2 children) = 30/100
• Prob(there are 3 children) = 10/100
• Prob(there are 4 children) = 5/100


A more efficient way to represent this is through the notion of a random variable.
This is simply a measurement of some numerical value associated with each possible
outcome of a probability experiment. In this case, the random variable X is the
number of children in the household chosen by the experiment. This allows us to write
Prob(X = 0), for example, instead of Prob(there are 0 children) as we did above.













Definition For a given probability experiment, let X be a random variable
whose possible values come from the set of numbers $\{x_1, \ldots, x_n\}$. Then the
expected value of X, denoted by E[X], is the sum

$x_1 \cdot \text{Prob}(X = x_1) + x_2 \cdot \text{Prob}(X = x_2) + \cdots + x_n \cdot \text{Prob}(X = x_n)$

This is sometimes called the average value of the random variable, thinking of
the average of the values X takes on over many repetitions of the experiment.














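We can use this definition to find an alternative solution to Example 1:

$E[X] = (0) \cdot \text{Prob}(X = 0) + (1) \cdot \text{Prob}(X = 1) + (2) \cdot \text{Prob}(X = 2) + (3) \cdot \text{Prob}(X = 3) + (4) \cdot \text{Prob}(X = 4)$

$= (0) \cdot \frac{30}{100} + (1) \cdot \frac{25}{100} + (2) \cdot \frac{30}{100} + (3) \cdot \frac{10}{100} + (4) \cdot \frac{5}{100}$

$= \frac{(0)(30) + (1)(25) + (2)(30) + (3)(10) + (4)(5)}{100}$

$= 1.35$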
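The definition of expected value translates almost word for word into a short computation. The sketch below stores a distribution as a dictionary from values to probabilities; that representation, and the name expected_value, are assumptions made for this illustration rather than notation from the text.

```python
from fractions import Fraction

def expected_value(distribution):
    """Sum of value * probability over a dict mapping values to probabilities."""
    assert sum(distribution.values()) == 1, "probabilities must sum to 1"
    return sum(value * prob for value, prob in distribution.items())

# Example 1: number of children per household, from the counts out of 100.
households = {0: 30, 1: 25, 2: 30, 3: 10, 4: 5}
children = {k: Fraction(count, 100) for k, count in households.items()}
print(float(expected_value(children)))
```

Using exact fractions makes the check that the probabilities sum to 1 reliable, which would not always be the case with floating-point values.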





Practice Problem 1 Let X represent the number of siblings of a person in your
class. Collect all the necessary data about X, and use the definition to compute E[X].





Often, the term average value is more appropriate than expected value for our
intuition about the meaning of E[X]. Consider the following example.


Example 2 On the loaded die from Example 6 in Section 6.2, what is the
expected value on one roll? Compare your answer to the expected value on one roll of
a fair die.
SOLUTION For that die, the probability of a 6 is 4, with a probability of $ 
for each of the other faces, Let X represent the outcome of one roll—so for 
either die X has possible values from (1, 2, 3, 4, 5, 6]. For the loaded die, we 


can compute 
1 1 
5) ý (70) TR (3) 


= (35) +@ (55) + (55) «e( 


1 1 
= 71) +5 = 





EX] 








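and similarly, for the fair die,

$E[X] = (1)\left(\frac{1}{6}\right) + (2)\left(\frac{1}{6}\right) + (3)\left(\frac{1}{6}\right) + (4)\left(\frac{1}{6}\right) + (5)\left(\frac{1}{6}\right) + (6)\left(\frac{1}{6}\right) = \frac{1}{6}(21) = 3.5$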








Value of X                | 1     | 2     | 3     | 4     | 5     | 6
Number of rolls (approx.) | 5,000 | 5,000 | 5,000 | 5,000 | 5,000 | 5,000

Table 6-3 Approximate Distribution of 30,000 Die Rolls


In each case, we claim that the expected value on one roll of a die is not a whole
number. Of course, we never really expect to roll a 3.5 on a die, so the word "expec-
tation” is a poor choice in this context. The quantity 3.5 is actually our prediction
for the average of all the values of X that we will see in many, many trials of the
experiment.

In this case, imagine rolling a fair die 30,000 times. Because the die is fair, we 
expect the distributions of rolls to roughly break down as shown in Table 6-3. So if 
we find the average of all 30,000 recorded values of X, we will have 








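$\frac{\text{Sum of } X \text{ values}}{\text{Number of trials}} \approx \frac{(1)(5{,}000) + (2)(5{,}000) + (3)(5{,}000) + (4)(5{,}000) + (5)(5{,}000) + (6)(5{,}000)}{30{,}000}$

$= (1)\frac{5{,}000}{30{,}000} + (2)\frac{5{,}000}{30{,}000} + (3)\frac{5{,}000}{30{,}000} + (4)\frac{5{,}000}{30{,}000} + (5)\frac{5{,}000}{30{,}000} + (6)\frac{5{,}000}{30{,}000}$

$= (1)\frac{1}{6} + (2)\frac{1}{6} + (3)\frac{1}{6} + (4)\frac{1}{6} + (5)\frac{1}{6} + (6)\frac{1}{6} = 3.5$
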
This interpretation of expected value as "long-term average" is particularly rel- 
evant in games that are to be played many, many times such as gambling and lottery 
games. 
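The "long-term average" interpretation can be observed by simulation. The sketch below rolls a simulated fair die many times and reports the average; the fixed seed is only there to make the run repeatable, and the function name average_roll is our own.

```python
import random

def average_roll(num_rolls, seed=0):
    """Average face value over num_rolls simulated rolls of a fair die."""
    rng = random.Random(seed)
    total = sum(rng.randint(1, 6) for _ in range(num_rolls))
    return total / num_rolls

for n in (30, 3_000, 300_000):
    print(n, average_roll(n))
```

With only a few rolls the average can be noticeably off, but as the number of rolls grows it should settle close to the expected value of 3.5.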


Example 3 Suppose you pay $2 each time to play the following game: Two
dice are rolled, and you win $5 for each 6 that comes up. Do you expect to win more
than you pay if you play many, many times?


SOLUTION Let X represent the amount of money you win in one play of
the game. Hence, X can take values from {0, 5, 10}, and

• Prob(X = 0) = Prob(getting no 6's) = 25/36
• Prob(X = 5) = Prob(getting one 6) = 10/36
• Prob(X = 10) = Prob(getting two 6's) = 1/36

Hence,

$E[X] = 0 \cdot \text{Prob}(X = 0) + 5 \cdot \text{Prob}(X = 5) + 10 \cdot \text{Prob}(X = 10)$

$= 0 \cdot \frac{25}{36} + 5 \cdot \frac{10}{36} + 10 \cdot \frac{1}{36}$

$= \frac{60}{36} \approx 1.67$

Since this is less than the $2 it costs to play the game each time, we can see
that you will actually lose an average of about 33 cents for each time you play
over the long term.
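Because two dice have only 36 equally likely outcomes, the expected winnings in Example 3 can also be found by listing every outcome and averaging the payouts. The sketch below does this; the function name expected_winnings and its payout_per_six parameter are hypothetical, chosen for this illustration.

```python
from fractions import Fraction
from itertools import product

def expected_winnings(payout_per_six=5):
    """Average payout over the 36 equally likely outcomes of two dice."""
    outcomes = list(product(range(1, 7), repeat=2))
    total = sum(payout_per_six * roll.count(6) for roll in outcomes)
    return Fraction(total, len(outcomes))

net = expected_winnings() - 2   # subtract the $2 cost to play
print(expected_winnings(), float(net))
```

A negative net value means the player loses money on average, in agreement with the conclusion of the example.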


o 










Practice Problem 2 Suppose in Example 3 the payoff is $5 for rolling one 6, and
$25 for rolling two 6's. Now is it worth $2 to play?


Example 4 In Example 5 of Section 6.1, the following card game is described:

We have a packet of three cards in which one is blue on both sides, one is red on
both sides, and one is red on one side and blue on the other. One of three cards
is chosen and placed on the table in full view of both of us. We now wager on the
color of the face-down side of the card. You will give me $5 if the face-down side
is the same color as the (visible) face-up side, and I will give you $6 otherwise.

What is the expected value of my winnings each time the game is played?





SOLUTION Let X represent the number of dollars I win in each play of the
game. Then X can only take one of the two values from {−6, +5}, and from our
solution to Example 5, we know that

• Prob(X = −6) = Prob(the red/blue card is chosen) = 1/3, and
• Prob(X = 5) = Prob(the red/blue card is not chosen) = 2/3.

So E[X] = (−6) · (1/3) + (5) · (2/3) = 4/3 ≈ $1.33. This means that, on average, I will
win $1.33 for each time we play this game, so in the long term, it is a bad idea
for you to play.











Expectation in Bernoulli Trials 


A fundamental idea for relating small trials to larger experiments is the structure
of Bernoulli trials developed in Section 6.3. The concept is that an experiment is
made up of a sequence of independent trials whose individual probabilities are well
understood. In this case, there is an easy general statement we can make about the
expected number of successful trials in an experiment.


Theorem 1 Suppose an experiment consists of the independent repetition of a
trial n times, and the probability of that trial's individual success is p each time it
is performed. If X denotes the number of successful trials in this experiment, then
E[X] = n · p.





This theorem follows easily from the basic rule for expected value:

E[X + Y] = E[X] + E[Y]

but we will prove it by induction instead. This not only allows us to see an induction
proof in a probability context, but it also illustrates a conceptual framework that is
essential in the next section.





PROOF (By induction on n.) Let P(n) denote the statement “The average num-
ber of successes in n Bernoulli trials with probability p is n · p." Statement P(1) is
"The average number of successes in one Bernoulli trial with probability p is p.”
Letting X denote the number of successes in one trial, we know that X takes on
values from {0, 1}, and so by the definition of expected value

E[X] = (0) · Prob(X = 0) + (1) · Prob(X = 1) = p

Now let m > 1 be given such that statements P(1), P(2), ..., P(m − 1) have
all been checked to be true and we are ready to consider the next statement P(m).
Let X denote the number of successes in an experiment consisting of m Bernoulli
trials. Just as we did in Section 6.3, we can represent outcomes of this experiment
as ordered lists of length m whose elements come from {S, F}, where we think of
S and F as representing “success” and “failure.”

Every outcome of the experiment looks like one of the following, where each
blank is to be filled by any ordered list of length m − 1 with entries from {S, F}.







Case 1. S ____ . In this case, the total number of successes is one more
than the number of S's in the blank.

Case 2. F ____ . In this case, the total number of successes is exactly the
number of S's in the blank.

By the definition of expected value, and using statement P(m − 1) for the
average number of S's in the blank,

E[X] = (Number of successes in case 1) · (Probability of case 1 happening)
+ (Number of successes in case 2) · (Probability of case 2 happening)

$= (1 + (m - 1) \cdot p) \cdot (p) + ((m - 1) \cdot p) \cdot (1 - p)$

$= (p + mp^2 - p^2) + (mp - p - mp^2 + p^2)$

$= m \cdot p$

That is, E[X] = m · p, completing the proof of statement P(m).
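The theorem can also be spot-checked directly from the definition of expected value, using the probabilities from Theorem 1 of Section 6.3. The sketch below is a check for a few particular values of n and p, not a proof; the function name expected_successes and the test values are our own choices.

```python
from fractions import Fraction
from math import comb

def expected_successes(n, p):
    """E[X] computed from the definition, using the binomial probabilities."""
    p = Fraction(p)
    return sum(k * comb(n, k) * p**k * (1 - p)**(n - k) for k in range(n + 1))

for n, p in [(1, Fraction(1, 2)), (5, Fraction(1, 4)), (10, Fraction(1, 6))]:
    assert expected_successes(n, p) == n * p
print("definition agrees with n * p in each case tested")
```

Exact fractions are used so that the comparison with n · p is an exact equality rather than an approximate one.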








2 Example 5 A softball player gets a hit with probability $ each time she steps 
to the plate. In any given game, this player has exactly five plate appearances. Use the 
definition of expected value to show that the expected number of hits per game is 3. 
Compare this with the prediction made by Theorem 1. 
SOLUTION Let X denote the number of hits in a game. Then X takes on 
values from the set (0, 1,2, 3, 4, 5). By the definition of expected value, 
ELX] = (O)Prob(X = 0) + (1)Prob(X = 1) + --: + (5)Prob(X 





5 


= ees. (2) «axc.»(3) (2) «exe» (3) (2) 


+0¢6.3(2) (2) acea) (2) «cs. (3) 
=3 


This is the same as the value (5) - (3) predicted by the theorem. 
o 


Practice Problem 3 Use the definition of expected value to show that the average
number of results of heads in an experiment consisting of tossing a coin three times is
1.5.


Average Values in Sports 


We conclude this section by solving some problems that come up in sports when two
opponents play a best-of-five or a best-of-seven series. We once again model a series
as a sequence of independent games. Since the total number of games in a series
is not fixed, two issues arise. First, we cannot directly use the result in Theorem 1.
Second, the average number of games in a series becomes an interesting quantity to
study.


Example 6 If two teams A and B play a best-of-five series* and if team A has
a 3/4 probability of winning any given game, then what is the average number of games
in the series?


SOLUTION The variable X representing the number of games in a series
has values from the set {3, 4, 5}. In order to compute E[X], we need to know
Prob(X = 3), Prob(X = 4), and Prob(X = 5).

As in the previous section, we will represent a series as an ordered list of
A's and B's in which the winner appears three times, the loser appears two or
fewer times, and the winner appears last. For example, AABA denotes a series
in which A wins games 1, 2, and 4 and B wins game 3.


• For X to be 3, we must have either AAA or BBB, so

$\text{Prob}(X = 3) = \left(\frac{3}{4}\right)^3 + \left(\frac{1}{4}\right)^3 = 0.4375$

• For X to be 4, we must have either _ _ _ A or _ _ _ B, where in the first case
the blanks are filled by two A's and one B (C(3, 1) = three ways) and in the
second case the blanks are filled by two B's and one A (three ways). Each
of the three series in the first case occurs with probability $(3/4)^3 (1/4)$, and
each of the three in the second case occurs with probability $(1/4)^3 (3/4)$. Therefore,

$\text{Prob}(X = 4) = 3\left(\frac{3}{4}\right)^3\left(\frac{1}{4}\right) + 3\left(\frac{1}{4}\right)^3\left(\frac{3}{4}\right) = 0.3515625$

• Since X = 5 is the only other possible case, we know that Prob(X = 3) +
Prob(X = 4) + Prob(X = 5) = 1, so

Prob(X = 5) = 1 − (Prob(X = 3) + Prob(X = 4))
= 1 − 0.4375 − 0.3515625 = 0.2109375

Therefore, the average number of games is

E[X] = 3 · Prob(X = 3) + 4 · Prob(X = 4) + 5 · Prob(X = 5)
= 3 · (0.4375) + 4 · (0.3515625) + 5 · (0.2109375)
≈ 3.77 games
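The case-by-case reasoning in Example 6 follows the same pattern for every series length, so it can be organized as a small computation. The sketch below builds the distribution of the series length and then applies the definition of expected value; the names length_distribution and expected_length are hypothetical, introduced only for this illustration.

```python
from fractions import Fraction
from math import comb

def length_distribution(wins_needed, p):
    """Map each possible series length to its probability."""
    p = Fraction(p)
    q = 1 - p
    dist = {}
    for losses in range(wins_needed):
        # The winner takes the last game; the loser's wins fall among the earlier games.
        ways = comb(wins_needed - 1 + losses, losses)
        dist[wins_needed + losses] = ways * (p**wins_needed * q**losses
                                             + q**wins_needed * p**losses)
    return dist

def expected_length(wins_needed, p):
    """Expected number of games, from the definition of expected value."""
    return sum(n * prob for n, prob in length_distribution(wins_needed, p).items())

# Example 6: best-of-five series (three wins needed) with p = 3/4.
print(float(expected_length(3, Fraction(3, 4))))
```

Unlike the solution above, this version computes Prob(X = 5) directly rather than by subtraction, so checking that the probabilities sum to 1 gives an independent confirmation.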





* Recall that this means they play games until one team has won three games. 





Practice Problem 4 If rwo teams A and B play a best-of-three series, and if team 
A has a 3 probability of winning any given game, then what is the average number of 
games in the series? 


Since this analysis is similar for any problem involving series, it is very tempting
to answer a more general question once so that we can think about specific questions
on a higher level.


Example 7 If two teams A and B play a best-of-seven series, then assuming
that A wins a game with probability p (so B wins with probability 1 − p) and that the
games are independent events, what is the average number of games needed to settle
the series in terms of the variable p? For what value of p will the series take the longest?


SOLUTION The variable X representing the number of games a series takes
has values from among {4, 5, 6, 7}. In order to compute E[X], we need to know
Prob(X = n) for each of n ∈ {4, 5, 6, 7}.

Denote a series as an ordered list of A's and B's in which the winner appears
four times, the loser appears three or fewer times, and the winner appears last.
For example, AABBAA denotes a series in which A wins games 1, 2, 5, and 6
and B wins games 3 and 4. Note that ABAAAB, for example, does not denote
a possible series since the winner, A, does not appear last.


• For X to be 4, we must have either AAAA or BBBB, so

$\text{Prob}(X = 4) = p^4 + (1 - p)^4$

• For X to be 5, we must have either _ _ _ _ A or _ _ _ _ B, where in the first case
the blanks are filled by three A's and one B (four ways) and in the second
case the blanks are filled by three B's and one A (four ways). Each of the
four series in the first case occurs with probability $p^4(1 - p)$, and each of
the four in the second case occurs with probability $p(1 - p)^4$. Therefore,

$\text{Prob}(X = 5) = 4p^4(1 - p) + 4p(1 - p)^4$

• Similarly for X to be 6, we must have either _ _ _ _ _ A or _ _ _ _ _ B, where the
first case has three A's and two B's in the blanks (C(5, 3) = 10 ways) and
the second case has three B's and two A's in the blanks (also 10 ways).
Each series in the first case occurs with probability $p^4(1 - p)^2$, while each
series in the second case occurs with probability $p^2(1 - p)^4$, so

$\text{Prob}(X = 6) = 10p^4(1 - p)^2 + 10p^2(1 - p)^4$

• Using the same analysis for a seven-game series, one can determine that

$\text{Prob}(X = 7) = 20p^4(1 - p)^3 + 20p^3(1 - p)^4$

Therefore,

$E[X] = 4[p^4 + (1 - p)^4] + 20[p^4(1 - p) + p(1 - p)^4] + 60[p^4(1 - p)^2 + p^2(1 - p)^4] + 140[p^4(1 - p)^3 + p^3(1 - p)^4]$

$= 4 + 4p + 4p^2 + 4p^3 - 52p^4 + 60p^5 - 20p^6$





Figure 6-4 Series length versus probability of game win.


In the case where p = 1/2 (when we would expect that E[X] should be at its
largest), this means that E[X] = 93/16 = 5.8125. The graph that shows the
expected value for all values of p is given in Figure 6-4.
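The computation above can be checked mechanically. The following is a minimal Python sketch, not part of the original solution; the function names and the `wins_needed` parameter are hypothetical choices made for illustration. It builds Prob(X = n) from the counting argument (the series winner takes game n and exactly three of the first n - 1 games) and compares the resulting expected value with the polynomial obtained above.

```python
from fractions import Fraction
from math import comb

def series_length_distribution(p, wins_needed=4):
    """Return {n: Prob(X = n)} for a series won by the first team to reach wins_needed."""
    q = 1 - p
    dist = {}
    for n in range(wins_needed, 2 * wins_needed):
        # The winner takes game n and wins_needed - 1 of the first n - 1 games.
        ways = comb(n - 1, wins_needed - 1)
        losses = n - wins_needed
        dist[n] = ways * (p**wins_needed * q**losses + q**wins_needed * p**losses)
    return dist

def expected_series_length(p, wins_needed=4):
    """E[X] computed directly from the definition of expected value."""
    dist = series_length_distribution(p, wins_needed)
    return sum(n * prob for n, prob in dist.items())

def closed_form(p):
    """The degree-six polynomial for E[X] in the best-of-seven case."""
    return 4 + 4*p + 4*p**2 + 4*p**3 - 52*p**4 + 60*p**5 - 20*p**6

half = Fraction(1, 2)
print(expected_series_length(half))  # 93/16
```

Using exact fractions rather than floating-point numbers keeps the comparison with 93/16 free of rounding error.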


Solutions to Practice Problems 


1 Compare your answer with that of someone else in your class.

2 We make a simple change in Example 3:

E[X] = 0 · Prob(X = 0) + 5 · Prob(X = 5) + 25 · Prob(X = 25) ≈ 2.08

Since this is more than the $2 it costs to play the game each time, you will
win an average of about 8 cents each time you play over the long term, so this
is a game you should play, although no one can blame you for not getting too
excited about it.

3 Let X denote the number of heads tossed. Since the coin is tossed three
times, X takes on values from the set {0, 1, 2, 3}. By the definition of
expected value,

E[X] = (0) · Prob(X = 0) + (1) · Prob(X = 1) + (2) · Prob(X = 2) + (3) · Prob(X = 3)
     = (0)(1/8) + (1)(3/8) + (2)(3/8) + (3)(1/8)
     = 1.5

4 The variable X representing the number of games in a series has values 2 or 3.

• For X to be 2, we must have either AA or BB, so

Prob(X = 2) = (2/3)^2 + (1/3)^2

E[X] = 2 · Prob(X = 2) + 3 · Prob(X = 3)










Exercises for Section 6.4 


1. If you roll two dice many times and record the sum of
the two dice in each case, what do you expect will be
the average of all these sums? What if you use a pair of
loaded dice like the one in Example 2?

2. Suppose a three-person committee is formed for a club
by drawing names out of a hat. If the club has 15 men
and 10 women, what is the expected number of women
on the committee? How do you explain the meaning of
this to someone who cringes at the thought of “expecting”
a fractional number of women?

3. Suppose a three-person committee is formed for a club
by drawing names out of a hat. If the club has 30 men
and 20 women, what is the expected number of women
on the committee?

4. Suppose a three-person committee is formed for a
club by drawing names out of a hat. If the expected
number of women on the committee is 2.0 and there
are 10 men in the club, how many women are in the
club?

5. Suppose five cards are drawn from a standard deck.
What is the expected number of aces among the five
cards?

6. If two cards are drawn from a standard deck of cards,
what is the expected sum of the values on the cards?
(Use 10 as the value of a jack, queen, or king, 11 as the
value of an ace, and the numerical value for all other
cards.)

In any given course, Amy has a $ probability of getting. 

an A, a } probability of geting a B, a } probability of 

getting a C, a ! probability of getting a D, and a 4 prob- 
ability of getting an F. Amy's Dad is a firm believer in 
carrots over sticks, so he agrees to pay her $10 for each 

A, $5 for each B, and $1 for each C she receives. How

much does Amy expect to get each semester that she 

takes five courses? 

. In Exercise 19 of Section 63, we assumed Kenny hasa | 
probability of getting a hit and a 2 probability of draw- 
ing a base-on-balls every time he appears at the plate, 
Suppose he negotiates in his contract for an end-of-the- 
season bonus of $1,000 for each hit and $100 for each 
base-on-balls he gets. If he can reasonably expect 600
plate appearances this year, how much does he expect 
his bonus will be? 

9. Here is a real gambling game that we can analyze for 
fairness. A player pays $10 for the privilege to play. The 
player tosses a die three times with the following payoff: 
$10 for the first die that shows a 6, (an additional) $15 
for the second die that shows a 6, and (an additional) 
$20 for the third die to show a 6. What is the expected 
amount of money the player will win playing this game
one time? How much can the casino expect to make if
10,000 of these games are played each day?

10. In a new lottery game that has been proposed, the player
pays $2 and receives five fair dice. These dice are rolled
once, and the player receives
• $1 for a roll containing two of the same value;
• $2 for a roll containing three of the same value;
• $4 for a roll containing four-of-a-kind; and
• $8 for a roll containing five-of-a-kind.
How much can the player expect to win on average for
each time this game is played? If 100,000 people in the
state play this game each day, how much money will the
state make in one year from this game? How much can
you make the payoff for five-of-a-kind and still make a
profit on this game?

11. Use the identity in Exercise 36 of Section 5.3 and the
definition of expected value to find the expected number
of heads if six fair coins are tossed.

12. Use the identity in Exercise 36 of Section 5.3 and the
definition of expected value to find the expected number
of heads if N fair coins are tossed. Check your answer
with the prediction in Theorem 1.

13. Use the identity in Exercise 37 of Section 5.3 and the
definition of expected value to find the expected number
of times a 1 is seen in 10 rolls of a fair die.

14. Use the identity in Exercise 37 of Section 5.3 and the
definition of expected value to find the expected number
of times a 1 is seen in N rolls of a fair die. Check
your answer with the prediction in Theorem 1.

15. What is the expected number of boys in a two-children
household in which at least one child is known to be
a boy? Compare this answer to the expected number
of boys in a two-children household in which the older
child is known to be a boy. Why are these not the same?

16. If the Bisons have a 1 probability of winning any given 

F game against the Mustangs, what will be the average 
length of a best-of-five series between these two teams? 

17. I the Sharks have a 2 probability of winning any given 

F game against the Minnows, what will be the average 
length of a best-of-five series between these two teams? 

18, Repeat Example 7 for a best-of-three series 

19. Repeat Example 7 for a best-of-five series. 

If two teams A and B play a best-of-seven series, and 

if team A has a 2 probability of winning any given 

game, then whatis the expected number of gamesin the 
series? 














"B 


21. In a best-of-seven series between two evenly matched
teams (i.e., each team has a 1/2 probability of winning any
given game), what is the average margin of victory of
the series winner over the series loser? For example, a
team winning a series four games to one has a margin
of victory of 4 - 1 = 3.

22. Find the real average of lengths of best-of-seven series
in some professional sport. Compare to Example 7.

23. Use a graph of the expression in Example 7 to determine
what p must be in order for the expected length
of the series to be less than 5.

24. Flip a coin until it comes up heads, recording the number
of required tosses. Do this 20 times and find the average
number of required tosses. Now compute the theoretical
expected number of tosses to compare with your
result. (You may need to use the following fact about
infinite sums: 1/2 + 1/4 + 1/8 + ··· = 1.)


6.5 Recursion Revisited 





One of the themes in Section 5.5 was the art of counting the number of sequences of
a particular type using a recursive description of the sequences. The basic principle
involved is that when it comes time to construct sequences of length n, you may take
the point of view (essentially by mathematical induction) that you already know how
to construct sequences of length n - 1 or less.

For example, to construct a binary sequence of length n, we can use the two-step
algorithm: (i) Choose a first digit; (ii) choose a binary sequence of length n - 1
to follow the first digit.* This same approach can be applied to questions about
probability experiments.














Example 1 This problem revisits Exercise 24 of Section 6.4. Use recursive
reasoning to find the average number of tosses of a fair coin that it takes to get a result
of heads for the first time.

SOLUTION The set of all possible games is infinite, so we cannot simply count
the total number of outcomes as part of answering the probability question.
However, we can use recursive reasoning instead. The set S of all possible
sequences of tosses for this game looks like this:

S = {H, TH, TTH, TTTH, TTTTH, TTTTTH, TTTTTTH, ...}

Let x be the average number of flips to get the first result of heads. When we start
a new game, there is a 1/2 probability that the game will be over immediately
(i.e., the game will last one flip) and a 1/2 probability that it will last one flip
longer than average. Formally, this means that

x = (1/2)(1) + (1/2)(1 + x)

and the solution to this equation is clearly x = 2.
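As a sanity check on the recursive argument, the sketch below compares two computations in Python. The function names are hypothetical and the code is an illustration, not part of the original solution. The first function adds up (length) × (probability) over the first several sequences in S, and the second solves the recursive equation, generalized (as an assumption) to a coin that shows heads with probability p.

```python
from fractions import Fraction

def expected_tosses_partial_sum(terms, p=Fraction(1, 2)):
    """Sum n * Prob(first head on toss n) over the first `terms` sequences in S."""
    # The sequence with n tosses is n - 1 tails followed by one head.
    return sum(n * (1 - p) ** (n - 1) * p for n in range(1, terms + 1))

def expected_tosses_recursive(p=Fraction(1, 2)):
    """Solve x = p*(1) + (1 - p)*(1 + x) for x."""
    # Expanding the right side gives x = 1 + (1 - p)*x, so p*x = 1.
    return 1 / p

print(expected_tosses_recursive())              # 2
print(float(expected_tosses_partial_sum(50)))   # very close to 2
```

The partial sums approach 2 from below, which matches the value the recursive equation gives without any infinite sum.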


n 





We can visualize the recursive nature of the set S in Example 1 using a decision
tree structure. The important feature in Figure 6-5 is that all the ovals are identical,
and each occurs with probability 0.5 within the next larger oval.

* It follows that the number of binary sequences of length n is simply twice the number
of binary sequences of length n - 1.





Figure 6-5 The recursive nature of S.


Practice Problem 1 How many flips on average will it take for a coin to get its 
second result of heads? 


Remember that in the game of tennis you must win by at least two points, so
if there is a tie late in the game (this situation is called "deuce"), the game could
conceivably go on for a long time until someone has a two-point lead. A tennis game
at deuce is equivalent to a game that starts with a score of 0-0 and ends when either
player has a two-point lead, if that is easier to visualize.








Example 2 Suppose two tennis players are evenly matched (so each has a
probability of 1/2 of winning any given point). If the two players are tied at deuce, what
is the average number of additional points that must be played to resolve the game?

SOLUTION We will write a sequence of A's and B's as before to signify a list of
points in the order in which they are won. The set of all possible outcomes of
the experiment looks like

S = {AA, BB, ABAA, BAAA, ABBB, BABB, ABBAAA, ABBABB, ...}

so the expected length is the (infinite) sum of the length of each game times the
probability of that game occurring. To avoid the infinite sum, we use a recursive
model:

Case 1. The game looks like AA.

Case 2. The game looks like BB.

Case 3. The game looks like AB___, where the blank can be filled with
any game from S above.

Case 4. The game looks like BA___, where the blank can be filled with
any game from S above.

The average length of the games in S then is


(Length of games in case 1)(Prob. of having a game in case 1)
+ (Length of games in case 2)(Prob. of having a game in case 2)
+ (Length of games in case 3)(Prob. of having a game in case 3)
+ (Length of games in case 4)(Prob. of having a game in case 4)

The key here is to notice that the average length of games in cases 3 and 4 is two
more than the overall average length of games in S. Using this observation,
we can write the average length of games in S as

(2)(1/4) + (2)(1/4) + (2 + avg. length in S)(1/4) + (2 + avg. length in S)(1/4)

If we let the variable a represent the average length of games in S, this expression
becomes

a = 2 · (1/4) + 2 · (1/4) + (2 + a)(1/4) + (2 + a)(1/4) = 1 + (2 + a)/2

from which it follows that a = 4.
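The value a = 4 can also be supported empirically. The following Python sketch simulates games that start at deuce; the function names, the default number of trials, and the fixed seed are all assumptions made for illustration, and a simulation only approximates the exact answer.

```python
import random

def play_from_deuce(rng, p=0.5):
    """Play points until one player leads by two; return the number of points played."""
    lead = 0      # A's points minus B's points since deuce
    points = 0
    while abs(lead) < 2:
        # A wins the point with probability p; otherwise B wins it.
        lead += 1 if rng.random() < p else -1
        points += 1
    return points

def average_length(trials=100_000, p=0.5, seed=0):
    """Average number of points over many simulated deuce games."""
    rng = random.Random(seed)
    return sum(play_from_deuce(rng, p) for _ in range(trials)) / trials

print(average_length())  # close to 4 for evenly matched players
```

Every simulated game has an even length of at least 2, which mirrors the structure of the set S: zero or more split pairs (AB or BA) followed by AA or BB.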


Practice Problem 2 Fill in the details of the following solution to the problem 
“IfA and B are playing tennis and A wins each point with probability 3, what is the 


probability that A eventually wins a game that is currently tied at deuce?”
Representing games as we did in the previous example, we have four different
cases for what a tennis game can look like:

Case 1. The game looks like AA.

Case 2. The game looks like BB.

Case 3. The game looks like AB___, where the blank can be filled with
any game from S.

Case 4. The game looks like BA___, where the blank can be filled with
any game from S.

Because the points played are independent, the probability that A wins such a
game is

(Prob. of having a game in case 1) · (Prob. of A winning a game in case 1)
+ (Prob. of having a game in case 2) · (Prob. of A winning a game in case 2)
+ (Prob. of having a game in case 3) · (Prob. of A winning a game in case 3)
+ (Prob. of having a game in case 4) · (Prob. of A winning a game in case 4)

Let w denote the probability that A wins this kind of game. The analysis above can be 


translated into the equation 


«Gh +) 





OC) 


which can be solved algebraically to yield w 


| ———— ——————————— 













A tennis game is just a particular instance of a more general type of game studied
in what is known as the gambler's ruin problem. Imagine that when two tennis players
are tied at deuce, each player is given two "markers." When either player wins a point,
she takes a marker from her opponent, and the game ends when one player is out of
markers. This leads to exactly the same rules for resolving a deuce game in tennis,
and it is easy to see how to generalize it to what we will call the Hank and Ted game.

In this game, two players, whom we will always call Hank and Ted, each begin
with a pile of markers. A turn in the game consists simply of a coin being flipped. If
the result of the toss is heads, then Hank wins a marker from Ted, and otherwise Ted
wins a marker from Hank. The game is over when either player is out of markers. In
a gambling context, the end of the game occurs when one player is ruined. Get it?

Clearly, the person who starts with more markers is more likely to win all the
markers. Also, if both players begin with the same number of markers, it seems clear
that they should each have a 1/2 probability of winning the game. But this is perhaps
where our intuition abandons us. There are a couple of interesting questions in the
careful analysis of this game. Because there are infinitely many different ways the
game can be resolved, we will have to use recursive thinking to answer them.





Example 3 In the Hank and Ted game played with a fair coin,


1. Is it better to have 6 markers to your opponent's 4 markers or 7 markers to his 5,
or does it not matter?

2. Will a game that starts with 8 markers to 1 last longer or shorter on average than a
game that starts with 2 markers to 4?


SOLUTION 


1. Suppose the game is being played with a total of M markers. Let p_n be the
probability that Hank wins if he starts with n markers (and so Ted starts
with M - n markers). Clearly, p_0 = 0 and p_M = 1. Also, if Hank starts with
n markers and if the first coin toss is heads (probability = 1/2), then the rest
of the game is played (independent of the first toss) with Hank starting with
n + 1 markers. Similarly, if the first toss is tails, then the rest of the game
is played with Hank starting with n - 1 markers. By the rules of sums and
products, this means that

p_n = (1/2) p_{n+1} + (1/2) p_{n-1}   for every n ≥ 1   (6.1)

To solve this, note that equation (6.1) can be rewritten as

p_{n+1} - p_n = p_n - p_{n-1}   for every n ≥ 1

This means that the difference between any two consecutive values of p_i
is a constant; let's call it k. We know that p_M - p_0 = 1 - 0 = 1 and it is easy
to see that

p_M - p_0 = (p_M - p_{M-1}) + (p_{M-1} - p_{M-2}) + ··· + (p_1 - p_0) = (M)(k)

Therefore, k = 1/M. Moreover, using the same argument as above, we have
that p_n = p_n - p_0 = (n)(k), so we determine that p_n = n/M for all n. This
means that Hank has a 3/5 probability of winning if he starts with 6 markers
to Ted's 4, and he has a 7/12 probability of winning with 7 markers to Ted's 5.





2. Suppose again that the game is being played with M total markers, and let d_n
represent the expected duration (number of coin tosses) of a game in which
Hank starts with n markers. Clearly, d_0 = d_M = 0. Also, as before, there is a 1/2
probability that the game's first coin toss is heads and this move is followed
by a game in which Hank has n + 1 markers, which has an expected duration
of d_{n+1}. Similarly, there is a 1/2 probability that the game's first toss is tails,
which must be followed by a game of expected duration d_{n-1}. So the expected
duration of the game in which Hank starts with n markers (where n ≥ 1) is

d_n = (1/2)(d_{n+1} + 1) + (1/2)(d_{n-1} + 1)   (6.2)

We will not explicitly solve this, but the method would be similar to that
above. We can easily check that d_n = n × (M - n) satisfies the relation (6.2)
with the correct initial values. (See Exercise 23 below.) This means that the
expected duration of a game that starts with 8 markers to 1 is (8)(1) = 8 coin
tosses, and the game that starts with 2 markers to 4 also has an expected
duration of 8 = (2)(4) coin tosses.
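Both parts of this solution can be checked numerically. The Python sketch below is an illustration under stated assumptions, not the authors' method: the function names are hypothetical, and the win probabilities are found by a simple "guess p_1, then rescale" iteration of relation (6.1) rather than by the telescoping argument used above.

```python
from fractions import Fraction

def win_probabilities(total_markers):
    """Return [p_0, ..., p_M] for the fair Hank and Ted game with M markers in total."""
    # Relation (6.1) rearranges to p_{n+1} = 2*p_n - p_{n-1}. Starting from
    # p_0 = 0 and a trial value p_1 = 1, every later term is a multiple of p_1,
    # so rescaling at the end forces p_M = 1.
    trial = [Fraction(0), Fraction(1)]
    for n in range(1, total_markers):
        trial.append(2 * trial[n] - trial[n - 1])
    scale = trial[total_markers]
    return [value / scale for value in trial]

def expected_durations(total_markers):
    """Return [d_0, ..., d_M] using the closed form d_n = n * (M - n)."""
    return [n * (total_markers - n) for n in range(total_markers + 1)]

def satisfies_duration_relation(d):
    """Check d_0 = d_M = 0 and relation (6.2), written as 2*d_n = (d_{n+1} + 1) + (d_{n-1} + 1)."""
    interior = all(2 * d[n] == (d[n + 1] + 1) + (d[n - 1] + 1)
                   for n in range(1, len(d) - 1))
    return d[0] == 0 and d[-1] == 0 and interior

print(win_probabilities(10)[6])    # 3/5  (6 markers to 4)
print(win_probabilities(12)[7])    # 7/12 (7 markers to 5)
print(expected_durations(9)[8], expected_durations(6)[2])  # 8 8
```

The last line reproduces the answer to the second question: the 8-to-1 game and the 2-to-4 game both have an expected duration of 8 tosses.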


Practice Problem 3 Suppose the Hank and Ted game is played with an unfair coin
that comes up heads with probability 2/3. How should you modify recurrence relations
(6.1) and (6.2) to model this new game?


Solutions to Practice Problems 


1 Let x denote the average length of such a sequence. Each
sequence of coin tosses looks like one of the following:

• T___, where the blank contains a sequence of
H's and T's that stops when the second heads is
tossed. This means that the average length of the
sequences that go in the blank is x; hence, the average
length of all sequences in this case is x + 1.

• H___, where the blank contains a sequence of
H's and T's that stops when the first heads is tossed.
We know from Example 1 that the average length
of the sequence in the blank is 2; hence, the average
length of all sequences in this case is 3.

Since each case above occurs with probability 1/2, we can
put this together to conclude that

x = (1/2)(x + 1) + (1/2) · 3

which has solution x = 4.


Exercises for Section 6.5 


1. What is the expected number of rolls of a six-sided die 
that is rolled until a 1 appears? 

2. How many times on average will three dice have to be 
tossed for the sum to exceed 12 for the first time? 


2 The blanks occur at the end of the solution as follows:
*7() 06) - (G) 


HOSES 


which can be solved algebraically to yield w 
3 Following the reasoning that gave us (6.1) and (6.2), we
will have

p_n = (2/3) p_{n+1} + (1/3) p_{n-1}   for every n ≥ 1

and

d_n = (2/3)(d_{n+1} + 1) + (1/3)(d_{n-1} + 1)   for every n ≥ 1





3. A pair of dice are thrown until at least one of the dice 
comes up 1 for the first time. How many tosses, on av- 
erage, are required? 

4. Generalize the previous three exercises by proving
the statement “If a trial is successful with probability
p and an experiment consists of performing the trial
repeatedly until it is successful for the first time, the
expected number of trials in the experiment is 1/p.”

5. The following problems are not examples of recursive
problems and the previous exercise cannot be applied
to them:

(a) If a deck of cards is shuffled well and cards are dealt
face-up until the first ace appears, how many cards
would you expect on average to deal?

(b) If a deck of cards is shuffled well and cards are
dealt face-up until the first spade appears, how many
cards would you expect on average to deal?

(c) Explain why Exercise 4 does not help with these
problems.

6. How many tosses of a coin on average will it take before
you have seen at least one head and at least one tail?

7. Let h_n be the expected number of tosses required to
obtain the nth head on repeatedly tossing a fair coin.
Generalize the solution to Practice Problem 1 to find a
recursive description for h_n.

8. Which are you more likely to see first if you repeatedly
toss a coin, a head followed immediately by a head or a
tail followed immediately by a head? (Be careful. This
problem is not like the others.)

. Ifthe server ina tennis game hasa $ probability of win- 

ning cach point she plays, what is the probability she will 
eventually win a game that is now tied at deuce? 

If the serverin a tennis game has a $ probability of win- 

ning cach point she plays, on average how many more 

points will a game last that is now tied at deuce? 

. Ifthe server ina tennis game has a 3 probability of win- 

ning each point she plays, on average how many more 
points will a game last that is now tied at deuce? 

If the server in a tennis game has a 3 probability of win- 

ning each point she plays, what is the probability she 

will eventually win a game that is now tied at deuce? 

13. Generalize the previous four exercises. Specifically, if
the server in a tennis game has probability p of winning
each point she plays, what is the probability she will
eventually win a game that is now tied at deuce? What
is the expected number of points of such a game?

14, The following questions refer to Example 2 and tie in 
with the coin-tossing tennis experiments in the exercises 
of the previous section. A and B are playing tennisstart- 
ing with a score of 0-0, Remember that the first player 
to win at least four points wins and being ahead by at 
least two points wins the game. Assume each player has 
a } probability of winning any given point. 

(a) What is the probability that Player A wins after ex- 
actly four points are played? Exactly five points? 
Exactly six points? Exactly seven points? Exactly 
eight points? 


WE We 





"BOW 


a. 


16. 


n. 


WE 


az 





(b) Whatisthe probability that the game will be stuck at 
“deuce” after exactly n points? (Think recursively.) 

c) What is the probability that Player A wins after ex- 
actly n points are played? (Use the previous fact.) 

Ifthe server winsa tennis point with probability 2, what. 

is the probability that the server wins a tennis game that 

starts at 0.0? 

If the server wins a tennis point with probability 3, what 

is the average length of a tennis game that starts 0-0? 

When Pete and André play table tennis, they like to 

play to seven with the rule that one has to win a game 

by three points. This exercise suggests a way to analyze 
the length of this kind of game once it is tied at the end 

‘of the game. Assume that Pete and André each have a 

probability of } of winning cach point they play. 

(a) Leti represent the average length of the game once 
it is tied (at the end of the game when the "win- 
by-three” rule applies), let a represent the average 
length ofa game in which André istwo pointsahead 
of Pete, and let p represent the average length of a 
game in which Pete is two points ahead of André. 
Using Example 2 as a guide, explain why 











1 1 
Qe Qa) Qe p) 


(b) In the same spirit, explain why 


let 1 

a=} 143 @+a)+3 Qu) and 
tiai 1 

p=3 1+3 piden 


ic) Use algebra to determine the values of the un- 
knowns t, a, and p from the three equations we 
have found. 

18. What is the probability that Hank wins the Hank and
Ted game in which Hank starts with 6 markers to Ted's
8 markers? Play the game a number of times with a coin
or online to support this theoretical result.

19. What is the expected length of the Hank and Ted game
in which Hank starts with 6 markers to Ted's 8 markers?
Play the game a number of times with a coin or online
to support this theoretical result.

20. Without using the "formulas" derived in Example 3, explain
why it makes sense that the Hank and Ted game in
which Hank and Ted each start with 7 markers should
last exactly one move longer than the game described
in Exercise 19.

21. Based on the analysis in this section, fill in the blanks in
the sentence below:

In the Hank and Ted game played with a fair coin, if
Hank starts with X markers and Ted starts with Y markers,
then the probability that Hank wins is ____; the
probability that Ted wins is ____; and the
expected length of the game is ____ moves.

Table 6-4 Table for Exercise 22(b)


22. These problems involve a careful analysis of the numbers
d_n in Example 3. Assume for these problems that
we are playing the Hank and Ted game with a fair coin,
where Hank starts the game with 6 markers to Ted's 4
markers. (Hence, d_0 = d_10 = 0.)

(a) Use equation (6.2) to show that. 


did = the ~ dea 


(b) Letting the letter k represent d; — d», use your pre- 
vious answer to fill in Table 6-4. 





(c) Use the values in the table above with the fact that

(d_10 - d_9) + (d_9 - d_8) + (d_8 - d_7) + ··· + (d_1 - d_0) = d_10 - d_0 = 0

to find a value of k.

(d) Use this value of k to fill in Table 6-5.

23. Algebraically verify that the function d_n = (n)(M - n)
satisfies the recurrence relation (6.2) and the conditions
d_0 = d_M = 0.

In Practice Problem 2, we addressed a Hank and Ted 
game in which Hank has snuck an unfair coin into the 
game that comes up heads 1 of the time. In the solu- 
tion to that problem, we showed that the probability p. 
of Hank winning when he has n markers satisfies the 
recurrence relation 





2 1 

FP + shes 

Fillin the details below to find the probability that Hank 

wins this game if each player starts with 10 markers, 

(a) Show the algebra that allows the given recurrence 
relation to be rewritten as 


Pa 


1 
Past Pa=3(Pn~ Pet) (63) 
(b) Leting k= pi — p, explain why (6.3) implies that 


Td. 
Pest — Bex 


* 


for all n > 0. 
c) Use the fact that 





Pa- Po = (Pa Pas) + (Drv ~ Pu) +- 


gi p) (Qi — m) 


to explain why px — py =k (55s!) 
4) Show that for 1 < n <20, 
2-1 
y) 


p (= 


Bai 

and use this result to answer the given question, 
In Practice Problem 2, we addressed a Hank and Ted 
game in which Hank has snuck an unfair coin into the 
‘game that comes up heads $ of the time. In the solution 
1o that problem, we showed that the expected length 
d, of a game when Hank has n markers satisfies the 
recurrence relation 


4, 








aene Hit) 


For the game in which each player starts with 10 mark- 
(a) Show algebraically that any closed formula with the 
form 





satisfies the recurrence relation. 

(b) Find the value of so that ds = 0 

(©) Compute the value of dio. 

(d) Find the initial distribution of markers for which the 
game is expected to last the longest. 

In Exercise 24, we examined the Hank and Ted game 

played with a coin that comes up heads with probabil- 

ity 3 If Hank starts such a game with 10 markers, how 

many markers should Ted start with in order for the 

game to be fair? (That is, how can we assure that the 

Probability of cach player winning is as close to $ as 

possible?) 

















"UM 


a! 





Tt 


Table 6-5 Table for Exercise 22(d) 










6.6 Excursion: Matrices and Markov Chains 





The recurrence relations from the previous section all came from the same idea:
each game is modeled as some initial outcome followed by the remainder of the
game. In addition, the games involved had well-defined states determined by a score
or a distribution of markers. Problems involving states and rules for moving between
the states can be more easily modeled through the mathematical operation of matrix
multiplication.

In this section we will develop the concept of matrix multiplication in the context
of the probability rules we have studied so far. Students already familiar with adding
and multiplying matrices will learn a new way to think about these operations, so
the beginning of the section should not be skipped. In order to investigate real
games using these tools, some form of technology capable of matrix arithmetic should
be used, since the matrices involved can be large and the number of operations
performed with them can be even larger.

We will introduce the idea of matrix multiplication using the Hank and Ted game
played with three total markers. The “matrix” involved is simply an array with rows
and columns labeled with the four possible states of the game as follows:








State 1 is when Hank has 0 markers to Ted's 3.
State 2 is when Hank has 1 marker to Ted's 2.
State 3 is when Hank has 2 markers to Ted's 1.
State 4 is when Hank has 3 markers to Ted's 0.


The entries in the array will be the probabilities of the game moving from one
state to another in one move of the game. For example, the entry in row 2, column 1
is the probability of the game changing in one move from state 2 (Hank 1, Ted 2) to
state 1 (Hank 0, Ted 3), which is 1/2 since this transition happens if the coin comes up
tails. On the other hand, the entry in row 2, column 2 is the probability of the game
changing in one move from state 2 (Hank 1, Ted 2) to state 2; such a transition is
impossible, so the probability of it happening is 0.

If we continue doing this for all pairs of states of the game, we will arrive at the
4 × 4 array:

M =
Row 1 (state 1) →  [  1     0     0     0  ]
Row 2 (state 2) →  [ 1/2    0    1/2    0  ]
Row 3 (state 3) →  [  0    1/2    0    1/2 ]
Row 4 (state 4) →  [  0     0     0     1  ]

We call this the transition matrix for this game since it shows the probabilities of
all one-move transitions between states in the game. In the formal definition below
and in the remainder of this section, we use the notation M_{i,j} to refer to the entry in
row i and column j of a matrix M.





Definition The transition matrix for a game with states 1, 2, ..., n is the
matrix M with

M_{i,j} = Prob(the game changes from state i to state j in one move)














Notes:

1. We are making an implicit assumption that the moves of the game are independent
events, so the entries in the transition matrix are constant probabilities and the
product rule can be correctly used.

2. Since state 1 and state 4 represent the end of the game, the transition matrix shows
a probability of 1 that the game stays in whichever of these two states the game
ends up in. This is the way in which we deal with the indefinite length of the game.
We essentially let all games continue forever, eventually becoming trapped in one
of the "winning" states. These states are called absorbing states.

3. The defining characteristics of this type of matrix are that all the entries are
probabilities and every row has entries that sum to 1. That is, any matrix with these
two characteristics is the transition matrix for some game.
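The construction of the transition matrix generalizes to any number of markers. The following Python sketch is an illustration only; the function name and the `p_heads` parameter are hypothetical. Note that Python lists are 0-indexed, so the list index equals Hank's marker count, whereas the text numbers the states from 1 (state i means Hank holds i - 1 markers).

```python
from fractions import Fraction

def transition_matrix(total_markers, p_heads=Fraction(1, 2)):
    """Build the transition matrix; index h stands for 'Hank holds h markers'."""
    size = total_markers + 1
    matrix = [[Fraction(0)] * size for _ in range(size)]
    # The two end states are absorbing: the game stays there with probability 1.
    matrix[0][0] = Fraction(1)
    matrix[total_markers][total_markers] = Fraction(1)
    for hank in range(1, total_markers):
        matrix[hank][hank + 1] = p_heads       # heads: Hank wins a marker
        matrix[hank][hank - 1] = 1 - p_heads   # tails: Ted wins a marker
    return matrix

for row in transition_matrix(3):
    print([str(entry) for entry in row])
```

With three markers in total this reproduces the 4 × 4 array above, and every row sums to 1, as Note 3 requires.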


Matrix Multiplication 


Now that we have captured all the probabilities for how the Hank and Ted game
can change in a single move, a natural next question is "What is the probability of
the game moving from state i to state j in two moves?" We will use the notation
Prob(i → * → j) to represent this probability. Answering this type of question is
simply an application of the sum and product rules for probability. For example, to
compute the probability of the game moving from state 3 to state 3 in two moves
(Prob(3 → * → 3)), we recognize that this may be accomplished only by moving from
state 3 to state 2 and then state 2 to state 3. Extending our notation a bit and using
the product rule, we can compute

Prob(3 → 2 → 3) = (1/2)(1/2) = 1/4

In general, the product rule tells us that

Prob(3 → x → 3) = Prob(3 → x) · Prob(x → 3)

Of course, moving from state 3 to state 4 to state 3 is impossible since state 4 is an
absorbing state, but this fact is correctly reflected in the above rule:

Prob(3 → 4 → 3) = Prob(3 → 4) · Prob(4 → 3)
                = (M_{3,4}) · (M_{4,3})
                = (1/2)(0) = 0

In some games, there might be more ways to get from state 3 to state 3 in two
moves, but we can account for all possible intermediate stages by simply invoking
the sum rule. For example, to fully consider all the possible ways to go from state 3
to state 3, we should write

Prob(3 → * → 3) = Prob(3 → 1 → 3) + Prob(3 → 2 → 3)
                + Prob(3 → 3 → 3) + Prob(3 → 4 → 3)

which in this case just gives us the same answer as before:

Prob(3 → * → 3) = 0 + 1/4 + 0 + 0 = 1/4







This computation can be made directly from the transition matrix 


\[
M = \begin{pmatrix}
1 & 0 & 0 & 0 \\
1/2 & 0 & 1/2 & 0 \\
0 & 1/2 & 0 & 1/2 \\
0 & 0 & 0 & 1
\end{pmatrix}
\]


by looking at row 3, which reflects the probabilities of where the game can go from
state 3, and column 3, which reflects the probabilities of how the game can get to
state 3. There we see

\[
\begin{aligned}
\mathrm{Prob}(3 \to 1 \to 3) &= \mathrm{Prob}(3 \to 1) \cdot \mathrm{Prob}(1 \to 3) = M_{31} \cdot M_{13} = (0)(0) = 0 \\
\mathrm{Prob}(3 \to 2 \to 3) &= \mathrm{Prob}(3 \to 2) \cdot \mathrm{Prob}(2 \to 3) = M_{32} \cdot M_{23} = \left(\tfrac{1}{2}\right)\left(\tfrac{1}{2}\right) = \tfrac{1}{4} \\
\mathrm{Prob}(3 \to 3 \to 3) &= \mathrm{Prob}(3 \to 3) \cdot \mathrm{Prob}(3 \to 3) = M_{33} \cdot M_{33} = (0)(0) = 0 \\
\mathrm{Prob}(3 \to 4 \to 3) &= \mathrm{Prob}(3 \to 4) \cdot \mathrm{Prob}(4 \to 3) = M_{34} \cdot M_{43} = \left(\tfrac{1}{2}\right)(0) = 0
\end{aligned}
\]

In other words, the probability of going from state 3 to state 3 in two steps is

\[ M_{31} \cdot M_{13} + M_{32} \cdot M_{23} + M_{33} \cdot M_{33} + M_{34} \cdot M_{43} \]

That is, row 3 of $M$ is multiplied by column 3 of $M$ term for term and the results
added. We will refer to this operation as row-column multiplication, and it is always
possible to perform as long as the "row" and the "column" in question contain the
same number of entries.

The operation of matrix multiplication is just the extension of this simple idea
to an entire matrix.





Definition  Given matrices $M$ and $N$ where the number of entries in the
rows of $M$ is the same as the number of entries in the columns of $N$, we define
the product $M \cdot N$ to mean the new matrix $P$ such that the entry in row $i$,
column $j$ of $P$ is the row-column product of row $i$ from $M$ and column $j$ from $N$.
Formally, we write $P = M \cdot N$ to mean that

\[ P_{i,j} = M_{i,1} \cdot N_{1,j} + M_{i,2} \cdot N_{2,j} + M_{i,3} \cdot N_{3,j} + \cdots \]








Note that the matrix products $M \cdot N$ and $N \cdot M$ are not defined to be the same,
and we shall see below that they usually are not. Also note that it is absolutely
essential that $M$ have rows the same length as the columns of $N$ for the definition of
$M \cdot N$ to make sense.
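The row-column rule in the definition above can be sketched in a few lines of Python. This is only an illustration, not part of the original text: the function name `mat_mul` and the small matrices `A` and `B` are hypothetical, and exact fractions are used so that the entries match the hand computations.

```python
from fractions import Fraction

def mat_mul(M, N):
    """Row-column product: entry (i, j) is row i of M times column j of N."""
    if any(len(row) != len(N) for row in M):
        raise ValueError("rows of M must be as long as the columns of N")
    columns = list(zip(*N))  # the columns of N, read off as tuples
    return [[sum(m * n for m, n in zip(row, col)) for col in columns]
            for row in M]

half = Fraction(1, 2)
# Transition matrix of the 3-marker Hank and Ted game (from the text).
M = [[1, 0, 0, 0],
     [half, 0, half, 0],
     [0, half, 0, half],
     [0, 0, 0, 1]]

M2 = mat_mul(M, M)
print(M2[2][2])  # row 3, column 3 of M . M (Python lists are 0-indexed): 1/4

# Order matters: these two hypothetical 2 x 2 matrices give different products.
A = [[1, 2], [3, 4]]
B = [[0, 1], [1, 0]]
print(mat_mul(A, B), mat_mul(B, A))
```

The length check mirrors the requirement that the rows of the first matrix be as long as the columns of the second; when it fails, the product is simply not defined.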

It is advisable to stop here and practice matrix multiplication, especially if it is
new to you.
cma ub m 
Practice Problem 1 LaA- 12 0 i2 0 |.a=|§ $, 
0 12 0 12 ANS 


c fe 2 jana = [5 Jt 







Compute each of the following or explain why it cannot be done: 


LAB $c 
2 B-A & CI 
3C-B 7. 1-C 
4 B.C & BI 


Based on your answers above, why is I known as an identity matrix? 


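Returning to the transition matrix $M$ for the 3-marker Hank and Ted game, we
can now form the entire matrix product $M \cdot M$, which can be written $M^2$.

\[
M^2 =
\begin{pmatrix}
1 & 0 & 0 & 0 \\
1/2 & 0 & 1/2 & 0 \\
0 & 1/2 & 0 & 1/2 \\
0 & 0 & 0 & 1
\end{pmatrix}
\cdot
\begin{pmatrix}
1 & 0 & 0 & 0 \\
1/2 & 0 & 1/2 & 0 \\
0 & 1/2 & 0 & 1/2 \\
0 & 0 & 0 & 1
\end{pmatrix}
=
\begin{pmatrix}
1 & 0 & 0 & 0 \\
1/2 & 1/4 & 0 & 1/4 \\
1/4 & 0 & 1/4 & 1/2 \\
0 & 0 & 0 & 1
\end{pmatrix}
\]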


The notion of matrix multiplication has been based on the use of the sum and
product rules for probabilities to find that the probability of a game moving from state
3 to state 3 in two moves is the entry in row 3, column 3 of the product $M \cdot M = M^2$.
In general, the entry in row $i$, column $j$ of $M^2$ is the probability that the game
progresses from state $i$ to state $j$ in two moves. This fact, in turn, can be further
generalized to the following theorem.





Theorem 1  If $M$ is an $n \times n$ transition matrix reflecting the one-move transition
probabilities for states 1 through $n$ of a game, then for any integer $k \ge 1$, the entry in
row $i$, column $j$ of the matrix $M^k$ is the probability of the game moving from state $i$ to
state $j$ in $k$ moves.


PROOF The proof is done by induction on k and is left for Exercise 2: m" 
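Theorem 1 can be spot-checked numerically (this is not a proof). The sketch below compares the entries of $M^3$ with a brute-force application of the sum and product rules over every three-move path. The helper names `mat_pow` and `path_probability` are illustrative assumptions, not notation from the text.

```python
from fractions import Fraction as F
from itertools import product

half = F(1, 2)
# Transition matrix of the 3-marker Hank and Ted game (from the text).
M = [[F(1), F(0), F(0), F(0)],
     [half, F(0), half, F(0)],
     [F(0), half, F(0), half],
     [F(0), F(0), F(0), F(1)]]

def mat_mul(A, B):
    cols = list(zip(*B))
    return [[sum((a * b for a, b in zip(row, col)), F(0)) for col in cols]
            for row in A]

def mat_pow(A, k):
    """Return A^k for an integer k >= 1 by repeated multiplication."""
    result = A
    for _ in range(k - 1):
        result = mat_mul(result, A)
    return result

def path_probability(A, i, j, k):
    """Sum rule over every k-move path from state i to state j (1-based);
    each path's probability comes from the product rule."""
    n = len(A)
    total = F(0)
    for middle in product(range(n), repeat=k - 1):
        path = (i - 1, *middle, j - 1)
        p = F(1)
        for a, b in zip(path, path[1:]):
            p *= A[a][b]
        total += p
    return total

M3 = mat_pow(M, 3)
print(M3[1][0], M3[2][1])            # entries (2,1) and (3,2) of M^3
print(path_probability(M, 2, 1, 3))  # same value as the (2,1) entry
```

The brute-force count visits $n^{k-1}$ paths, so it is only practical for small $k$; the matrix power gives the same numbers with far less work.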





A side effect of this is the fact that $M^k$ itself is another transition matrix, so the
entries must all be probabilities with rows summing to 1. This is significant because
it means that when we raise a transition matrix to a large power (as we will do next),
we will not get huge numbers. It also meshes with our understanding of the games
themselves. For example, if we use a matrix $M$ to model a children's game where a
single die is used to generate moves, then the matrix $M^2$ accurately models the same
game with each turn consisting of rolling the die twice.





Exploiting the Matrix Model 


There are two points to doing all of this. First of all, there are standard computer
applications and advanced calculators that are very good at doing matrix arithmetic.
Consequently, by modeling our problem in these terms, we can bring the computational
power of our machines to work on the problem without having to write any
specialized code. The second point to using matrix models comes from the wealth
of mathematical knowledge about systems that have matrix representations. This
includes ways to give elegant solutions to our problems and more advanced analysis
than we will see in this course. You will most likely see these so-called Markov chain
techniques in a first course in linear algebra.







We now return to our example and the analysis of the simple Hank and Ted
game with 3 markers. We saw that from the transition matrix

\[
M = \begin{pmatrix}
1 & 0 & 0 & 0 \\
1/2 & 0 & 1/2 & 0 \\
0 & 1/2 & 0 & 1/2 \\
0 & 0 & 0 & 1
\end{pmatrix}
\]

for the game (which describes how the game could progress from each state to each
other state in one move of the game) we could find the matrix

\[
M^2 = \begin{pmatrix}
1 & 0 & 0 & 0 \\
1/2 & 1/4 & 0 & 1/4 \\
1/4 & 0 & 1/4 & 1/2 \\
0 & 0 & 0 & 1
\end{pmatrix}
\]

which describes the probabilities of getting from each state to each other state in
exactly two moves of the game. Theorem 1 tells us that $M^3$ similarly describes the
probabilities of getting from each state to each other state in exactly three moves of
the game, and so on.

Example 1  Given

\[
M^3 = \begin{pmatrix}
1 & 0 & 0 & 0 \\
5/8 & 0 & 1/8 & 1/4 \\
1/4 & 1/8 & 0 & 5/8 \\
0 & 0 & 0 & 1
\end{pmatrix}
\]

1. What is the probability that a game in state 2 is in state 1 three moves later?

2. What is the probability that a game starting with Hank 2, Ted 1 has Hank 1, Ted 2
after three tosses of the coin?

SOLUTION

1. According to Theorem 1, this is the row 2, column 1 entry in $M^3$, which is $\tfrac{5}{8}$.

2. Since Hank 2, Ted 1 is state 3 and Hank 1, Ted 2 is state 2, this question is
answered by the row 3, column 2 entry in $M^3$, which is $\tfrac{1}{8}$.









Practice Problem 2  Given that

\[
M^4 = \begin{pmatrix}
1 & 0 & 0 & 0 \\
5/8 & 1/16 & 0 & 5/16 \\
5/16 & 0 & 1/16 & 5/8 \\
0 & 0 & 0 & 1
\end{pmatrix}
\]

1. What is the probability that a game in state 2 is in state 4 four moves later?
2. What is the probability that a game starting with Hank 2, Ted 1 has Hank 1, Ted 2
after four tosses of the coin?










Computing Large Powers with Technology 


If we have a computer or calculator available, we can just as easily compute large
powers like

\[
M^{1000} \approx \begin{pmatrix}
1 & 0 & 0 & 0 \\
0.666\ldots & 10^{-301} & 0 & 0.333\ldots \\
0.333\ldots & 0 & 10^{-301} & 0.666\ldots \\
0 & 0 & 0 & 1
\end{pmatrix}
\]

This allows us to collect information about the game after an enormous number of
moves. Remember that the original matrix was set up so that instead of the game
actually ending, it just gets stuck eventually in either state 1 or 4. So, for example,
the fact that the probability of a game progressing from state 2 to state 4 and being
stuck there in 1,000 moves is 0.333... (the row 2, column 4 entry of $M^{1000}$)
means that a game starting with Hank 1, Ted 2 ends in a win for Hank within 1,000
moves with probability 0.333....
The following facts follow similarly from the matrix $M^{1000}$:





• A game starting with Hank 1, Ted 2 ends up as a win for Ted within 1,000 moves
with probability 0.666....

• A game starting with Hank 1, Ted 2 has Hank 1, Ted 2 after 1,000 moves with
probability about $10^{-301}$.

• A game starting with Hank 1, Ted 2 has Hank 2, Ted 1 after 1,000 moves with
probability 0.





In summary, after 1,000 moves, the probability that the Hank 1, Ted 2 game ends
in a win for Hank is about $\tfrac{1}{3}$, the probability that the Hank 1, Ted 2 game ends in a
win for Ted is about $\tfrac{2}{3}$, and the probability that the game is still going on is very, very
small. This is consistent with the solution to the general Hank and Ted problem that
we formalized in Exercise 21 of Section 6.5.
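For readers without a matrix-capable calculator, the 1,000th power can also be computed exactly with Python's standard `fractions` module. The sketch below is one possible approach, not the method used in the text; `mat_pow` is a hypothetical helper that uses repeated squaring to keep the number of multiplications small.

```python
from fractions import Fraction

def mat_mul(A, B):
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]

def mat_pow(A, k):
    """A^k for an integer k >= 1, by repeated squaring."""
    result = None
    square = A
    while k:
        if k % 2:
            result = square if result is None else mat_mul(result, square)
        square = mat_mul(square, square)
        k //= 2
    return result

half = Fraction(1, 2)
M = [[1, 0, 0, 0],
     [half, 0, half, 0],
     [0, half, 0, half],
     [0, 0, 0, 1]]

P = mat_pow(M, 1000)
print(float(P[1][0]), float(P[1][3]))    # close to 2/3 and 1/3
print(P[1][1] == Fraction(1, 2**1000))   # True: the tiny entry is exactly 2^-1000
```

Because $2^{-1000}$ is roughly $9.3 \times 10^{-302}$, the chance that the game is still undecided after 1,000 moves is negligible, which is what the approximate matrix suggests.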

These observations can be generalized as the following corollary to Theorem 1. 











Corollary 2 Given a transition matrix M with absorbing state j, the probability 
that a game beginning in state i will eventually be absorbed in state j is given by the 
row i, column j entry of the matrix 


\[ M_\infty = \lim_{n \to \infty} M^n \]


provided this matrix exists. 


For those who have not seen the limit notation used before, it merely means that
if the matrix powers "settle down" on a single matrix as $n$ grows without bound, then
that single matrix is the limit of the powers of $M$. We will not address the mathematical
formalities of this statement, but rather, we will rely on experimentation and intuition
about the nature of the games to which we apply this idea. For example, by looking
at the $M^{1000}$ above as well as

\[
M^{10000} \approx \begin{pmatrix}
1 & 0 & 0 & 0 \\
0.666\ldots & 10^{-3010} & 0 & 0.333\ldots \\
0.333\ldots & 0 & 10^{-3010} & 0.666\ldots \\
0 & 0 & 0 & 1
\end{pmatrix}
\]

it appears that the powers are settling on the matrix

\[
M_\infty = \begin{pmatrix}
1 & 0 & 0 & 0 \\
2/3 & 0 & 0 & 1/3 \\
1/3 & 0 & 0 & 2/3 \\
0 & 0 & 0 & 1
\end{pmatrix}
\]





Hence, the entries in column 1 and column 4 give all the probabilities of a
game ending up in state 1 (a win for Ted) and state 4 (a win for Hank), respectively.

Let's put together all this information to observe the complete analysis of a
game.

Example 2  Suppose Hank and Ted play their game starting with 5 total markers.
There are six possible states in the play of this game:

• State 1 is $H = 0$, $T = 5$.
• State 2 is $H = 1$, $T = 4$.
• State 3 is $H = 2$, $T = 3$.
• State 4 is $H = 3$, $T = 2$.
• State 5 is $H = 4$, $T = 1$.
• State 6 is $H = 5$, $T = 0$.














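Find the transition matrix for this game, and find and interpret $M_\infty$.

SOLUTION  For the transition matrix

\[
M = \begin{pmatrix}
1 & 0 & 0 & 0 & 0 & 0 \\
1/2 & 0 & 1/2 & 0 & 0 & 0 \\
0 & 1/2 & 0 & 1/2 & 0 & 0 \\
0 & 0 & 1/2 & 0 & 1/2 & 0 \\
0 & 0 & 0 & 1/2 & 0 & 1/2 \\
0 & 0 & 0 & 0 & 0 & 1
\end{pmatrix}
\]

we can compute the following:

\[
M^{100} \approx \begin{pmatrix}
1 & 0 & 0 & 0 & 0 & 0 \\
0.8 & 10^{-10} & 0 & 10^{-10} & 0 & 0.2 \\
0.6 & 0 & 10^{-10} & 0 & 10^{-10} & 0.4 \\
0.4 & 10^{-10} & 0 & 10^{-10} & 0 & 0.6 \\
0.2 & 0 & 10^{-10} & 0 & 10^{-10} & 0.8 \\
0 & 0 & 0 & 0 & 0 & 1
\end{pmatrix}
\]

\[
M^{1000} \approx \begin{pmatrix}
1 & 0 & 0 & 0 & 0 & 0 \\
0.8 & 10^{-93} & 0 & 10^{-93} & 0 & 0.2 \\
0.6 & 0 & 10^{-93} & 0 & 10^{-93} & 0.4 \\
0.4 & 10^{-93} & 0 & 10^{-93} & 0 & 0.6 \\
0.2 & 0 & 10^{-93} & 0 & 10^{-93} & 0.8 \\
0 & 0 & 0 & 0 & 0 & 1
\end{pmatrix}
\]

From this it appears that the powers are settling on the matrix

\[
M_\infty = \begin{pmatrix}
1 & 0 & 0 & 0 & 0 & 0 \\
4/5 & 0 & 0 & 0 & 0 & 1/5 \\
3/5 & 0 & 0 & 0 & 0 & 2/5 \\
2/5 & 0 & 0 & 0 & 0 & 3/5 \\
1/5 & 0 & 0 & 0 & 0 & 4/5 \\
0 & 0 & 0 & 0 & 0 & 1
\end{pmatrix}
\]

From this information we can record in Table 6-6 the probability of either player
winning from any given starting position.

Starting state  | Prob(Ted wins) | Prob(Hank wins)
----------------|----------------|----------------
Hank 1, Ted 4   | 4/5            | 1/5
Hank 2, Ted 3   | 3/5            | 2/5
Hank 3, Ted 2   | 2/5            | 3/5
Hank 4, Ted 1   | 1/5            | 4/5

Table 6-6  Probabilities for the Hank and Ted Game with 5 Markers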
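The analysis in Example 2 follows a pattern that is easy to automate. The sketch below builds the transition matrix for any total number of markers and estimates the limit matrix by taking a large power. The names `hank_ted_matrix` and `limit_estimate` are hypothetical, as is the parameter `p_hank` (the assumed probability that Hank wins a marker on a given move); floating-point arithmetic is used, so the results are approximations.

```python
def hank_ted_matrix(total, p_hank=0.5):
    """Transition matrix for the marker game.
    Index s (0-based) means Hank holds s markers, i.e. state s + 1 in the text."""
    n = total + 1
    M = [[0.0] * n for _ in range(n)]
    M[0][0] = M[n - 1][n - 1] = 1.0        # absorbing states: one player has everything
    for s in range(1, n - 1):
        M[s][s + 1] = p_hank               # Hank wins a marker
        M[s][s - 1] = 1.0 - p_hank         # Ted wins a marker
    return M

def mat_mul(A, B):
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]

def limit_estimate(M, moves=1000):
    """Approximate the limit matrix by the power M^moves."""
    P = M
    for _ in range(moves - 1):
        P = mat_mul(P, M)
    return P

P = limit_estimate(hank_ted_matrix(5))
for hank in range(1, 5):
    row = P[hank]
    print(f"Hank {hank}, Ted {5 - hank}: "
          f"Ted wins {row[0]:.3f}, Hank wins {row[-1]:.3f}")
```

Passing a different `p_hank` changes only the two assignments inside the loop, which is the sense in which the matrix model adapts to an unfair coin with minimal effort.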


Even though we have so far only addressed questions that have already been
answered in Section 6.5, there is a benefit to the additional computational efforts
required by the matrix model: The new method is much, much more versatile.
Exercise 24 of Section 6.5 showed the difficulties with the Hank and Ted game when an
unfair coin is used to play. In contrast, the matrix model requires a minimal amount
of change to reflect this kind of game.





Practice Problem 3  For the Hank and Ted game played with 4 markers using a
coin that comes up heads $\tfrac{3}{5}$ of the time, find the transition matrix $M$, and then find and
interpret $M_\infty$.


Example 3  In a tennis game tied at deuce, the game will end when one player
is two points ahead of the other. Assuming that Player A has a $\tfrac{3}{4}$ probability of winning
each point, use a matrix model to find the probability that Player A wins a game that
starts tied at deuce.


SOLUTION  This game has five states:

• State 1 is "Player A wins."
• State 2 is "Player A up one point."
• State 3 is "The game is tied."
• State 4 is "Player A down one point."
• State 5 is "Player A loses."





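The transition matrix for this game is

\[
M = \begin{pmatrix}
1 & 0 & 0 & 0 & 0 \\
3/4 & 0 & 1/4 & 0 & 0 \\
0 & 3/4 & 0 & 1/4 & 0 \\
0 & 0 & 3/4 & 0 & 1/4 \\
0 & 0 & 0 & 0 & 1
\end{pmatrix}
\]

so we can compute

\[
M^{100} \approx \begin{pmatrix}
1 & 0 & 0 & 0 & 0 \\
0.975 & 10^{-22} & 0 & 10^{-22} & 0.025 \\
0.900 & 0 & 10^{-22} & 0 & 0.100 \\
0.675 & 10^{-22} & 0 & 10^{-22} & 0.325 \\
0 & 0 & 0 & 0 & 1
\end{pmatrix}
\]

\[
M^{1000} \approx \begin{pmatrix}
1 & 0 & 0 & 0 & 0 \\
0.975 & 10^{-213} & 0 & 10^{-213} & 0.025 \\
0.900 & 0 & 10^{-213} & 0 & 0.100 \\
0.675 & 10^{-213} & 0 & 10^{-213} & 0.325 \\
0 & 0 & 0 & 0 & 1
\end{pmatrix}
\]

From this it appears that the powers are settling on the matrix

\[
M_\infty = \begin{pmatrix}
1 & 0 & 0 & 0 & 0 \\
0.975 & 0 & 0 & 0 & 0.025 \\
0.900 & 0 & 0 & 0 & 0.100 \\
0.675 & 0 & 0 & 0 & 0.325 \\
0 & 0 & 0 & 0 & 1
\end{pmatrix}
= \begin{pmatrix}
1 & 0 & 0 & 0 & 0 \\
39/40 & 0 & 0 & 0 & 1/40 \\
9/10 & 0 & 0 & 0 & 1/10 \\
27/40 & 0 & 0 & 0 & 13/40 \\
0 & 0 & 0 & 0 & 1
\end{pmatrix}
\]

Hence, the probability that a game starting tied (in state 3) ends up as a win for
Player A (in state 1) is $\tfrac{9}{10}$, the entry in row 3, column 1 of this matrix. This is
the same answer as in Exercise 12 of Section 6.5. We can further tell from the
matrix model that even if Player A is down one point in this game, he still has
a $\tfrac{27}{40}$ probability of eventually winning the game.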


Exercise 17 of Section 6.5 investigated a game having a "win-by-three" rule.
We saw that this was considerably more complicated than the win-by-two rule of
tennis or volleyball. Once again, the matrix model is versatile enough to handle this
variation with only minor changes.







Practice Problem 4  Assume that a game of table tennis starts tied, and the winner
is the first player to win three points more than the other player. If Player A has a $\tfrac{2}{3}$
probability of winning each point, use a matrix model to find the probability that Player
A eventually wins a game that starts off tied. (Hint: This game has seven states.)





Expected Values with the Matrix Model 


Now we will see what else we can discover from our matrix model. Notice that we can
sum the row 3, column 2 entries of the matrices $M$, $M^2$, $M^3$, and $M^4$ to combine the
information they contain about going from state 3 to state 2 in four moves or less. In
fact, if we wish to have this information about all states of the game simultaneously,
we should perform this summation for each position in the matrix. This is exactly
how we will define matrix addition:





Definition  Given matrices $A$ and $B$, we denote by $A + B$ the new matrix
whose row $i$, column $j$ entry comes from adding $A_{i,j}$ to $B_{i,j}$ for each $i$ and $j$.
Note that this definition only makes sense if matrices $A$ and $B$ are the same
size, both in number of rows and number of columns.











Example 4  The Hank and Ted game played with a total of 3 markers has
transition matrix

\[
M = \begin{pmatrix}
1 & 0 & 0 & 0 \\
1/2 & 0 & 1/2 & 0 \\
0 & 1/2 & 0 & 1/2 \\
0 & 0 & 0 & 1
\end{pmatrix}
\]

where state $i$ corresponds to Hank having $i - 1$ markers and Ted having $4 - i$ markers.
Discuss the meaning of the sum $M + M^2 + M^3 + M^4$ in terms of the game.

SOLUTION  We calculated each of the matrix powers $M^2$, $M^3$, and $M^4$ earlier
in this section, so

\[
M + M^2 + M^3 + M^4 = \begin{pmatrix}
4 & 0 & 0 & 0 \\
9/4 & 5/16 & 5/8 & 13/16 \\
13/16 & 5/8 & 5/16 & 9/4 \\
0 & 0 & 0 & 4
\end{pmatrix}
\tag{6.4}
\]

has $\tfrac{5}{8}$ as its entry in row 3, column 2. We might at first glance take this to mean
that the probability of going from state 3 to state 2 in four moves or less is $\tfrac{5}{8}$.
However, it should be troubling that some of the other entries in this matrix
(like the 9/4 in row 2, column 1) are greater than 1. This should at least indicate
to us that the entries are not probabilities.

In order to understand what these entries do mean, we need some new
notation. We will represent an outcome in four moves as

\[ \_\_ \to \_\_ \to \_\_ \to \_\_ \to \_\_ \]

where we will use in the blanks either the state numbers or an $*$ if we don't care
about the state. For example, the event of starting in state 3 and being in state
2 three moves later is denoted by

\[ 3 \to * \to * \to 2 \to * \]

The fact that the row 3, column 2 entry of $M^3$ is $\tfrac{1}{8}$ means that

\[ \mathrm{Prob}(3 \to * \to * \to 2 \to *) = \tfrac{1}{8} \]

On the other hand, the event of going from state 3 to state 2 in one move,
denoted by

\[ 3 \to 2 \to * \to * \to * \]

has probability $\tfrac{1}{2}$ since the row 3, column 2 entry in $M$ is $\tfrac{1}{2}$.

The problem is there are some outcomes like $3 \to 2 \to 3 \to 2 \to 3$ that are
in both events (hence, these events are not disjoint), and so the sum $\tfrac{1}{2} + \tfrac{1}{8} = \tfrac{5}{8}$
is not the probability of going from state 3 to state 2 in one or three moves.

We should not give up on this sum too quickly, however: it does turn out
to have some significance. Notice that in the above example the only outcomes
that are counted twice are those that entail the game which starts at state 3
being in state 2 twice in the four moves. Of course beyond four moves, it is true
in general that

The sum of the row 3, column 2 entries in $M, M^2, M^3, M^4, \ldots$ adds $k$ times the
probability that a game starting in state 3 is in state 2 exactly $k$ times.

This means that the sum of these entries is the same as

\[
\begin{aligned}
&(1) \times \mathrm{Prob}(\text{game starting in state 3 is in state 2 one time}) \\
{}+{} &(2) \times \mathrm{Prob}(\text{game starting in state 3 is in state 2 two times}) \\
{}+{} &(3) \times \mathrm{Prob}(\text{game starting in state 3 is in state 2 three times}) \\
{}+{} &(4) \times \mathrm{Prob}(\text{game starting in state 3 is in state 2 four times}) + \cdots
\end{aligned}
\]

This is exactly our formula for expected values!


It follows from this type of reasoning that in the sum $M + M^2 + M^3 + \cdots$, the
entry in row $i$, column $j$ is the expected number of times that a game which starts
at state $i$ will be in state $j$. This is not an accurate statement when $i = j$ because
nowhere in the analysis have we accounted for the fact that a game starting in state
$i$ is in state $i$ one time without moving at all. So we are careful with our wording in
the following result, which we state without proof.








Theorem 3  Given a transition matrix $M$ for a game, in the matrix sum

\[ Q = M + M^2 + M^3 + M^4 + \cdots + M^n \]

the entry in row $i$, column $j$ is the expected number of times that a game which begins
in state $i$ will enter state $j$ within the first $n$ turns of the game.
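The sum in Theorem 3 is straightforward to compute once matrix addition is available. The following sketch (with hypothetical helper names `mat_add` and `sum_of_powers`) forms $M + M^2 + M^3 + M^4$ exactly for the 3-marker game, so the result can be compared with the matrix found in Example 4.

```python
from fractions import Fraction

def mat_mul(A, B):
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]

def mat_add(A, B):
    """Entry-by-entry sum of two matrices of the same size."""
    return [[a + b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def sum_of_powers(M, n):
    """Return M + M^2 + ... + M^n for an integer n >= 1."""
    power = M
    total = M
    for _ in range(n - 1):
        power = mat_mul(power, M)
        total = mat_add(total, power)
    return total

half = Fraction(1, 2)
M = [[1, 0, 0, 0],
     [half, 0, half, 0],
     [0, half, 0, half],
     [0, 0, 0, 1]]

Q4 = sum_of_powers(M, 4)
print(Q4[2][1])   # row 3, column 2: 5/8
print(Q4[1][0])   # row 2, column 1: 9/4, which cannot be a probability
```

Each row of the result sums to 4 because each of the four powers is itself a transition matrix with rows summing to 1.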


Once again, for this to be useful, we need $n$ to be fairly large. In fact, the
theoretical result we are after involves understanding the sum in Theorem 3 as a limit
process in which $n \to \infty$. There is a hidden difficulty with this strategy as it stands
now. With the use of technology, it is easy to see that sums like

\[
M + M^2 + \cdots + M^{10} \approx \begin{pmatrix}
10 & 0 & 0 & 0 \\
6.22 & 0.33 & 0.67 & 2.78 \\
2.78 & 0.67 & 0.33 & 6.22 \\
0 & 0 & 0 & 10
\end{pmatrix}
\tag{6.5}
\]

or

\[
M + M^2 + \cdots + M^{50} \approx \begin{pmatrix}
50 & 0 & 0 & 0 \\
32.89 & 0.33 & 0.67 & 16.11 \\
16.11 & 0.67 & 0.33 & 32.89 \\
0 & 0 & 0 & 50
\end{pmatrix}
\tag{6.6}
\]

have some large entries. In fact, these examples indicate that as $n$ gets large, so do
some of the entries in the matrix sum, and so we will never see these sums "settling
down" on some fixed matrix.

If we consider our Hank and Ted example, the large entries that we saw in the
evaluation of $M + M^2 + \cdots + M^{50}$ can be understood by recalling an observation
we made about the individual powers of $M$. Each power $M^k$ is a transition matrix in
its own right, and so it has probabilities for entries and rows that sum to 1. Hence,
when we form the sum $M + M^2 + \cdots + M^{100}$, we will have rows that sum to 100, and
when we sum 500 powers of $M$, we will have rows that sum to 500, so it is clear that
some entries are going to get large without bound as we add more and more powers.
This is a real problem for thinking of the number of powers as "going to infinity."

However, one of the nice properties of our matrix model is that if we simply
ignore the rows and columns of the transition matrix corresponding to the absorbing
states, then the remaining matrix still models the other states perfectly well. For
example, if we let $N$ be the result of ignoring states 1 and 4 in the matrix $M$ for the
Hank and Ted game with 3 markers from the start of this section:

\[
N = \begin{pmatrix}
0 & 1/2 \\
1/2 & 0
\end{pmatrix}
\]

then computing

\[
Q \approx N + N^2 + N^3 + \cdots + N^{500} \approx \begin{pmatrix}
0.33 & 0.67 \\
0.67 & 0.33
\end{pmatrix}
\tag{6.7}
\]

distills an accurate picture of states 2 and 3 from the mess that was shaping up in
(6.6).

We can now finally see how to use this to analyze the expected length of the 
original Hank and Ted game with 3 markers that has driven our discussion so far. 


Example 5  Use the matrix model to analyze the expected length of the
Hank and Ted game played with 3 total markers.

SOLUTION  This is a fairly simple game since it only has two possible starting
points, $H = 1$, $T = 2$ or $H = 2$, $T = 1$. These correspond to states 2 and 3,
respectively. We remarked before that it seems like

\[
M_\infty = \begin{pmatrix}
1 & 0 & 0 & 0 \\
2/3 & 0 & 0 & 1/3 \\
1/3 & 0 & 0 & 2/3 \\
0 & 0 & 0 & 1
\end{pmatrix}
\]

From this we were able to give the probabilities of reaching either ending state
from any beginning state.
In the matrix sum (6.7), we saw

\[
Q = N + N^2 + N^3 + \cdots \approx \begin{pmatrix}
0.33 & 0.67 \\
0.67 & 0.33
\end{pmatrix}
\]

Since the sum of the entries in the first row above seems to be 1, we can state that

The expected number of times that a game which begins in state 2 (which
corresponds to the first row in $N$) enters a nonabsorbing state is 1.

Summing the entries in the second row of $Q$ gives us a similar statement about
games that begin in state 3.

Since every game must consist of some number of moves among nonabsorbing
states followed by a single move into an absorbing state, the expected
number of moves in a game that starts in state 2 is $1 + 1 = 2$ moves. The same
number of moves is expected of a game that starts in state 3, the game starting
with $H = 2$, $T = 1$.











These results can be summarized in the following corollary to Theorem 3.

Corollary 4  Suppose $N$ is the transition matrix for the nonabsorbing states (also
called transient states) of a game, and the infinite matrix sum

\[ Q = N + N^2 + N^3 + \cdots \]

exists. Then the expected length of the game that starts in the state corresponding to
row $i$ is 1 more than the sum of the entries in row $i$ of $Q$.
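Corollary 4 translates into a short computation: drop the absorbing states, add up many powers of $N$, and add 1 to each row sum. The sketch below assumes that a state is absorbing exactly when its diagonal entry in $M$ is 1, and it truncates the infinite sum after 500 terms, so the output is an estimate. The function names are hypothetical.

```python
def transient_submatrix(M):
    """Drop the rows and columns of absorbing states (assumed: M[s][s] == 1)."""
    keep = [s for s in range(len(M)) if M[s][s] != 1]
    return [[M[r][c] for c in keep] for r in keep]

def mat_mul(A, B):
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]

def expected_lengths(M, terms=500):
    """Estimate 1 + (row sums of Q), where Q is N + N^2 + ... + N^terms."""
    N = transient_submatrix(M)
    power = N
    Q = [row[:] for row in N]
    for _ in range(terms - 1):
        power = mat_mul(power, N)
        Q = [[q + p for q, p in zip(qr, pr)] for qr, pr in zip(Q, power)]
    return [1 + sum(row) for row in Q]

# 3-marker Hank and Ted game from the text.
M = [[1, 0, 0, 0],
     [0.5, 0, 0.5, 0],
     [0, 0.5, 0, 0.5],
     [0, 0, 0, 1]]
print(expected_lengths(M))   # both entries are very close to 2
```

The list returned has one entry per transient state, in the same order as the rows of $N$ (here states 2 and 3).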


Practice Problem 5  If the Hank and Ted game is played with 4 total markers and
a coin that comes up heads $\tfrac{3}{5}$ of the time, how many moves on average will the game
last if each player starts with 2 markers?





In Section 6.5, we also addressed problems where a recursive process was involved
but where there were no winning probabilities to worry about. In these problems,
the only issue is the expected duration of the process. In terms of matrices, these
problems are typically modeled with a transition matrix having only one absorbing
state reflecting the single way that the process can end.

Example 6  Use the matrix model to analyze the game where a coin is tossed until it
first comes up heads. In particular, use the matrix model to explain why the average
number of tosses for this to happen is 2.

SOLUTION  If we think of state 1 as heads and state 2 as tails, then this game
must continue in state 2 until it first goes to state 1, at which time the game is over. The
following $2 \times 2$ matrix describes the transition probabilities.

\[
M = \begin{pmatrix}
1 & 0 \\
1/2 & 1/2
\end{pmatrix}
\]

Since state 1 is the absorbing state, we work instead with $N = [1/2]$, and estimate
the sum

\[ Q = N + N^2 + N^3 + \cdots \]

by calculating the first 100 terms $N + N^2 + N^3 + \cdots + N^{100} = [1 - 1/2^{100}] \approx [1]$.
From this estimate we see that the game which begins in state 2 will enter
state 2 on average one time. The game also requires one flip to enter the
absorbing state 1, so the expected number of tosses in this game is $1 + 1 = 2$.


Our final example shows how matrix models can be used to analyze children's
board games. These games usually consist of a small number of states, fixed
probabilities for moving between states, and no strategy or decisions in playing the games.

Example 7  Consider the following scaled-down version of a children's game.
Each of two children starts with a cherry tree with four cherries, and they take turns
spinning a spinner with four equally likely outcomes. Three of these outcomes instruct
the child to remove one, two, or three cherries, respectively, from her tree, and the
fourth position tells the child to put all her cherries back on the tree. What is the
expected number of spins for a child to get all her cherries off of the tree?


SOLUTION  The following $5 \times 5$ matrix represents the transitions for the
game:

\[
M = \begin{array}{l|ccccc}
\text{0 cherries} & 1 & 0 & 0 & 0 & 0 \\
\text{1 cherry} & 3/4 & 0 & 0 & 0 & 1/4 \\
\text{2 cherries} & 2/4 & 1/4 & 0 & 0 & 1/4 \\
\text{3 cherries} & 1/4 & 1/4 & 1/4 & 0 & 1/4 \\
\text{4 cherries} & 0 & 1/4 & 1/4 & 1/4 & 1/4
\end{array}
\]

From this we can compute the following matrix $Q$ as

\[ Q = N + N^2 + N^3 + \cdots \]

where $N$ is the submatrix of $M$ corresponding to the transient states of the
game:

\[
N = \begin{array}{l|cccc}
\text{State 1} & 0 & 0 & 0 & 1/4 \\
\text{State 2} & 1/4 & 0 & 0 & 1/4 \\
\text{State 3} & 1/4 & 1/4 & 0 & 1/4 \\
\text{State 4} & 1/4 & 1/4 & 1/4 & 1/4
\end{array}
\]

If we estimate $Q$ using the first 500 terms of this sum, we get

\[
Q \approx N + N^2 + N^3 + \cdots + N^{500} \approx \begin{pmatrix}
0.19 & 0.15 & 0.12 & 0.49 \\
0.49 & 0.19 & 0.15 & 0.61 \\
0.61 & 0.49 & 0.19 & 0.76 \\
0.76 & 0.61 & 0.49 & 0.95
\end{pmatrix}
\]

From this matrix $Q$, we can extract the information that, for example, when
starting in state 4, the game will land an average of 0.76 moves in state 1, 0.61
moves in state 2, 0.49 moves in state 3, and 0.95 moves in state 4. Therefore,
the game lands in some transient state on average

\[ 0.76 + 0.61 + 0.49 + 0.95 \approx 2.8 \text{ moves} \]

which means the game lasts on average a total of approximately $2.8 + 1 = 3.8$
moves.
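The cherry game can be checked with a short program. The sketch below rebuilds the transition matrix from the spinner rules as described in Example 7 (removing more cherries than remain simply empties the tree, which is how the matrix in the text treats a tree with one or two cherries) and then estimates the expected number of spins from a 500-term sum. The function names are hypothetical, and floating-point arithmetic makes the answers approximate.

```python
def cherry_matrix(cherries=4):
    """Transition matrix for the scaled-down cherry game.
    Index c is the number of cherries still on the tree."""
    M = [[0.0] * (cherries + 1) for _ in range(cherries + 1)]
    M[0][0] = 1.0                              # empty tree: the game is over
    for c in range(1, cherries + 1):
        for removed in (1, 2, 3):              # three of the four spinner outcomes
            M[c][max(c - removed, 0)] += 0.25
        M[c][cherries] += 0.25                 # fourth outcome: refill the tree
    return M

def mat_mul(A, B):
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]

def expected_spins(M, terms=500):
    """1 + row sums of N + N^2 + ... + N^terms, with N the transient part of M."""
    N = [row[1:] for row in M[1:]]             # drop the single absorbing state (index 0)
    power = N
    Q = [row[:] for row in N]
    for _ in range(terms - 1):
        power = mat_mul(power, N)
        Q = [[q + p for q, p in zip(qr, pr)] for qr, pr in zip(Q, power)]
    return [1 + sum(row) for row in Q]

spins = expected_spins(cherry_matrix())
print(round(spins[3], 2))   # starting with 4 cherries: about 3.8 spins
```

Entry `spins[c - 1]` is the estimate for a tree that starts with `c` cherries, so the last entry corresponds to the full tree in the example.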





Summary of Markov Chain Techniques 


Suppose we are given a process with a finite number of states (each is either
transient or absorbing) and a transition matrix $M$ that models the probabilities
of changing states. Then

• The entry in row $i$, column $j$ of $M^n$ is the probability that a process which
begins in state $i$ will be in state $j$ exactly $n$ transitions later.

• The entry in row $i$, column $j$ of the matrix limit

\[ M_\infty = \lim_{n \to \infty} M^n \]

(if the limit exists) is the probability that a process which begins in state $i$
eventually settles into state $j$.

• The entry in row $i$, column $j$ of

\[ Q = N + N^2 + N^3 + \cdots \]

(where $N$ is formed by deleting from $M$ the rows and columns of the absorbing
states) is the expected number of times that a process which begins in
state $i$ will enter state $j$.

• Adding 1 to the sum of the entries in row $i$ of $Q$ (from above) gives the
average number of transitions needed for a process that begins in state $i$
to be absorbed.


Obtaining Exact Answers 


In this short section, we will try to get a feel for the mathematics involved in obtaining
exact answers to some of these problems. This is important since we can be in
situations where it is not obvious that a process is almost certainly in an absorbing
state after 100 or 1,000 transitions the way it is with our small dice and coin games.
The proofs of some of these facts are not too hard (they are within the scope of a
typical first linear algebra course in college), but they would take us too far afield to
pursue in this book.

We have seen that the advantage to only paying attention to the nonwinning
states (transient states) is that in a game guaranteed to end, the entries in $N^n$ all tend
to 0 as $n$ gets large without bound, and this fact turns out to be enough to guarantee
that the entries in the infinite sum

\[ N + N^2 + N^3 + \cdots \]

do not grow without bound.

Once we are confident that this infinite sum exists, it turns out to be fairly
easy to find even without doing huge technology-intensive calculations. Instead of
computing huge powers of matrices, we simply apply the following polynomial fact
from Exercise 14 in Section 2.3:

\[ (1 - x)(x + x^2 + x^3 + \cdots) = x \]


In terms of matrices, this means that

\[ (I - N)(N + N^2 + N^3 + \cdots) = N \]

where $I$ is the identity matrix* having 0's everywhere except down the main diagonal
where it has 1's.

Example 8  Show how the above fact can be applied to the Hank and Ted game
played with a total of 3 markers.

SOLUTION  We have already established that this problem uses the following
transition matrices:

\[
M = \begin{pmatrix}
1 & 0 & 0 & 0 \\
1/2 & 0 & 1/2 & 0 \\
0 & 1/2 & 0 & 1/2 \\
0 & 0 & 0 & 1
\end{pmatrix}
\quad \text{and} \quad
N = \begin{pmatrix}
0 & 1/2 \\
1/2 & 0
\end{pmatrix}
\]

which means that

\[
I - N = \begin{pmatrix}
1 & -1/2 \\
-1/2 & 1
\end{pmatrix}
\]

Our next goal is to find a matrix that we can multiply by $I - N$ to get $I$ as the
result. Without a little background we cannot give a simple method for doing
this here, so we will just leave it to a calculator or computer algebra system at
this point. The inverse of $I - N$ is

\[
(I - N)^{-1} = \begin{pmatrix}
4/3 & 2/3 \\
2/3 & 4/3
\end{pmatrix}
\]

a result that can be much more easily checked than found. Observe that

\[
\begin{pmatrix}
4/3 & 2/3 \\
2/3 & 4/3
\end{pmatrix}
\cdot
\begin{pmatrix}
1 & -1/2 \\
-1/2 & 1
\end{pmatrix}
=
\begin{pmatrix}
1 & 0 \\
0 & 1
\end{pmatrix}
\]

This means that we have a much easier way to compute the infinite sum:

\[
N + N^2 + N^3 + \cdots = (I - N)^{-1} \cdot N
= \begin{pmatrix}
4/3 & 2/3 \\
2/3 & 4/3
\end{pmatrix}
\cdot
\begin{pmatrix}
0 & 1/2 \\
1/2 & 0
\end{pmatrix}
= \begin{pmatrix}
1/3 & 2/3 \\
2/3 & 1/3
\end{pmatrix}
\]

Note that this is the same result we inferred from looking at the first 500
terms of the sum in Example 5. Now we know for sure that this is correct.
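The text leaves the inverse to a calculator or computer algebra system. For readers who want to see one way such a tool might proceed, here is a minimal sketch that inverts $I - N$ by Gauss-Jordan elimination over exact fractions. Gauss-Jordan elimination is a standard technique swapped in for illustration; it is not a method the text develops, and the names `inverse` and `exact_Q` are hypothetical.

```python
from fractions import Fraction

def mat_mul(A, B):
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]

def inverse(A):
    """Invert a square matrix by Gauss-Jordan elimination with exact fractions."""
    n = len(A)
    # Augment A with the identity matrix: [A | I].
    aug = [[Fraction(x) for x in row] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(A)]
    for col in range(n):
        pivot = next((r for r in range(col, n) if aug[r][col] != 0), None)
        if pivot is None:
            raise ValueError("matrix is not invertible")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        pv = aug[col][col]
        aug[col] = [x / pv for x in aug[col]]          # scale pivot row to a leading 1
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]                    # right half is the inverse

def exact_Q(N):
    """Return (I - N)^(-1) . N, the exact value of N + N^2 + N^3 + ..."""
    n = len(N)
    i_minus_n = [[int(i == j) - N[i][j] for j in range(n)] for i in range(n)]
    return mat_mul(inverse(i_minus_n), N)

half = Fraction(1, 2)
N = [[0, half], [half, 0]]
Q = exact_Q(N)
print(Q)   # entries 1/3 and 2/3, as in Example 8
```

The same two functions apply unchanged to larger transient matrices, provided $I - N$ is invertible.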


Example 9  (Example 6 revisited.) Use the exact matrix model to explain why
the average number of tosses for the first heads to come up is 2.

SOLUTION  Recall that state 1 is heads and state 2 is tails, and the following
is the transition matrix for the problem:

\[
M = \begin{pmatrix}
1 & 0 \\
1/2 & 1/2
\end{pmatrix}
\]


* The 2 x 2 identity matrix was used in Practice Problem 1 










Since state 1 is the absorbing state, we work instead with $N = [1/2]$, and compute
the sum

\[ N + N^2 + N^3 + \cdots \]

by calculating instead $(I - N)^{-1} \cdot N = [1/2]^{-1} \cdot [1/2] = [1]$. That is, the game
that begins in state 2 will enter state 2 on average one time. The game also
requires one flip to enter the absorbing state 1, so the expected number of
tosses in this game is $1 + 1 = 2$ as we saw before.


Example 10  (Example 7 revisited.) Use the exact matrix methods to find the
expected number of spins for a child to get all her cherries off of the tree in the children's
game in Example 7.

SOLUTION  Recall that the following $5 \times 5$ matrix represents the transitions
for the game:

\[
M = \begin{pmatrix}
1 & 0 & 0 & 0 & 0 \\
3/4 & 0 & 0 & 0 & 1/4 \\
2/4 & 1/4 & 0 & 0 & 1/4 \\
1/4 & 1/4 & 1/4 & 0 & 1/4 \\
0 & 1/4 & 1/4 & 1/4 & 1/4
\end{pmatrix}
\]

From this we can compute the matrix

\[ Q = N + N^2 + N^3 + \cdots \]

where $N$ is the submatrix of $M$ corresponding to the transient states of the
game:

\[
N = \begin{pmatrix}
0 & 0 & 0 & 1/4 \\
1/4 & 0 & 0 & 1/4 \\
1/4 & 1/4 & 0 & 1/4 \\
1/4 & 1/4 & 1/4 & 1/4
\end{pmatrix}
\]

We saw that to find $Q$ we should compute

\[
Q = (I - N)^{-1} \cdot N \approx \begin{pmatrix}
0.19 & 0.15 & 0.12 & 0.49 \\
0.49 & 0.19 & 0.15 & 0.61 \\
0.61 & 0.49 & 0.19 & 0.76 \\
0.76 & 0.61 & 0.49 & 0.95
\end{pmatrix}
\]

This agrees with our computation of $Q$ in Example 7, so all the conclusions
there are still valid here.


In this section, there have been several unproven theorems and some perhaps
mysterious statements, but we hope that seeing these tools used will serve to motivate
you to learn more about linear algebra. We have perhaps raised as many questions
as we have answered. Does the matrix $(I - N)^{-1}$ always exist? Is there a quick way
to find $M^n$ for large values of $n$? These questions are central to the foundation of
the subject of Markov chains. They are some great questions for your linear algebra
instructor when you take that course later in your academic career.











Solutions to Practice Problems 


-1 0 
1 w) A-B=|-12 -1/4 
ys 1/3 


(b) B-A is undefined. 
(c) C Bis undefined. 

1 -2 
-13 0 

o a2 
o o 
a[i -4 

C= [ aed ] 

Crs 

wre 

[T 

$I$ is called an identity matrix because it does not change
any (compatible) matrix multiplied by it.

2. (a) According to Theorem 1, this is the row 2, column 4
entry in $M^4$, which is $\tfrac{5}{16}$.

(b) Since Hank 2, Ted 1 is state 3 and Hank 1, Ted 2 is
state 2, this question is answered by the row 3, column 2
entry in $M^4$, which is 0. This makes sense if
you think about it.

3. For the Hank and Ted game played with 4 markers, there
are five possible states:

• State 1 is $H = 0$, $T = 4$.
• State 2 is $H = 1$, $T = 3$.
• State 3 is $H = 2$, $T = 2$.
• State 4 is $H = 3$, $T = 1$.
• State 5 is $H = 4$, $T = 0$.

The unfair coin makes the transition matrix $M$ somewhat
asymmetric:


d) B-C = 








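\[
M = \begin{pmatrix}
1 & 0 & 0 & 0 & 0 \\
2/5 & 0 & 3/5 & 0 & 0 \\
0 & 2/5 & 0 & 3/5 & 0 \\
0 & 0 & 2/5 & 0 & 3/5 \\
0 & 0 & 0 & 0 & 1
\end{pmatrix}
\]

We can compute

\[
M^{100} \approx \begin{pmatrix}
1 & 0 & 0 & 0 & 0 \\
0.585 & 10^{-16} & 0 & 10^{-16} & 0.415 \\
0.308 & 0 & 10^{-16} & 0 & 0.692 \\
0.123 & 10^{-16} & 0 & 10^{-16} & 0.877 \\
0 & 0 & 0 & 0 & 1
\end{pmatrix}
\]

and

\[
M^{1000} \approx \begin{pmatrix}
1 & 0 & 0 & 0 & 0 \\
0.585 & 10^{-160} & 0 & 10^{-160} & 0.415 \\
0.308 & 0 & 10^{-160} & 0 & 0.692 \\
0.123 & 10^{-160} & 0 & 10^{-160} & 0.877 \\
0 & 0 & 0 & 0 & 1
\end{pmatrix}
\]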







From this it appears that the powers are settling on the
matrix

\[
M_\infty = \begin{pmatrix}
1 & 0 & 0 & 0 & 0 \\
0.585 & 0 & 0 & 0 & 0.415 \\
0.308 & 0 & 0 & 0 & 0.692 \\
0.123 & 0 & 0 & 0 & 0.877 \\
0 & 0 & 0 & 0 & 1
\end{pmatrix}
\]

Since state 5 is the winning state for Hank, this means
that

• In the game that starts Hank 1, Ted 3, Hank has probability
≈0.415 of winning.

• In the game that starts Hank 2, Ted 2, Hank has probability
≈0.692 of winning.

• In the game that starts Hank 3, Ted 1, Hank has probability
≈0.877 of winning.

4. For a "win-by-three" game, we have seven states:

• State 1 is "Player A wins."

• State 2 is "Player A up two points."

• State 3 is "Player A up one point."

• State 4 is "Game tied."

• State 5 is "Player A down one point."

• State 6 is "Player A down two points."

• State 7 is "Player A loses."


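The transition matrix M for this game is

\[
M = \begin{bmatrix}
1 & 0 & 0 & 0 & 0 & 0 & 0 \\
2/3 & 0 & 1/3 & 0 & 0 & 0 & 0 \\
0 & 2/3 & 0 & 1/3 & 0 & 0 & 0 \\
0 & 0 & 2/3 & 0 & 1/3 & 0 & 0 \\
0 & 0 & 0 & 2/3 & 0 & 1/3 & 0 \\
0 & 0 & 0 & 0 & 2/3 & 0 & 1/3 \\
0 & 0 & 0 & 0 & 0 & 0 & 1
\end{bmatrix}
\]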


With some experimentation, it appears that the powers
of M are settling on the matrix

\[
M_\infty = \begin{bmatrix}
1 & 0 & 0 & 0 & 0 & 0 & 0 \\
0.984 & 0 & 0 & 0 & 0 & 0 & 0.016 \\
0.952 & 0 & 0 & 0 & 0 & 0 & 0.048 \\
0.889 & 0 & 0 & 0 & 0 & 0 & 0.111 \\
0.762 & 0 & 0 & 0 & 0 & 0 & 0.238 \\
0.508 & 0 & 0 & 0 & 0 & 0 & 0.492 \\
0 & 0 & 0 & 0 & 0 & 0 & 1
\end{bmatrix}
\]

Hence, a game that starts tied (i.e., in state 4) has a probability
of approximately 0.889 of being a win for Player A.

5. This problem uses the same transition matrix M as Practice
Problem 3, and the matrix N of transient states for
this game looks like this:

\[
N = \begin{bmatrix}
0 & 3/5 & 0 \\
2/5 & 0 & 3/5 \\
0 & 2/5 & 0
\end{bmatrix}
\]

Summing powers of N allows us to form

\[
Q = N + N^2 + N^3 + \cdots \approx \begin{bmatrix}
0.462 & 1.154 & 0.692 \\
0.769 & 0.923 & 1.154 \\
0.308 & 0.769 & 0.462
\end{bmatrix}
\]





Exercises for Section 6.6 


1. Hank and Ted start a game with Hank having 4 markers
to Ted's 2 markers, but they roll a die for each move
and Ted wins a marker from Hank with probability 2/3.
So Hank starts with twice as many markers, but Ted has
twice the probability of winning each time. Give a transition
matrix for this game, being clear about what the
states are.

2. John and Jessica start a game with 4 markers each. On
each move, John rolls a six-sided die and Jessica rolls
an eight-sided die with the rule that whoever rolls the
higher number wins a marker and no markers change
hands when they roll the same value. Give a transition
matrix for this game, being clear about what the states are.

3. Pete and André decide to play a game of table tennis 
where they start tied, and play until one player has won 
four points more than the other. Assuming that André 
has a 2 probability of winning cach point, give a tran- 
sition matrix for this game, being clear about what the 
states are, 

4. A certain mathematical board game consists of 16
squares, each one labeled with a subset of {1, 2, 3, 4}.
On each move, a player rolls a four-sided die labeled
1, 2, 3, 4. If they are on square S and they roll the value
k, they move to square S ∪ {k}. Note that this might constitute
a move from a square to itself. Give a transition
matrix for this game, being clear about what the states
are.

5. Here is a miniature version of a children's board game
for your analyzing pleasure. The game board is shown
in Figure 6-6, and a four-sided die (with sides labeled 1,
2, 3, and 4) is used to generate the moves. A game piece
starts at square A and moves left-to-right across each
row, working its way up to square F in the top row. Give
a transition matrix for this game, being clear about what
the states are.

6. A game is played on the board in Figure 6-7 using a six-sided
die to generate the moves. A game piece starts
on square A and moves left-to-right, finishing the game
when it makes it to square B. Give a transition matrix





The middle row of this result tells us about the game
beginning in state 3 (with Hank and Ted each with 2
markers). Since the sum of the entries in the middle row
is approximately 0.769 + 0.923 + 1.154 = 2.846, the average
length of a game that starts in state 3 is approximately
2.846 + 1 = 3.846 moves.








for this game, being clear about what the states are. To 
what problem that was addressed earlier in this chapter 
does this game correspond? 

7. Play the game in Exercise 1 ten times to get a feeling for
the probability each player wins and the average length
of this game.

8. Play the game in Exercise 2 ten times to get a feeling for
the probability each player wins and the average length
of this game.

9. Play the game in Exercise 3 ten times to get a feeling for
the probability each player wins and the average length
of this game.

10. Play the game in Exercise 4 ten times to get a feeling
for the average length of this game.

11. Play the game in Exercise 5 ten times to get a feeling
for the average length of this game.








Here is a quick way to compute a fairly large
power of a matrix: Find M^2 by multiplying M ×
M, then find M^4 by multiplying M^2 × M^2. In this
same way, we can find M^8 by multiplying M^4 × M^4
and then M^16 by multiplying M^8 × M^8. Apply this
strategy to answer Exercises 12 to 14.
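
As an illustration of this squaring strategy, here is a minimal Python sketch. The function names `mat_mult` and `power_by_squaring` are our own, and the sample matrix is the five-state Hank and Ted matrix from the solution to Practice Problem 3 (with move probabilities 3/5 and 2/5), not one of the exercise matrices.

```python
from fractions import Fraction

def mat_mult(a, b):
    """Multiply two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def power_by_squaring(m, squarings):
    """Return m raised to the power 2**squarings by squaring repeatedly."""
    result = m
    for _ in range(squarings):
        result = mat_mult(result, result)
    return result

# Sample transition matrix (assumed: Hank wins each move with probability 3/5).
p, q = Fraction(3, 5), Fraction(2, 5)
M = [
    [1, 0, 0, 0, 0],
    [q, 0, p, 0, 0],
    [0, q, 0, p, 0],
    [0, 0, q, 0, p],
    [0, 0, 0, 0, 1],
]

# Four squarings give M^2, M^4, M^8, and finally M^16.
M16 = power_by_squaring(M, 4)
for row in M16:
    print([round(float(x), 4) for x in row])
```

Four matrix multiplications replace the fifteen that a one-factor-at-a-time product would need, which matters when the multiplications are done by hand or on a calculator.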











E | GotoD F 
c D GotoA 
A Goto B 

















Figure 6-6 Game board for Exercise 5. 



















A |GotoA | Gotoa 














Gowa 


GotoA | GotoA | B 














Figure 6-7 


12. In the game described in Exercise 1, what is the probability
that Hank wins within 16 moves?

13. In the game described in Exercise 2, what is the probability
that John wins within 16 moves?

14. In the game described in Exercise 3, what is the probability
that André wins within 16 moves?

15. Give the transition matrix for a best-of-five series between
evenly matched opponents, and use this matrix to
find the average length of such a series. (Note: Such a
series must be over after five games, so you do not need
to calculate large matrix powers.)

16. Give the transition matrix for a best-of-five series in
which team A has a è probability of winning each game, 
and use this matrix to find the average length of such a
series. (This revisits Exercise 16 from Section 6.4.)

17. Give the transition matrix for a best-of-five series in 

J^ which team A hasa 1 probability of winning each game, 
and use this matrix to find the average length of such a
series. (This revisits Exercise 17 from Section 6.4.)

18. Suppose that team A has a ] probability of winning 

ff cach game in a best-of-five series against team B. Use 
the transition matrix from your solution to Exercise 16
to find the probability that team A wins the series. (This
revisits Exercise 2(b) from Section 6.3.)











n. wav ME prove by induction that for 
alln> 1, 
for" o 
at [ 0 Gy] 
20. Let M. [ie and prove by induction that for 
alln> 1, 
1 o 
M icorreunel 
21. Let M= [ 1 and prove by induction hat tor 
alaz1, 
afam o 
m a] 


Game board for Exercise 6. 





and 
aa f 0 xus 
me "ung 0 ] 
012 i 
22. Let M = [ |3 42] and prove by induction tnat toran 
nzi, 


MAM EMI ee M” = 
i = 1/9") 

ia-a/ 

Explain how this proves that the infinite sum 


1⁄3 3] 


40-09) 
40-0747) 


Maawa [Y 28 





23. Prove Theorem 1 by induction on k ≥ 1.

The remaining problems require the use of technology
to perform the appropriate matrix operations.

24. What is the expected length of the game described in
Exercise 1?

25. What is the expected length of the game described in
Exercise 2?

26. What is the expected length of the game described in
Exercise 3?

27. What is the expected length of the game described in
Exercise 4?

28. What is the expected length of the game described in
Exercise 5?

29. What is the expected length of the game described in
Exercise 6?

30. Give the transition matrix for a tennis game between
equally matched opponents, and use this matrix to find
the average length of such a game.

31. If the server wins a tennis point with probability $, what 
is the probability that the server wins a tennis game that
starts at 0-0? (This revisits Exercise 15 from Section 6.5.)

32. If the server wins a tennis point with probability 1, what 
is the average length of a tennis game that starts at 0-0?
(This revisits Exercise 16 from Section 6.5.)
























Chapter 6 Summary 
6.1 Introduction

Terms and concepts

• You should understand how the terms experiment, outcome,
equally likely, and sample space are used in the
study of probability.

• You should know what we mean by an event and by successful
outcomes of an experiment.

• You should recognize the notation Prob(E) used to denote
"the probability of event E."

Formulas

• Prob(E) = n(E)/n(S) (provided the outcomes in S are equally
likely).

• Prob(E) + Prob(Ē) = 1.

Sample spaces and probability calculations

• You should be able to give sample spaces for a variety
of situations and to determine whether the given sample
space has equally likely outcomes. In particular, you
should be able to give an equally likely sample space for
rolling a pair of dice.

• Using the counting techniques of the preceding chapter,
you should be able to calculate probabilities using the
formula Prob(E) = n(E)/n(S).

• You should realize that sometimes it is easier to calculate
the probability that an event does not occur.
In this case, you should be able to use the formula
Prob(E) + Prob(Ē) = 1 to calculate Prob(E).


6.2 Sum and Product Rules
for Probability

Terms and concepts

• You should know what it means for two events to be
disjoint (or mutually exclusive).

• You should know what it means for two events to be
independent, and you should realize that this is not the
same as disjoint.

• You should understand the idea of conditional probability,
along with its notation: Prob(E₂ | E₁), the probability
of E₂ given E₁.

• You should be able to distinguish a trial (or Bernoulli
trial) from the larger experiment it is a part of.

Formulas

• Sum rule:
  - For disjoint events: Prob(E₁ or E₂) = Prob(E₁) + Prob(E₂)
  - General form: Prob(E₁ or E₂) = Prob(E₁) + Prob(E₂) − Prob(E₁ and E₂)

• Product rule:
  - For independent events: Prob(E₁ and E₂) = Prob(E₁) · Prob(E₂)
  - General form: Prob(E₁ and E₂) = Prob(E₁) · Prob(E₂ | E₁) = Prob(E₂) · Prob(E₁ | E₂)

• The sum rule generalizes for three or more disjoint
events. Applying the general sum rule to events that
are not disjoint requires a careful use of the Inclusion-Exclusion
principle, and is not generally recommended
for beginners.

• The product rule generalizes for three or more independent
events. With some care, the generalized product rule
can be applied to three or more events even if they are
not independent.





Calculating probabilities 

• You should be able to combine the use of the product
and sum rules (and their generalizations), along with the
methods of the preceding section, to calculate probabilities
for a wide variety of applications.

• You should understand when it is permissible to use the
simpler versions of the product and sum rules.


6.3 Probability in Games of Chance 


Terms, concepts, and skills. 


• You should be able to distinguish a trial (or Bernoulli
trial) from the larger experiment it is a part of.

• You should be able to solve "Bernoulli trials" problems
of the form: "If the probability of success for
each trial is p, what is the probability of obtaining exactly
k successes in n trials?" The formula is given as
C(n, k) · p^k · (1 − p)^(n−k), but knowing the method that
leads to that formula is preferable to memorizing the
formula.

• You should be able to apply this method to analyze probabilities
for "best of" series.














6.4 Expected Value in Games
of Chance


Terms, concepts, and formulas

• You should understand that a random variable is simply
a measurement of some numerical value associated with
each possible outcome of a probability experiment.

• For a random variable X, you should know the meaning
of the expected value of X (sometimes called the average
value of X).

• You should recognize the notation E[X] for the expected
value, and be able to apply the formula E[X] =
(x₁) · Prob(X = x₁) + (x₂) · Prob(X = x₂) + ··· + (xₙ) ·
Prob(X = xₙ).


Calculating expected values 
• You should be able to calculate expected values for a
variety of situations. For example, the random variable
X might represent:
  - The winnings for a game of chance.
  - The value showing on a die, spinner, etc.
  - The number of siblings a person has.
  - The number of times per year a person sees the doctor.
  - The number of hits a player gets in a baseball game.
  - The number of games in a "best of seven" series.
  - and so on.
The essential ingredients are that the variable can take
on a finite number of numerical values and that we can
calculate the associated probability for each of those values.

• You should know the formula n · p for the average number
of successes in n Bernoulli trials with probability p,
but you should also be able to perform the necessary
calculations to verify that formula for specific situations.








6.5 Recursion Revisited 
Calculating expected values

• You should be able to extend the notion of expected
value to random variables with an infinite number of
possible outcomes, by using recursive reasoning. For example:
  - If you roll a single die repeatedly, how long on average
    does it take to roll a 6?
  - If you start with $50 and bet $1 repeatedly on a game
    where you have probability 1/2 of winning, how long
    can you expect to be able to play before going broke?

• Similarly, you should be able to use recursive reasoning
to answer other probability questions, such as, "If a game
of tennis is tied at deuce and A has probability 2/3 of winning
each point, what is the probability that A will win
the game?"

• Specifically, you should be able to apply these methods
to the Hank and Ted game and other similar games.








6.6 Excursion: Matrices and Markov
Chains


Terms and concepts

• You should understand the use of the word state to describe
situations that can occur in a game, and the idea of
a transition matrix representing probabilities of moving
from state to state in the game.

• You should be able to distinguish absorbing states from
transient states.





the standard usc of algebraic op- 
: for example, A + B, A B, 





• You should recognize the use of the symbol I to denote
the identity matrix, and the use of the notation A^(-1) to
indicate the inverse of the matrix A.

• You should recognize M∞, informally, as the matrix the
large powers of M are "settling on." Formally, M∞ =
lim(n→∞) M^n, provided the limit exists.

• The section consistently uses M for the transition matrix,
N for the same matrix with the rows and columns
for absorbing states removed, and Q for the matrix
N + N^2 + N^3 + ···.





Matrix operations 


• For given matrices A and B, you should know how to
calculate A + B and A · B, and you should know when
these operations are possible.

• You should know the special properties possessed by the
identity matrix I.

• You may have learned how to use computer software to
calculate large powers of a matrix; or perhaps your instructor
will supply the answers to those calculations for
you.

• A similar comment applies to the calculation of matrix
inverses.


Modeling games with matrices 


• You should know how to build the transition matrix M
for a game with a finite number of states.




• You should be able to compute probabilities for moving
from state to state in 2 moves, in 3 moves, and so on, using
powers of the transition matrix.





• You should be able to interpret the meaning of large powers
of the transition matrix, for example M^100, correctly
using the notation M∞ as part of your interpretation.








• You should be able to calculate expected values by interpreting
the matrix Q = N + N^2 + N^3 + ··· (where N
is the transition matrix with the rows and columns for
absorbing states removed).

• You should be able to calculate Q exactly as (I − N)^(-1) N,
with an assist from computer software or your instructor
for calculating (I − N)^(-1).
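
As a hedged illustration of the last two points, the following Python sketch computes Q = (I − N)^(-1) N exactly for the Hank and Ted game with 4 markers. The helper names `mat_mult` and `inverse` are ours, the move probabilities 3/5 and 2/5 are those assumed in the solution to Practice Problem 3, and the Gauss-Jordan routine is just one simple way to invert a small matrix; any matrix software would serve equally well.

```python
from fractions import Fraction

def mat_mult(a, b):
    """Multiply two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def inverse(matrix):
    """Invert a square matrix of Fractions by Gauss-Jordan elimination.

    Assumes the matrix is invertible (a zero pivot column raises StopIteration).
    """
    n = len(matrix)
    # Augment each row with the matching row of the identity matrix.
    aug = [list(row) + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        pivot_value = aug[col][col]
        aug[col] = [x / pivot_value for x in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

# Transient part of the transition matrix (states H = 1, 2, 3).
p, q, zero = Fraction(3, 5), Fraction(2, 5), Fraction(0)
N = [[zero, p, zero],
     [q, zero, p],
     [zero, q, zero]]

I_minus_N = [[Fraction(int(i == j)) - N[i][j] for j in range(3)]
             for i in range(3)]
Q = mat_mult(inverse(I_minus_N), N)

# Row sum of Q plus 1 gives the average game length from each transient state.
expected_moves = [sum(row) + 1 for row in Q]
print([round(float(x), 3) for x in expected_moves])
```

The middle entry corresponds to the game that starts with 2 markers each, matching the value of about 3.846 moves found by summing powers of N.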








Figure 7-1 Game tree for a best-of-three match.

Figure 7-2 Arrow diagrams for functions and relations.








Graphs and Trees 


The story of graph theory is a fascinating testimony to the study of recreational
mathematics. Many early results in the area apply to nothing more than puzzles
on maps or chessboards, and yet today graph theory comprises some
of the fastest growing branches of applied mathematics. Graph theoretical
concepts are used to design computer circuits, create production schedules,
optimize communication networks, and perform countless other modern tasks. In the
realm of abstract mathematics, graphs are often used to add visualization or
simplification to difficult concepts. In fact, we have already seen examples of
graphs and trees in earlier sections of this book:


• In determining all possible outcomes of a "best-of-three" match between
Players A and B, we visualized all possible matches using a "game tree" as
in Figure 7-1.

• We have also used "arrow diagrams" to help us visualize properties of functions
and relations as in Figure 7-2.


All these diagrams are examples of graphs. The first is a special type
of graph called a tree. The second and third are called directed graphs to
emphasize the role of the arrows in the diagrams. In this chapter, not only will
we see what these various diagrams have in common, but we will also study
a number of additional applications of graphs and trees.




7.1 Graph Theory





Origins and Euler 


The origin of graph theory is usually traced to a paper written by the great Swiss 
mathematician Leonhard Euler (1707-1783). In his paper Euler writes, 





The branch of geometry that deals with magnitudes has been zealously studied
throughout the past, but there is another branch that has been almost unknown
up until now; Leibniz spoke of it first, calling it the "geometry of position"
(geometria situs). This branch of geometry deals with the relations dependent
on position alone; it does not take magnitudes into consideration, nor does it
involve calculation of quantities.





In this respect, graph theory represents one of the great turning points in the
history of mathematics, involving none of the "classical" mathematics of the time.
The inspiration for Euler's paper was a popular problem of the day: to
determine whether, in the city of Königsberg (whose crude map is shown in Figure 7-3), it
was possible to take a walk through the city using each of the seven bridges exactly
once. Euler's simple solution to the question introduced the idea of modeling a
problem with what we now call a graph. His insight was the observation that the
most important factors in the problem were the four regions of the city (labeled A,
B, C, and D in the figure) and how they were connected by the seven bridges between
those regions. Figure 7-4 shows a graph that emphasizes just these important factors.

Figure 7-3 The seven bridges of Königsberg

In this graph, the vertices (i.e., the points in the picture) represent the regions of
the city. The edges (i.e., the line segments or arcs connecting the points) represent the
bridges. Observe that there are two edges between vertices A and B, representing
the two bridges joining regions A and B. Because there is no bridge directly joining
regions B and C, the graph has no edge between vertices B and C.








Figure 7-4 Graph of Königsberg

Let us imagine Euler taking a walk about the city, using the integers 1 to 7
as labels for the seven bridges as shown in Figure 7-5. We can completely describe
Euler's walk by listing, in turn, the regions he visits and the bridges he uses to get
from region to region. (This is the style Euler uses to describe walks in his original
paper.) For example, starting in region D, he might cross bridge 1 to region C, then
use bridge 3 to go to region A, and so on. We can represent this information very
efficiently as D, 1, C, 3, A, .... Here are some other possible walks:

Walk #1: A, 6, B, 5, D
Walk #2: A, 2, C, 3, A
Walk #3: D, 5, B, 7, A, 3, C, 1, D
Walk #4: D, 1, C, 2, A, 3, C, 2, A, 4, D, 5, B, 6, A, 7, B, 5, D
Walk #5: A, 2, C, 1, D, 4, A, 6, B, 7, A, 3, C


The walk sought by the citizens of Königsberg has a special property: it must
use each edge exactly once. Such a walk has come to be called an Eulerian trail. If
it also begins and ends at the same vertex (i.e., in the same region of the city), it is
called an Eulerian circuit. Some of these sample walks (#2, #3, and #4) begin and end
at the same vertex, but none of the walks are Eulerian trails. Walks #1, #2, #3, and #5
do not use all the edges, and walk #4 uses edges 2 and 5 twice each.

Figure 7-6 Graph for Practice Problem 1.

So how did Euler prove that there is no Eulerian trail through the city of
Königsberg? He simply considered the number of times such a walk would have
to pass through each region of the city. For example, walk #5 above is a trail that
begins at vertex A and ends at vertex C, so vertices B and D only occur within the
walk. Hence, for each edge used to enter vertex B or D, there is a different edge
used to leave it; that is, the edges used to traverse these "internal" vertices occur
in pairs. Hence, we know that walk #5 uses an even number of edges connected to
vertex B and an even number of edges connected to vertex D. Since in the entire
graph of the city, there are an odd number of edges connected to vertex B and an odd
number connected to vertex D, the direct conclusion of this analysis is that vertices B
and D cannot be "interior" points in any walk that uses every edge in the Königsberg
graph exactly once. More significantly, this argument can be generalized to give us
the following.








Proposition 1 In any graph, if there are an odd number of edges connected to
a vertex x, then x cannot be an interior vertex (i.e., a vertex other than the starting or
stopping point) in an Eulerian trail.


We will not prove this proposition formally here for lack of formal definitions
of many of the terms, but the preceding paragraph outlines the logic involved. This
proposition completely resolves the Königsberg bridge problem since all four vertices
in the Königsberg graph are connected to an odd number of edges, making all four
of them incapable of being in the interior of an Eulerian trail. The only possible
conclusion is that the Königsberg graph does not have an Eulerian trail.
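
The edge count behind this conclusion is easy to reproduce. The short Python sketch below is an illustration only; the dictionary `bridges` is our own encoding, with the endpoints of bridges 1 to 7 read off from the sample walks listed earlier (for instance, D, 1, C tells us that bridge 1 joins C and D).

```python
from collections import Counter

# Endpoints of each bridge, inferred from the sample walks in the text.
bridges = {
    1: ("C", "D"),
    2: ("A", "C"),
    3: ("A", "C"),
    4: ("A", "D"),
    5: ("B", "D"),
    6: ("A", "B"),
    7: ("A", "B"),
}

# Count how many bridge ends touch each region.
degree = Counter()
for u, v in bridges.values():
    degree[u] += 1
    degree[v] += 1

odd_vertices = sorted(v for v in degree if degree[v] % 2 == 1)
print(dict(degree))
print(odd_vertices)

# An Eulerian trail has only two non-interior vertices (its start and its end),
# so by Proposition 1 it can exist only if at most two vertices are "odd".
print(len(odd_vertices) <= 2)
```

All four regions turn out to touch an odd number of bridges, so none of them can be interior to an Eulerian trail, and a trail cannot consist of endpoints alone.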





Practice Problem 1 Explain why the graph shown in Figure 7-6 does not have
an Eulerian trail.





There are two comments we can make about Euler's solution to this problem.
First, viewing the problem abstractly (i.e., with only the crucial factors present) was
the key to solving the concrete problem about the city. Second, Euler did not simply
solve this one problem. He essentially answered this type of question for any city
whose map is given. Of course, to this point we have only seen a condition that allows
us to say that a graph has no Eulerian trail. In his original paper, Euler also outlined
a way to give a positive answer even for complicated cities, but we should develop
proper definitions and proof tools before telling the rest of the story.


Terminology and Notation


To avoid getting ahead of ourselves, we need to develop some terminology. We
begin with some terms that are suggested by the bridge problem, and then we will
look at some ideas that are motivated by other applications. As you will see, most
terms used in graph theory are chosen to elicit images of the particular property
they name. Unfortunately, the terminology is not entirely standardized. If you read
another book on graph theory, you may find different names for the same concept,
or even occasionally the same name used for a different concept. The lesson here is
to always pay attention to the definitions.













Figure 7-7 
Graph for 
Example 1. 








Definition 


1. A graph G consists of two sets V and E. The elements of V are called vertices
(or nodes), and the elements of E are called edges. Each edge is associated
with one or two vertices, called its endpoints. In the diagram we draw the
edge as a line segment or curved arc joining the endpoints.

2. If an edge has only one endpoint, then the edge joins the vertex to itself.
This is called a loop.

3. If two edges have the same endpoints, they are called multiple edges or
parallel edges.

4. Two nodes that are joined by an edge are said to be adjacent nodes.

5. A walk in a graph is a sequence v₁ e₁ v₂ e₂ ... vₙ eₙ vₙ₊₁ with n ≥ 0 of alternating
vertices and edges, which begins and ends with a vertex and where each edge
in the list lies between its endpoints. If the beginning vertex is the same as
the ending vertex, we say the walk is closed. The length of a walk is the
number of edges in the walk. A walk of length 0 is called a trivial walk.

6. A trail is a walk with no repeated edges, and a path is a walk with no repeated
vertices. A circuit is a closed trail, and a trivial circuit is a circuit with one
vertex and no edges. A trail or circuit is called Eulerian if it uses every edge
in the graph.

7. A cycle is a nontrivial circuit in which the only repeated node is the first/last
one.











Example 1 Let's try out this terminology on the graph in Figure 7-7.


The graph has three nodes and five edges.

Edge 1 is a loop, since it has only one endpoint; that is, A is joined to itself by edge
1. Edges 3 and 4 are parallel.

Here are some walks in the graph:

(a) B, 5, C, 5, B, 2, A, 2, B. This closed walk repeats edge 5, so it is not a trail.

(b) A, 2, B, 5, C. This walk is a path since it does not repeat any vertices.

(c) B. This is considered a trivial walk and a trivial circuit.

(d) A, 1, A, 2, B, 5, C, 3, A. This walk is a circuit since it is a closed trail that starts
and ends at vertex A.

(e) A, 2, B, 5, C, 3, A, 1, A, 4, C. This walk is an Eulerian trail since it uses every
edge exactly once. The first part of this walk, A, 2, B, 5, C, 3, A (obtained by
taking everything from the walk between two occurrences of vertex A), is a
cycle.
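
The definitions can be turned into mechanical checks. The Python sketch below is an illustration under stated assumptions: the edge table `EDGES` encodes the graph of Figure 7-7 as it can be read off from the walks in this example (edge 1 is a loop at A, edge 2 joins A and B, edges 3 and 4 join A and C, edge 5 joins B and C), and the function names are ours. A walk is written as a Python list alternating vertex names and edge labels.

```python
# Assumed encoding of the graph in Figure 7-7: edge label -> endpoints.
EDGES = {1: ("A", "A"), 2: ("A", "B"), 3: ("A", "C"), 4: ("A", "C"), 5: ("B", "C")}
VERTICES = {"A", "B", "C"}

def is_walk(seq):
    """Vertices and edges alternate, and each edge lies between its endpoints."""
    if len(seq) % 2 == 0 or any(v not in VERTICES for v in seq[0::2]):
        return False
    for i in range(1, len(seq), 2):
        if seq[i] not in EDGES:
            return False
        if {seq[i - 1], seq[i + 1]} != set(EDGES[seq[i]]):
            return False
    return True

def is_closed(seq):
    return seq[0] == seq[-1]

def is_trail(seq):
    """A walk with no repeated edges."""
    edges = seq[1::2]
    return is_walk(seq) and len(edges) == len(set(edges))

def is_path(seq):
    """A walk with no repeated vertices."""
    vertices = seq[0::2]
    return is_walk(seq) and len(vertices) == len(set(vertices))

def is_circuit(seq):
    """A closed trail (the one-vertex walk is the trivial circuit)."""
    return is_trail(seq) and is_closed(seq)

def is_eulerian_trail(seq):
    """A trail that uses every edge of the graph."""
    return is_trail(seq) and set(seq[1::2]) == set(EDGES)

def is_cycle(seq):
    """A nontrivial circuit whose only repeated node is the first/last one."""
    inner = seq[0::2][:-1]
    return len(seq) > 1 and is_circuit(seq) and len(inner) == len(set(inner))

walk_a = ["B", 5, "C", 5, "B", 2, "A", 2, "B"]
walk_d = ["A", 1, "A", 2, "B", 5, "C", 3, "A"]
walk_e = ["A", 2, "B", 5, "C", 3, "A", 1, "A", 4, "C"]
print(is_trail(walk_a), is_circuit(walk_d), is_eulerian_trail(walk_e))
```

Note that walk (d) is a circuit but not a cycle, because vertex A appears in its interior as well as at both ends.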


Practice Problem 2 By adding one more edge to the graph in Figure 7-7, create
a graph with an Eulerian circuit.













Graphs in Applications 


As we have seen, graphs are a fairly natural representation of real-world problems,
and as such they are extremely versatile, perhaps to a fault. There are so many
variations on these structures that it makes organizing them for study somewhat
difficult. Every introductory textbook must make some concessions in the variety of
graphs to study in order to build a coherent picture for students to learn the basic
tools of the trade. We pause here briefly to illustrate some of this variety before
making these inevitable concessions.

In any application of graph theory, there are many decisions that will have to
be made at the outset. For example, a graph might be used to show a small airline's
flights among a set of four cities. In Figure 7-8 we show two different graphs that
describe these flights.

In the first graph, the existence of an edge simply indicates that there is a connection
between the two cities (i.e., at least one flight in each direction). There is
no reason to label the edges since referring to the endpoints (e.g., {Rome, Madrid})
unambiguously describes an edge. Since the first graph has no loops and no multiple
edges (this is what we will call a simple graph), its edges can always be referenced in
this way. Moreover, when describing a walk in a simple graph, we can simply list the
vertices of the walk in the order they are traversed.

In the second graph, we explicitly show all the flights labeled by their flight
numbers, and we use arrows to indicate the direction of the flight. When arrows are
put on the edges to indicate direction, we call the graph a directed graph.

Which version of the graph is more appropriate? That depends on the intended
application. The first example is the type you are likely to find in an ad for the airline,
showing the cities it serves and the connections between them. The second captures
more detailed information. Either graph can be used to determine that there is no
direct flight from Madrid to London. To answer the question, "How can I get from
Madrid to London on this airline?" we can also use either graph to get the solution























Go from Madrid to Rome to Paris to London. 


This walk can be unambiguously described as "Madrid, Rome, Paris, London" in
the first graph, or, if we wish to give more detailed instructions, we can describe the
following walk in the second graph:





Figure 7-8 Airline flights








Figure 7-9 Graphs for Example 2.








Go from Madrid to Rome by flight 3421, from Rome to Paris by flight 1703, 
and from Paris to London by flight 3913. 


So the amount of information that is conveyed in a graph depends entirely on
the application at hand. We will focus on the simplest possible structure of vertices
and edges in order to master the essential concepts of this important subject.


More Terms and Notation 


The following definitions formally state some of the preceding ideas, along with some
others that will be used in subsequent discussions. We postpone the formal definition
of a directed graph until Section 7.4.





Definition 


1. A simple graph is a graph with no loops and no multiple (parallel) edges.

2. The unordered list notation [a, b] indicates an edge with endpoints a and b.
This notation is ambiguous if the graph has multiple edges, but we will still
use it if doing so will not cause confusion. In a simple graph, an edge with
endpoints a and b can also be represented as the two-element set {a, b}.

3. If the graph is directed, we use (a, b) rather than [a, b] to indicate a directed
edge from a to b. This use of the usual ordered pair notation emphasizes
that in a directed graph the order of the vertices connected by the edges is
significant.


Example 2 For the graphs shown in Figure 7-9:


1. The picture on the left shows a simple graph with the three edges [1, 2], [2, 4], and
[4, 3].

2. The picture on the right is an arrow diagram for a function. As a graph, it is a
directed graph with the four edges (1, 2), (2, 1), (3, 3), and (4, 2).


It is worth noting that even though directed graphs will receive less emphasis in
this chapter, much of what we say for graphs has an analog for the case of directed
graphs. For example, if the bridges in Königsberg had been one-way streets, we would
use a directed graph and we would be discussing Eulerian circuits in a directed graph.

The following definition contains a few additional terms we will use in the continuing
discussion of the Königsberg bridge problem.





Definition In a graph G, we use the following terminology:


1. An edge e is said to be incident with a node v if v is an endpoint of e.

2. The degree of a node v, denoted by deg(v), is the number of times v appears
as an endpoint of an edge. That is, deg(v) is the number of edges that are
incident with v, except that loops are counted twice.

3. A graph G is connected if there is a walk between any pair of distinct nodes.

4. A graph H is a subgraph of a graph G if all nodes and edges in H are also
nodes and edges in G.

5. A connected component of a graph G is a connected subgraph H of G such
that no other connected subgraph of G containing H exists.


Figure 7-10 Graph for Example 3.





Example 3 Refer to the graph consisting of 8 nodes and 12 edges shown in
Figure 7-10.


1. The degree of node 3 is 4: this node appears in the four edges [1, 3], [2, 3], [5, 3],
and [3, 4]. The degree of node 6 is 3.

2. There is no walk that begins at node 4 and ends at node 6, so the graph is not
connected.

3. The graph has two connected components.


Practice Problem 3 Answer the following questions for the graph given above 
in Example 3: 





(a) Find the degree of node 1. 

(b) There are two connected components, which we will call H_1 and H_2. Subgraphs
H_1 and H_2 are graphs in their own right, each consisting of nodes and edges. Give
a complete list of the nodes and edges that make up each of these subgraphs.

(c) Find an Eulerian trail in the connected component that includes node 6. 

(d) Find a subgraph that is connected and contains just the nodes 2, 3, 4, and 5. Explain
why this subgraph is not a connected component of the graph.


Eulerian Graphs 


We are now prepared to finish discussing Euler's achievement. So far we have only
seen a condition we can use in order to get a “no” answer to the question of finding an
Eulerian trail in a given graph. To complete the story, we need to see what conditions
will guarantee a "yes" answer. For the other half of the picture, we will focus on the
question of a city having an Eulerian circuit instead of an Eulerian trail.





Definition A graph G is Eulerian if there is a circuit in G that involves 
every edge exactly once. Recall that such a circuit is called an Eulerian circuit. 








Euler found a simple way to decide if any given connected graph is Eulerian. 


Theorem 2 Let G be a connected graph. The graph G is Eulerian if and only if
every node in G has even degree. 


The proof, which is very elegant and uses the recursive structure of graphs,
will be given in the next section where we discuss the proof techniques for graphs
in more detail. The following example illustrates the process for constructing an
Eulerian circuit in a connected graph in which every vertex has even degree. Notice
where the construction process repeats itself recursively; this will (eventually) be
the induction step in our proof.

Figure 7-11 Finding a circuit in G.

Figure 7-12 The induction step.

Figure 7-13 Graph for Practice Problem 4.

Example 4 Let G be the graph given on the left in Figure 7-11.


1. First we find any circuit C, as shown in Figure 7-11 on the right. (We can do this
formally by starting at node 1, and walking along unused edges until we get back
to node 1.) Notice that we simply list vertices in our description of C since this
unambiguously determines the intervening edges in this simple graph.

2. We next form the graph G' by removing from G all the edges in the circuit C.
To simplify this example, we also remove those vertices (1 and 2) that have all
of their edges removed by this process. The resulting graph G' is not connected,
so we will call its components H_1 and H_2 in Figure 7-12 and continue. Since
these are smaller connected graphs with every vertex of even degree, this same
process will determine that both H_1 and H_2 have Eulerian circuits, respectively,
C_1 = 3, 4, 5, 6, 7, 11, 12, 13, 6, 3, 11, 13, 14, 3 and C_2 = 8, 9, 10, 8.

3. We can now piece together C, C_1, and C_2. Specifically, in C = 1, 2, 3, 7, 8, 11, 14, 1,
we replace the 3 with C_1 and the 8 with C_2 to get the Eulerian circuit

1, 2, 3, 4, 5, 6, 7, 11, 12, 13, 6, 3, 11, 13, 14, 3, 7, 8, 9, 10, 8, 11, 14, 1
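The splice-the-circuits idea of Example 4 can be sketched in code. What follows is not the recursive procedure exactly as the example describes it but a common stack-based variant (often called Hierholzer's algorithm) that performs the same splicing implicitly; the function name and the edge list, read off from the circuit listed above, are our own assumptions. The sketch assumes the graph is connected with every node of even degree.

```python
from collections import defaultdict

def eulerian_circuit(edges, start):
    """Return an Eulerian circuit as a list of nodes.

    Assumes the graph is connected and every node has even degree.
    """
    # adjacency[u] holds (neighbor, edge_id) pairs; ids let us mark edges as used.
    adjacency = defaultdict(list)
    for edge_id, (u, v) in enumerate(edges):
        adjacency[u].append((v, edge_id))
        adjacency[v].append((u, edge_id))
    used = [False] * len(edges)
    stack = [start]
    circuit = []
    while stack:
        u = stack[-1]
        # Discard edges at u that were already walked from their other end.
        while adjacency[u] and used[adjacency[u][-1][1]]:
            adjacency[u].pop()
        if adjacency[u]:
            v, edge_id = adjacency[u].pop()
            used[edge_id] = True
            stack.append(v)              # keep walking along an unused edge
        else:
            circuit.append(stack.pop())  # u has no unused edges left: emit it
    return circuit[::-1]

# Assumed edges of the graph G: consecutive pairs of the circuit listed in Example 4.
listed = [1, 2, 3, 4, 5, 6, 7, 11, 12, 13, 6, 3, 11, 13, 14, 3,
          7, 8, 9, 10, 8, 11, 14, 1]
EXAMPLE_4_EDGES = list(zip(listed, listed[1:]))

circuit = eulerian_circuit(EXAMPLE_4_EDGES, 1)
assert circuit[0] == circuit[-1] == 1
assert len(circuit) == len(EXAMPLE_4_EDGES) + 1   # every edge used exactly once
```

The circuit returned need not be the one found by hand in the example; a graph usually has many Eulerian circuits, and which one appears depends on the order in which unused edges are chosen.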


Practice Problem 4 Consider the graph in Figure 7-13 after the circuit C =
3, 6, 9, 7, 5, 10, 7, 4, 6, 8, 3 has been removed from the graph.


(a) Draw the two components H_1 and H_2 that are left behind.
(b) Find Eulerian circuits C_1 and C_2, respectively, for the two components.
(c) Paste together C, C_1, and C_2 to make an Eulerian circuit for the original graph G.


Graphs with Eulerian Trails 


If you have been really paying attention in this section or else were never paying
attention in elementary school, you might have noticed the similarity of the previous
problem to a puzzle that usually makes the third grade circuit (so to speak). The
problem is to draw the picture in Figure 7-14 on the left (an "envelope") without
lifting pencil from paper and without retracing any line. This problem was originally
introduced in Chapter 1. We now have the mathematical sophistication to allow a
thorough analysis of the problem.

Figure 7-14 The envelope and its graph.

This is the same problem as finding a trail in the graph on the right in Figure 
7-14, using every edge exactly once. (Remember that we call this an Eulerian trail.) 
It might sound like we are asking that the graph be Eulerian, but we are not. The 
difference is that we are not looking for a circuit—that is, we don't care in this puzzle 
whether the pencil ends up where it starts! Take a moment to trace the figure as 
described if you have never done it before. 

How closely related is this problem to the one about the bridges of Königsberg? 
Can we find a general result similar to Theorem 2 about this kind of graph? These 
questions have an easy and elegant answer. However, before we can give a complete 
characterization of graphs with Eulerian trails, we need to establish a couple of simple 
facts about graphs in general. 














Theorem 3 In any graph, the sum of the degrees of the vertices is equal to twice
the number of edges. In symbols,

\sum_{i=1}^{n} \deg(v_i) = 2m

where v_1, v_2, ..., v_n are the vertices of the graph and m is the number of edges in the
graph.

PROOF Each edge has two (not necessarily distinct) endpoints, and the degree
of a node counts the number of times it appears as an endpoint of an edge. So, in
summing the degrees of the nodes, we count each edge exactly twice.
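Theorem 3 is easy to test on a concrete graph. The sketch below is illustrative only: the helper name `degrees` is ours, and the edge list is the one assumed for Figure 7-10 (taken from the solution to Practice Problem 3). It counts endpoint appearances, so a loop contributes 2 to its node, and then compares the degree sum with twice the number of edges.

```python
from collections import Counter

def degrees(edges):
    """Count how many times each node appears as an endpoint of an edge."""
    degree = Counter()
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1   # for a loop (u == v) the same node is counted twice
    return degree

# Assumed edge list for the graph of Example 3 (8 nodes, 12 edges).
EDGES = [(1, 1), (1, 2), (1, 3), (1, 4), (1, 5), (2, 3), (3, 4), (3, 5),
         (6, 7), (6, 8), (7, 8), (6, 7)]

degree = degrees(EDGES)
assert degree[1] == 6 and degree[3] == 4 and degree[6] == 3
assert sum(degree.values()) == 2 * len(EDGES)   # 24 = 2 * 12

# Nodes of odd degree in this graph; their number turns out to be even.
odd_nodes = sorted(v for v, d in degree.items() if d % 2 == 1)
assert odd_nodes == [6, 7]
```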





Corollary 4 In any graph G, the number of nodes with odd degree is even. 


PROOF In Chapter 2 you proved that the sum of two even numbers is even, 
that the sum of two odd numbers is even, and that the sum of an odd and an even 
number is odd. It follows that, in calculating the sum of all the node degrees, the 
even numbers certainly sum to an even number, and from Theorem 3, the entire 
sum is even. Therefore, the odd numbers must sum to an even number too. And 
if the sum of a bunch of odd numbers is even, then there must be an even number 
of them.




We are now ready to state and prove a general solution to all puzzles of the 
envelope-drawing type. The proof is a good example of letting previous results do 
all the hard work for you. 


Theorem 5 A connected, non-Eulerian graph G has an Eulerian trail if and only 
if G has exactly two nodes of odd degree. Moreover, the trail must begin and end at 
these two nodes. 


PROOF First, suppose that G is a connected graph that is not Eulerian but 
does have an Eulerian trail, say, starting with node v and ending with node w. By 
Proposition 1, we know that every node except possibly v and w has even degree. 
Since the graph is not Eulerian, at least one node has odd degree (by Theorem 
2). So, by the previous corollary, at least two nodes have odd degree. The only
possibility is that v and w are the only two nodes of odd degree in G. 

Now suppose that G is a connected graph with exactly two nodes of odd 
degree, say, x and y. Form the graph G' by adding to G the edge [x, y]. Now 
G' has all nodes of even degree, so it must have an Eulerian circuit by Theorem 
2. Deleting the edge [x, y] from this Eulerian circuit gives us an Eulerian trail 
in G from x to y.
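Taken together, Theorem 2 and Theorem 5 reduce the envelope-type question to counting odd-degree nodes. The following sketch makes that test explicit. It assumes the input graph is connected (connectivity is not checked), the function name is our own, and the envelope's edge list is an assumption read off from the trail given in Example 5.

```python
from collections import Counter

def euler_classification(edges):
    """Classify a connected graph by its number of odd-degree nodes."""
    degree = Counter()
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    odd = sorted(v for v, d in degree.items() if d % 2 == 1)
    if not odd:
        return "Eulerian circuit", odd   # Theorem 2: all degrees even
    if len(odd) == 2:
        return "Eulerian trail", odd     # Theorem 5: trail joins the two odd nodes
    return "neither", odd

# Assumed edges of the envelope graph in Figure 7-14.
ENVELOPE = [(3, 2), (2, 1), (1, 5), (5, 4), (4, 6), (6, 5), (5, 2), (2, 6), (6, 3), (3, 4)]

assert euler_classification(ENVELOPE) == ("Eulerian trail", [3, 4])
# Adding one more edge between the two odd nodes, as in the proof, makes every degree even.
assert euler_classification(ENVELOPE + [(3, 4)]) == ("Eulerian circuit", [])
```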

















Example 5 The graph of the "envelope" we saw earlier can thus be modified
by adding an edge between its two nodes (3 and 4) of odd degree to get the graph
shown in Figure 7-15. The solution method of Example 4 can then be used to find
the Eulerian circuit 3, 2, 1, 5, 4, 6, 5, 2, 6, 3, 4, 3 in this new graph. Dropping the last
edge [4, 3] from this circuit gives us the Eulerian trail 3, 2, 1, 5, 4, 6, 5, 2, 6, 3, 4 in the
original graph.

Figure 7-15 Adding an edge makes the envelope Eulerian.

















Practice Problem 5 Suppose the Eulerian circuit you found had been 
1,5, 4,6, 5,2, 6,3, 4, 





Explain how to use this circuit to find the trail in the original envelope. 
Practice Problem 6 How can one draw the picture in Figure 7-16 without lifting
the pencil or retracing any part of the figure? (This picture of a bird comes from the
sona sand drawing tradition of the Chokwe people of South Central Africa.¹)

Figure 7-16 Picture for Practice Problem 6.

Example 6 Here is a magic trick based on Eulerian graphs. The explanation is
left as an exercise for the reader. You are the magician. Take a standard set of dominoes
and discard the double 6, double 5, double 4, double 3, double 2, double 1, and double
blank pieces. Have one spectator choose a single domino and give it to you. Turn
your back and have a second spectator line up the remaining dominoes using the
conventional rules that the faces touching must share the same number. The spectator
should only have to make one such line; allow other spectators to help if he or she is
having difficulty. Now you reveal (dramatically) the numbers on the ends of the line,
and show that they are the same two numbers on the domino originally chosen.

¹ To learn more about this interesting story-telling tradition, see Geometry from Africa by Paulus Gerdes
(MAA, 1999), Washington, DC, pp. 150-205.


Solutions to Practice Problems


1. Vertices c, f, h, and j are each connected to an odd
number of edges. By Proposition 1, none of these can
be "interior" vertices of an Eulerian trail, and certainly
all four of them cannot make up the starting and ending
points of a trail. Hence, this graph has no Eulerian trail.

2. The Eulerian trail in the example starts at A and ends at
C. If we add another edge joining A and C, we can use
that edge to get back to A and complete the Eulerian
circuit.

3. (a) The degree is 6. Don't forget that for a loop the node
is counted as an endpoint twice, once for each end
of the edge.
(b) H_1 has nodes 1, 2, 3, 4, 5 and edges [1,1], [1,2],
[1,3], [1,4], [1,5], [2,3], [3,4], [3,5], and H_2 has
nodes 6, 7, 8 and edges [6,7], [6,8], [7,8], and a second
[6,7] edge.
(c) 6, 7, 8, 6, 7 is one solution.
(d) Along with nodes 2, 3, 4, 5, include edges [3,5], [3,2],
[3,4]. This is not a connected component because it
is contained within the connected subgraph H_1.

4. C_1 = 4, 2, 5, 4, 3, 1, 4 and C_2 = 8, 9, 10, 12, 11, 8 are Eulerian
circuits for H_1 and H_2, respectively, shown
in Figure 7-17. Pasting these together with C =
3, 6, 9, 7, 5, 10, 7, 4, 6, 8, 3 gives us the Eulerian circuit

3, 6, 9, 7, 5, 10, 7, 4, 2, 5, 4, 3, 1, 4, 6, 8, 9, 10, 12, 11, 8, 3

for the original graph G.

Figure 7-17 Solution to Practice Problem 4.

5. A circuit can be started at any node in the circuit. So
we could rewrite the circuit we found so it either begins
or ends with the edge we want to drop. One possibility
is 4, 3, 2, 1, 5, 4, 6, 5, 2, 6, 3, 4. If we drop the first edge,
we get the trail given in the example. Dropping the
last edge gives 4, 3, 2, 1, 5, 4, 6, 5, 2, 6, 3.

6. If we label the nodes as shown in Figure 7-18, the picture
can be drawn by visiting the nodes in order

1, 2, 3, 4, 6, 9, 7, 5, 8, 10, 11, 12, 11, 9, 6, 4, 7, 10, 8, 5, 3, 1

Figure 7-18 Solution to Practice Problem 6.


Exercises for Section 7.1


1. Refer to the graph G in Figure 7-19.
(a) Find the number of nodes and the number of edges.
(b) Find the degree of each node.
(c) Compare the sum of the degrees with the number
of edges. How are these two numbers related?
(d) List all cycles in the graph.

2. Refer to the graph G_1 in Figure 7-19.
(a) How many nodes are there?
(b) List the degrees of the nodes in G_1.
(c) What is the sum of the degrees of the nodes in G_1?
(d) How many edges are there in G_1?

3. Refer to the graph G_2 in Figure 7-19.
(a) How many nodes are there?
(b) How many edges are there?
(c) List the degrees of the nodes in G_2.
(d) Construct a second graph H with the same number
of nodes and edges, but having a node with a higher
degree than any node in G_2.








Figure 7-19 Graphs for Exercises 1, 2, and 3 (Graph G, Graph G_1, and Graph G_2).


4. Refer to the graph in Figure 7-20.
(a) Find two different circuits with a length of exactly
7 that begin and end at 1.
(b) Find a walk from 1 to 8 that uses five edges and is
not a trail.
(c) Find a trail from 3 to 5 that uses all the edges incident
with 4.
(d) Find a circuit starting and ending at 8 that is not a
cycle.
(e) Find a cycle of length 4.

5. Consider the graph in Figure 7-20.
(a) List the degrees of all the nodes. Note that all the
degrees are even.
(b) Create a circuit using this strategy: Start at node
1, and choose any edge to move to another node.
At that node, choose any unused edge to move to
another node. Continue until you get back to node
1.
(c) Repeat the previous exercise. Make at least one different
choice so that you get a different circuit.
(d) Do you think you will always eventually get back
to the original node no matter which edge is chosen
at each step? Will this happen for any connected
graph you start with?

Figure 7-20 Graphs for Exercises 4 and 5 (Graph H_1 and Graph H_2).

6. For each of these walks, find a shorter walk that begins
and ends with the same nodes as the given walk, and
which does not visit any node more than once.

(a) 1, 2, 3, 2, 4

(b) 44,562

(c) 9, 3, 2, 7, 8, 2, 4

(d) 1, 2, 3, 4, 5, 6, 7, 4, 8, 3, 9

(e) 3, 5, 6, 5, 4, 8, 9, 3, 2, 8, 7

(f) 8, 9, 2, 4, 6, 8, 11, 4, 2, 5, 7, 3, 2, 10

7. The degree sequence of a graph is the list of degrees
of the nodes of the graph, listed from largest degree to
smallest. For example, the degree sequence of the graph
in Exercise 4 is 6, 6, 4, 4, 4, 4, 2, 2, 2, 2.

(a) Give an example of a simple graph with degree sequence
3,

(b) Give an example of a simple graph with degree sequence
3,

(c) Give an example of a simple graph with degree sequence
3, 3, 2, 2, 2, 2.

(d) Explain why there is no simple graph with degree
sequence 7, 6, 5, 4, 3, 2, 1.

(e) Explain why there is no simple graph with degree
sequence 3, 3, 3, 2, 2, 1, 1.

(f) Explain why there is no simple graph with degree
sequence 5, 5, 3, 3, 1, 1.

8. Each of the graphs in Figure 7-21 is not Eulerian. In each
case, add the fewest edges possible to create a graph that
is Eulerian.

9. For each of the graphs in Figure 7-22, find an Eulerian
circuit or an Eulerian trail, or else explain why neither
is possible.























Figure 7-22 Graphs for Exercise 9.











10. The floorplan in Figure 7-23 shows the Boatsville College
Museum of Art. Is it possible for a guest to visit the
entire museum by going through every doorway once
and only once? (Convert the floorplan to a graph.)

Figure 7-23 Floorplan for Exercise 10.

11. The floorplan in Figure 7-24 shows the Boatsville College
Museum of Art after renovations have been made.
Is it now possible for a guest to visit the entire museum
by going through every doorway once and only once?

Figure 7-24 Floorplan for Exercise 11.

12. The formal induction proof (in the next section) of Theorem 2
will involve the statement P(n) that says, “For
every connected graph G with n edges, if every vertex
of G has even degree, then G is Eulerian.” Verify
P(0), P(1), P(2), and P(3) by drawing every possible
connected graph of the appropriate size satisfying the
hypothesis, and verifying the conclusion for each graph.

13. Sarah and Emily decided to go to a part of town (see
the map in Figure 7-25, where a node represents a corner and
an edge represents a street) to distribute their radical
political literature. They would like to split up and walk
from corner to corner until every street in town has
been covered by exactly one of them only once. So there
should be no street traveled either by both women or by
one woman twice, and every street should be traveled
by someone. Is it possible? Characterize all graphs for
which such a scheme would work. Give an example of
a simple, connected graph for which it would not work.

Figure 7-25 Graph for Exercise 13.

14. Suppose that Sarah and Emily from the previous exercise
want to take their crusade to another city. Can they
tell from looking at the “graph” of the new city how
many additional helpers they will have to bring in order
to cover the city in the sense of the previous exercise?
How?

15. What is the minimum number of times that one must
lift one's pencil to draw each picture shown in Figure
7-26? (Hint: This is related to Exercise 8.)

Figure 7-26 Graphs for Exercise 15.

16. From your investigation in Exercise 15, make a conjecture
about the number of times one must lift one's
pencil to draw any given picture.

17. Explain the magic trick in Example 6.

18. What is the minimum number of edges that a simple,
connected graph with n vertices can have? (You do not
have to prove your answer is correct.)

19. In light of the previous exercise, explain why
there is no connected graph with degree sequence
2, 2, 1, 1, 1, 1, 1, 1 (as defined in Exercise 7).

20. What is the maximum number of edges that a simple
graph with n vertices can have? Justify your answer!

21. What is the maximum number of edges that a simple,
disconnected graph with n vertices can have? (You do
not have to prove your answer is correct.)

22. Prove that for a simple graph G in which all the n nodes
have a degree of at least 2, there are at least n edges.
(Hint: Use Theorem 3.)

23. Let's say that a connected graph is almost Eulerian if
it contains a closed walk that uses every edge in the
graph once and one edge in the graph twice. In Figure
7-27, the graph shown on the left is “almost Eulerian”
since the closed walk W = a, b, c, b, d, a uses the edge
[b, c] twice and every other edge once. On the other
hand, the graph on the right is not “almost Eulerian.”
Find a condition like the one in Theorem 2 that completely
characterizes connected graphs that are almost
Eulerian.

Figure 7-27 Graphs for Exercise 23 (left: "almost Eulerian"; right: not "almost Eulerian").





7.2 Proofs About Graphs and Trees 





Our goal in this section is to discuss the proof techniques most commonly used in
the study of graphs. In many cases, we will illustrate an idea first with simple graphs
in order to keep the notation and definitions as streamlined as possible. Remember
that in a simple graph, we can identify an edge by specifying its two endpoints, and
we can represent a











Direct Proofs of Graph Properties 


We begin by noticing that except for a higher degree of notational complexity, our
basic ideas for direct proofs still hold. Specifically, we try to write the statement to
be proven in a simple "if, then" form so that we can invite the Reader to give us any
example satisfying the hypothesis, and we strive to convince her that the conclusion
must also hold true for her example.

In our first example of a direct proof, we will need to use the following simple
fact about connected graphs.














Lemma 1 A graph G is connected if and only if for every pair of different vertices
a and b in G, there is a path in G starting at a and ending at b.


The proof of this lemma follows from Proposition 6, which is itself proved a bit later
in this section. That proof will use mathematical induction, so we prefer to postpone
it for the moment.


Proposition 2 Let G be a simple, connected graph, and let a and b be vertices in
G such that there is no edge between a and b. If G' is the graph formed by adding the
edge (a, b) to G, then G' has a cycle that contains the edge (a, b).





PROOF Let a simple, connected graph G be given, let a and b be vertices in G
such that there is no edge between a and b, and form the graph G' by adding the
edge (a, b) to G.

Since G is connected, we know by Lemma 1 there is a path

P = v_0, v_1, ..., v_k

where v_0 = a and v_k = b in G. Of course, every edge on this path is in the larger
graph G', and the edge (a, b) is in G'; hence, the entire walk

W = v_0, v_1, ..., v_k, v_0

is in G'. Since P is a path, it contains no repeated vertices, so the only repeated
vertex in W is the beginning/ending vertex v_0 = a. Hence, W is a cycle in G'.
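The proof is constructive: find a path from a to b in G, then close it up with the new edge. A minimal sketch of that construction follows. The function names, the adjacency-list representation, and the small test graph are our own illustrative assumptions; breadth-first search is just one convenient way to produce the path that Lemma 1 guarantees.

```python
from collections import deque

def find_path(adjacency, a, b):
    """Breadth-first search: return a path from a to b (no repeated vertices), or None."""
    parent = {a: None}
    queue = deque([a])
    while queue:
        u = queue.popleft()
        if u == b:
            path = []
            while u is not None:      # follow parent links back to a
                path.append(u)
                u = parent[u]
            return path[::-1]
        for w in adjacency[u]:
            if w not in parent:
                parent[w] = u
                queue.append(w)
    return None                       # only possible if the graph is not connected

def cycle_through_new_edge(adjacency, a, b):
    """Return the cycle W = v_0, ..., v_k, v_0 of G' obtained by adding edge (a, b) to G."""
    path = find_path(adjacency, a, b)  # P = v_0, ..., v_k with v_0 = a and v_k = b
    return path + [a]                  # the new edge (b, a) closes the walk

# Hypothetical graph G: the path 1 - 2 - 3 - 4, with no edge between 1 and 4.
G = {1: [2], 2: [1, 3], 3: [2, 4], 4: [3]}
assert cycle_through_new_edge(G, 1, 4) == [1, 2, 3, 4, 1]
```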





This property of simple graphs and its converse are very important for other
proofs in this section, so we have included the converse as a "fill-in-the-blank" proof
in Exercise 2 at the end of this section. You might want to try it before reading on.





Proposition 3 When an edge is removed from a cycle in a connected graph, the
result is a graph that is still connected.


PROOF See Exercise 2 for the outline of this direct proof.


Just as with any direct proof, it is our prerogative to prove the contrapositive of
the statement instead of the original if, then statement. The following proof shows
how this might work.


Proposition 4 For every connected graph G with at least one edge, if G has no 
cycles, then G has at least one vertex of degree 1. 


It is impossible for a connected graph with at least one edge to have a vertex of 
degree 0, so the contrapositive statement is “For every connected graph G with at 
least one edge, if G has every vertex of degree at least 2, then G has a cycle.” This is 
the form of the statement we will prove. 





PROOF (Of the contrapositive statement.) Let G be a connected graph with at
least one edge and having every vertex of degree at least 2. Let n be the number
of vertices in G, and choose v_0 to be any vertex in G. Now choose vertices to build
a walk as follows:

• Since v_0 has a degree of at least 2, we can choose vertex v_1 to be the other
endpoint of an edge incident with v_0.

• Since v_1 has a degree of at least 2, and we have already used one edge [v_0, v_1]
incident with v_1, we can choose vertex v_2 to be the second endpoint of a different
edge incident with v_1.

• Since v_2 has a degree of at least 2, and we have already used one edge [v_1, v_2]
incident with v_2, we can choose vertex v_3 to be the second endpoint of a different
edge incident with v_2.

• And so on until vertex v_n has been chosen.

This walk uses n + 1 vertices, but the entire graph G only has n vertices, so (by
the pigeonhole principle, Practice Problem 3 of Section 2.5, to be precise) there
must be a first value j such that v_j = v_i for some i < j. This choice of i and j will
guarantee that

v_i, v_{i+1}, ..., v_j

is a cycle in G.
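The walk-until-something-repeats argument translates directly into a procedure. The sketch below is restricted to simple graphs (so that "a different edge" just means "a different neighbor"), and the function name and test graph are our own assumptions. It stops at the first repeated vertex, which corresponds to the first value j in the proof.

```python
def find_cycle(adjacency, start):
    """Walk without turning straight back until a vertex repeats; return that cycle.

    Assumes a simple graph in which every vertex has degree at least 2.
    """
    walk = [start]
    position = {start: 0}     # index in the walk of each vertex's first visit
    previous = None
    while True:
        current = walk[-1]
        # Degree >= 2 in a simple graph guarantees a neighbor other than `previous`.
        nxt = next(w for w in adjacency[current] if w != previous)
        if nxt in position:
            # nxt = v_j = v_i: the cycle is v_i, v_{i+1}, ..., v_j.
            return walk[position[nxt]:] + [nxt]
        position[nxt] = len(walk)
        walk.append(nxt)
        previous = current

# Hypothetical graph: triangles 1-2-3 and 4-5-6 joined by the edge 3-4.
G = {1: [2, 3], 2: [1, 3], 3: [2, 1, 4], 4: [3, 5, 6], 5: [4, 6], 6: [5, 4]}
assert find_cycle(G, 1) == [1, 2, 3, 1]
assert find_cycle(G, 5) == [3, 2, 1, 3]   # the cycle need not pass through the start
```

As the second call shows, the cycle found is v_i, ..., v_j and may leave out the early part of the walk, just as in the proof.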


The type of graph under scrutiny in the previous proposition is an important
one from several points of view.











Definition A tree is a connected, simple graph that has no cycles. Vertices
of degree 1 in a tree are called leaves of the tree.


Example 1 Answer each of these questions for each of the trees shown in
Figure 7-28.

1. How many nodes does the graph have? How many edges does the graph have?
2. How many leaves does the tree have?
3. Choose any edge and remove it. Is the resulting graph a tree?
4. Choose any two nodes and add an edge joining them. Is the resulting graph a tree?

Figure 7-28 Graphs for Example 1.

SOLUTION

1. The tree on the top has eight nodes and seven edges. The tree on the bottom
has seven nodes and six edges.
2. In the tree on the top, vertices b, c, d, f, and h are leaves. In the tree on the
bottom, vertices a, b, and f are leaves.
3. In each case, the result is not connected, so it is not a tree.
4. In each case, the result has a cycle, so it is not a tree.





The graphs in the previous example do not look much like the game trees and
decision trees we have seen in earlier sections of this book. Those examples have
even more structure than a typical graph, so we will defer studying them in detail
until the excursion (Section 7.6) at the end of this chapter.

Practice Problem 1 Prove that, for every tree G, removing any edge from G will
result in a disconnected graph. (Hint: Write this statement in if, then form, and then
consider the contrapositive.)








Trees are often described in two different ways, providing us with two of the 
reasons why they are important. Some people think of a tree as a graph with the 
most edges possible without containing cycles—we can call this being “a maximal 
acyclic graph.” This is essentially what we proved in Proposition 2. On the other 
hand, some people think of a tree as a graph with the least edges possible while 
remaining connected, which we could call being “a minimal connected graph.” This 
is the essence of Practice Problem 1. Like many other ideas in discrete math, the most
useful thing is to be able to think of a tree in either way, depending on the situation.













Induction Proofs of Graph Properties 


We have seen that our usual direct proof style can be used with the graph theory 
definitions. However, the main proof technique for studying properties of graphs is 
mathematical induction. If you have only ever proved numerical facts using induction,
this technique might seem strange when applied to the decidedly nonnumerical world 
of graph theory. 


Proposition 5 For every tree G with at least one edge, G has at least two leaves
(i.e., vertices with degree 1).


PROOF Let P(n) be the statement “Every tree with n edges has at least two
leaves.” Since the only tree with one edge is shown in Figure 7-29, which clearly
has both vertices with degree 1, it follows that P(1) is as true as it is uninteresting.

Figure 7-29 The base case for the proof of Proposition 5.

Let m ≥ 2 be given such that statements P(1), P(2), ..., P(m - 1) have all
been checked to be true. In considering the next statement P(m), we let a tree G
with m edges be given. By Proposition 4, there is a vertex v of degree 1, and so
there is only one edge {v, w} in G that has v as an endpoint. Form the graph G' by
removing vertex v and edge {v, w} from G.

Now G' is a tree with m - 1 edges, so since we have already checked statement
P(m - 1), we can conclude that G' has at least two leaves. But w is the only vertex
in G' that has a different degree in G' than it does in G. Choose one of the leaves
in G' (other than w, if w is even a choice) and call it u. The degree of u in G is the
same as the degree of u in G'. Hence, u and v are leaves in G.


Since the structure of the induction proofs in this section is quite a bit more 
complicated than other proofs in this book, we will follow many of them with an 
example that traces the steps of the proof with a concrete example. 





Example 2 (Illustrating Proposition 5.) Assuming that we have checked
every possible tree with five or fewer edges, suppose we are given the tree G with six
edges in Figure 7-30. The steps in the proof call for us to choose vertices v and w as
shown in G. The graph G' on the right shows the leaf u that is described in the proof.

Figure 7-30 Illustrating Proposition 5.





The same type of proof can be used on structures within a graph (like subgraphs, 
walks, circuits, etc.) as well. 


Proposition 6 In any simple graph G with distinct vertices a and b, if there is a 
walk from vertex a to vertex b, then there is a path from a to b. 


PROOF (By induction.) Let the simple graph G be given, and let P(n) be the statement
“For all vertices a ≠ b in G, if there is a walk of length n from a to b in G,
then there is a path from a to b in G.”

Statement P(1) is true since a walk from a to b of length 1 must look like a, b,
which is a path from a to b.

Let m ≥ 2 be given such that statements P(1), ..., P(m - 1) have already
been checked, and we are now considering statement P(m). Let vertices a and b
in G be given, and let the walk W = v_0, v_1, v_2, ..., v_{m-1}, v_m be a walk of length m
with v_0 = a and v_m = b.

Now it might be the case that W is actually a path from a to b, which would
mean that we have our desired conclusion. So let's address the other possible case,
where W is not itself a path. By the definition of path, this means that some vertex
is repeated in W. That is, in the list of vertices that describes W, there are values
i < j such that v_i = v_j.

Form the walk W' as follows. If i = 0, W' = v_j, v_{j+1}, ..., v_m. Otherwise, if
j = m, W' = v_0, v_1, ..., v_i; otherwise, W' = v_0, ..., v_i, v_{j+1}, ..., v_m. This walk
still connects a to b, but the length of W' is less than the length of W. In particular,
if we let k be the length of W', then we know that statement P(k) has
already been checked, and so it has already been verified that there is a path from
a to b. This is our desired conclusion for statement P(m), and so this completes
the induction.














Example 3 (Illustrating Proposition 6.) In the graph shown in Figure
7-31, consider the walk W = a, c, d, e, f, d, g, e, b of length 8 that is not itself a path.
If we match this with the notation of the proof above, we would have this:

W:   a    c    d    e    f    d    g    e    b
     v_0  v_1  v_2  v_3  v_4  v_5  v_6  v_7  v_8

Since W is not a path, we know it must have repeated vertices. If we look at the example,
we see that v_2 = v_5, so when we form W', we simply remove everything in between v_2
and v_5 like this:

W':  a    c    d    g    e    b
     v_0  v_1  v_2  v_6  v_7  v_8

Figure 7-31 Illustrating Proposition 6.
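Each use of the induction step in Proposition 6 cuts one closed detour out of the walk, so applying it repeatedly amounts to a simple loop. The following sketch (the function name is ours, not the text's) scans the walk once and, whenever a vertex reappears, discards everything back to its first visit.

```python
def walk_to_path(walk):
    """Shorten a walk to a path with the same endpoints by cutting out closed detours."""
    path = []
    index = {}   # position in `path` of each vertex currently on the path
    for vertex in walk:
        if vertex in index:
            # v_i = v_j: drop v_{i+1}, ..., v_j and continue from v_i.
            keep = index[vertex] + 1
            for dropped in path[keep:]:
                del index[dropped]
            del path[keep:]
        else:
            index[vertex] = len(path)
            path.append(vertex)
    return path

# The walk W = a, c, d, e, f, d, g, e, b from Example 3.
W = list("acdefdgeb")
assert walk_to_path(W) == list("acdgeb")
```

On the walk of Example 3 the loop removes exactly the detour between v_2 and v_5. The later vertex e is no longer a repeat by the time it is reached, because its first occurrence was discarded with the detour.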





Practice Problem 2 By adapting the proof of Proposition 6, prove that in any
simple graph G, if there is a circuit containing the edge (v, w), then there is a cycle
containing the edge (v, w).

Many of our induction proofs about graphs will have this same basic form. It
looks considerably more complex than some of our other forms, but it is really no
different from the form of induction we have been using in other settings.








Claim. Every graph G with property p also has property q.

Proof by induction. Let P(n) be a statement tantamount to “Every graph
on n edges with property p also has property q.” Verify the first few statements
P(1), P(2), ... by listing all possible graphs with property p with the given number
of edges.

Let m be given such that statements P(1), P(2), ..., P(m - 1) have all been
checked. To prove statement P(m), let a graph G with property p and having m
edges be given. Designate an edge e (either Reader chooses or Author dictates)
and form the graph G' by removing e from G. Graph G' or its connected components
will be graphs with fewer than m edges and so the fact that the statements
P(1), ..., P(m - 1) have already been checked will lead us to conclude that G' (or
its components) has property q.

Use the fact that G' (or its components) has property q to conclude that G itself
has property q.










We illustrate this basic proof form on the following important result about trees. 
Note that there are many other ways to prove this same result, one of which is given 
in the exercises at the end of this section. 





Theorem 7 If T is a tree with n edges, then T has n + 1 vertices.


PROOF Let P(n) be the statement “Every tree with n edges has n + 1 vertices.”
Since the only tree with 0 edges consists of one vertex and no edges, it follows that
P(0) is an uninteresting but completely true statement.

Let m ≥ 1 be given such that statements P(0), P(1), ..., P(m − 1) have all
been checked to be true. In considering the next statement P(m), we let a tree T
with m edges be given. Choose any edge {v, w} in T, and form the graph T' by
removing this edge from T.

Now T' has two components $H_1$ and $H_2$, each of which is a tree. Let $k_1$ and
$k_2$ be the number of edges in $H_1$ and $H_2$, respectively. Since $k_1 + k_2 = m - 1$, it
follows that $k_1 < m$ and $k_2 < m$, so we know that statements $P(k_1)$ and $P(k_2)$ have
each already been checked to be true. Hence, there are $k_1 + 1$ vertices in $H_1$ and
$k_2 + 1$ vertices in $H_2$. Every vertex in T is in one of $H_1$ or $H_2$, so the number of
vertices in T is

$$(k_1 + 1) + (k_2 + 1) = (k_1 + k_2) + 2 = (m - 1) + 2 = m + 1.$$

Hence, statement P(m) has been checked to be true, completing the
induction.
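The induction step can be mirrored by a short recursive computation. The sketch below is only an illustration, not part of the text's proof: it assumes a tree is given as a set of two-element frozensets (one per edge), and the names `component` and `count_vertices` are hypothetical. It removes one edge, splits the remaining edges into the two smaller trees, and adds the two vertex counts, exactly as in the argument above.

```python
def component(vertex, edges):
    """Return the set of vertices reachable from `vertex` using `edges`."""
    seen, stack = {vertex}, [vertex]
    while stack:
        v = stack.pop()
        for e in edges:
            if v in e:
                (w,) = e - {v}          # the other endpoint of edge e
                if w not in seen:
                    seen.add(w)
                    stack.append(w)
    return seen


def count_vertices(edges):
    """Count the vertices of a tree given as a set of 2-element frozensets."""
    if not edges:
        return 1                        # P(0): one vertex and no edges
    e = next(iter(edges))               # designate an edge {v, w}
    v, w = tuple(e)
    rest = edges - {e}                  # the graph T' with that edge removed
    side_v = component(v, rest)
    h1 = {x for x in rest if x <= side_v}   # edges of the tree containing v
    h2 = rest - h1                          # edges of the tree containing w
    return count_vertices(h1) + count_vertices(h2)


# A hypothetical tree with seven edges (not the tree of Figure 7-32).
tree = {frozenset(p) for p in
        [(1, 2), (2, 3), (2, 4), (4, 5), (5, 6), (5, 7), (7, 8)]}
print(len(tree), count_vertices(tree))
```

The recursion is correct only when the input really is a tree, since it relies on the fact that removing an edge leaves exactly two components.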


Example 4 (Illustrating Theorem 7) Trace the logic in the induction step
of Theorem 7 assuming that m has been given as 7, and the tree with seven edges in
Figure 7-32 has been given.

Figure 7-32 Tree T, illustrating the proof of Theorem 7.

Figure 7-33 Illustrating the proof of Theorem 7 (first picture: tree T; second picture: trees $H_1$ and $H_2$).

SOLUTION We are assuming that statements P(0), P(1), ..., P(6) have
already been checked. In the given tree with seven edges, since edge {v, w} is
identified as in the first picture shown in Figure 7-33, we delete it to get the
trees $H_1$ and $H_2$ shown in the second picture. The tree $H_1$ has four edges, so
statement P(4) says that $H_1$ has five vertices. Similarly, tree $H_2$ has two edges,







72 Proofs About Graphs and Trees 525 


and so by statement P(2), we know that $H_2$ has three vertices. Hence, T has a
total of 5 + 3 = 8 vertices, as predicted by statement P(7).


We can also see how the induction proof form allows us to give a careful proof 
of Theorem 2 of Section 7.1. 


Theorem 2 revisited Let G be a connected graph. The graph G is Eulerian 
if and only if every node in G has even degree. 


PROOF Before we begin the proof, we note that it is trivially true (and
uninteresting) if G consists of a single vertex and no edges. The remainder of the proof
assumes that G contains at least one edge.

First we show that if G is Eulerian, then every node has even degree.
Suppose that G has an Eulerian circuit, and let x stand for any node of the graph.
We will show that x has even degree. We write the Eulerian circuit as a walk
$v_0, e_1, v_1, e_2, \ldots, e_k, v_k$, observing that because it is a circuit the last node is the same
as the first. Because G is connected, x is the same as one or more of the vertices $v_1$
through $v_k$. Each time (if any) it appears in the middle of the walk (as one of the
vertices $v_1, \ldots, v_{k-1}$), it is the endpoint for two different edges, the one just before it
and the one just after it. (In general, the node labeled $v_i$ is the endpoint for $e_i$
and $e_{i+1}$.) If it appears at the beginning (as vertex $v_0$), again it is the endpoint for
two different edges, $e_1$ and $e_k$. So the degree of node x can be computed as twice
the number of occurrences of x in the list of vertices $v_1$ through $v_k$. That is, the
degree of x is even.

Next we show that if every node in G has even degree, then G is Eulerian. We
do this by induction on the number of edges. Let P(n) be the statement “Every
connected graph with n edges in which every node has even degree has an Eulerian
circuit.” It is easy to check that the statement P(1) is true since there are very few
graphs with one edge. Now let m ≥ 2 be given such that we know how to find an
Eulerian circuit in any connected graph where each node has even degree and
in which there are fewer than m edges. (That is, we have checked P(1), P(2), ...,
P(m − 1).)

Consider a connected graph G that has m edges in which every node has
even degree. In particular, every node has a degree greater than 1, so by the
contrapositive of Proposition 4, there must be a cycle C in G. Form the graph G'
by removing the edges involved in C from G. Removing the edges of a cycle lowers
the degree of each node by 0 or 2, so every node of G' still has even degree. By the
inductive assumption, each component $H_1, \ldots, H_k$ of G' has an Eulerian circuit
$C_1, \ldots, C_k$. Each of these must meet C, say, at $v_1, \ldots, v_k$, so we can “insert” each
one into C (i.e., arrange $C_i$ to start and end with $v_i$, and then write $C_i$ in place of
$v_i$ in C) to form the desired circuit.




































Graph Algorithms and Spanning Trees 


We will address our last graph proof technique in the context of a specific exam- 
ple. In developing algorithms to solve problems, we will often construct graphs (or 
subgraphs) by an iterative process. To show that the end result of the algorithm is a 
success, we will prove that each step in the process maintains some desired property. 
Ultimately, this is another proof by induction, but treating it separately will allow us 
to concentrate on these first examples of graph algorithms. 













Definition Let G be a simple, connected graph. The subgraph T is a spanning
tree of G if T is a tree and every node in G is a node in T.





Does every connected graph necessarily have a spanning tree? The answer is 
yes. In Exercise 24 you are asked to prove this directly using induction. However, it
is also possible to give an algorithm for finding a spanning tree. One simply removes 
edges that are part of a cycle until there are no longer any cycles left. More formally, 











Spanning Tree Algorithm 


• Begin with a simple, connected graph $G_0$.
• For each i ≥ 1, as long as there is a cycle in $G_{i-1}$,

choose an edge e in any cycle of $G_{i-1}$, and form the subgraph $G_i$ of
$G_{i-1}$ by deleting e from $G_{i-1}$.

• The final result $G_k$ will be a spanning tree of $G_0$.
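The algorithm can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the text's own implementation: edges are stored as two-element frozensets, the function names are hypothetical, and an edge is recognized as lying on a cycle by testing whether the graph stays connected without it (in a connected graph, these two conditions are equivalent).

```python
from collections import deque


def connected_without(nodes, edges, skip):
    """Return True if the graph is connected when edge `skip` is ignored."""
    adj = {v: set() for v in nodes}
    for e in edges:
        if e != skip:
            u, w = tuple(e)
            adj[u].add(w)
            adj[w].add(u)
    start = next(iter(nodes))
    seen = {start}
    queue = deque([start])
    while queue:
        v = queue.popleft()
        for w in adj[v]:
            if w not in seen:
                seen.add(w)
                queue.append(w)
    return len(seen) == len(nodes)


def spanning_tree_by_deletion(nodes, edges):
    """Delete cycle edges one at a time until no cycle remains."""
    current = {frozenset(e) for e in edges}
    while True:
        # An edge lies on a cycle exactly when its removal keeps the graph connected.
        removable = next((e for e in sorted(current, key=sorted)
                          if connected_without(nodes, current, e)), None)
        if removable is None:
            return current              # no cycles left
        current.remove(removable)


# Hypothetical example: a square 1-2-3-4 with the diagonal {1, 3}.
nodes = {1, 2, 3, 4}
edges = [(1, 2), (2, 3), (3, 4), (4, 1), (1, 3)]
tree = spanning_tree_by_deletion(nodes, edges)
print(sorted(sorted(e) for e in tree))
```

Different choices of which cycle edge to delete generally give different spanning trees; the sketch simply takes the first candidate in sorted order.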











Practice Problem 3 Refer to the graph in Figure 7-34 for the following problems:

(a) Apply the preceding algorithm to the graph. In each step, list the cycle you choose,
and the edge you choose from that cycle.
(b) How many edges are there in the resulting spanning tree?
(c) Do you think everyone in the class got the same answer for the first part? Do you
think everyone got the same answer for the second part?





Figure 7-34 Graph for Practice Problem 3.

Proposition 8 For any simple, connected graph G, the algorithm above produces
a spanning tree of G.


PROOF Suppose the algorithm is executed starting with $G_0 = G$, where G is
the given simple, connected graph. We will use induction to prove the statement
P(n): “The graph $G_n$ is a connected subgraph of G that uses all the vertices of
G,” for values of n from 0 to k. The first statement P(0) says, “The graph $G_0$ is a
connected subgraph of G that uses all the vertices of G.” Since $G_0 = G$, this
statement is silly but quite true.

Let the integer m (with 1 ≤ m ≤ k) be given such that P(m) is the first
statement not yet checked. That is, we have already checked statements
P(0), P(1), ..., P(m − 1), and we are ready to consider statement P(m). Since
graph $G_m$ is obtained by removing an edge from a cycle in the connected (by
P(m − 1)) graph $G_{m-1}$, we know from Proposition 3 that graph $G_m$ is connected.
Since (by P(m − 1)) graph $G_{m-1}$ uses all the vertices of G and we do not remove
any vertices in forming $G_m$, we know that $G_m$ uses all the vertices in G. Finally,
since (by P(m − 1)) graph $G_{m-1}$ is a subgraph of G and $G_m$ is a subgraph of $G_{m-1}$,
we can infer that $G_m$ is a subgraph of G. Hence, statement P(m) is true, completing
the induction.
















How does this prove that the algorithm works? We need to say one more
thing. The algorithm only stops when it produces a graph $G_k$ having no cycles. The
induction proof above ensures that $G_k$ is a connected subgraph of G using all the
vertices of G. Hence, by the definition of “spanning tree” we can conclude that
$G_k$ is a spanning tree of G.








The idea of a spanning tree as a graph’s “skeleton” is important for understanding
many abstract properties of connected graphs, but there is a practical implication
when the graph represents costs in a connected network. For example, suppose the
graph shown in Figure 7-35 represents the potential connections of phone lines among
towns A, B, C, and D, where the numbers on the edges represent the cost of establishing
that connection. Constructing lines that make a spanning tree would provide basic
communication between all the towns, but finding a spanning tree with the smallest
possible total cost would be of additional interest to the local phone company.

For such a “weighted” graph, it is not enough simply to find any spanning tree;
we would like to find one whose total weight is as low as possible. When faced with a
problem of this type, it is typical to try to design an algorithm that efficiently constructs
a solution to the problem. We will see one example of such an algorithm along with a
proof that it really does work. The exercises consider two other algorithms, including
one that generalizes our simple algorithm for finding a spanning tree. Since we intend
to write proofs about these algorithms, the first order of business is to establish the
formal definitions.











Figure 7-35 A weighted graph.

















Definition A weighted graph is a graph G = (V, E) along with a function
w : E → R that associates a numerical weight to each edge. If G is a weighted
graph, then T is a minimal spanning tree of G if it is a spanning tree and no
other spanning tree of G has smaller total weight.





















Example 5 The weighted graph shown in Figure 7-36 has three spanning
trees: $T_1$ = {{a, b}, {a, c}, {b, d}} with total weight 70; $T_2$ = {{a, b}, {b, c}, {b, d}} with
total weight 60; and $T_3$ = {{a, c}, {b, c}, {b, d}} with total weight 80. $T_2$ is a minimal
spanning tree since each of the other two trees has a larger total weight.

Figure 7-36 Weighted graph for Example 5.

Practice Problem 4 Find a simple, connected weighted graph that has more than
one minimal spanning tree.

Here is one possible algorithm for finding a minimal spanning tree. First, we
present the algorithm, along with an example illustrating its use. We then prove that
it does indeed yield a minimal spanning tree.








Prim’s Minimal Spanning Tree Algorithm 


• Given a connected, simple weighted graph G with n + 1 nodes.
• Let $v_0$ be any node in G, and let $T_0 = \{v_0\}$ be a tree with one node and no
edges.
• For each k from {1, 2, ..., n},

Let $E_k$ = {e an edge in G : e has one endpoint in $T_{k-1}$ and the other
endpoint not in $T_{k-1}$}.

Let $e_k$ be the edge in $E_k$ with the smallest weight. (In case of a tie,
choose any edge of the smallest weight.)

Let $T_k$ be the tree obtained by adding edge $e_k$ (along with its node
not already in $T_{k-1}$) to $T_{k-1}$.

• $T_n$ is the tree returned by the algorithm.
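The steps of Prim's algorithm translate almost line for line into code. The sketch below is illustrative only: the function name `prim` and the data layout (a dictionary from two-element frozensets to weights) are assumptions, and the sample weights are inferred so as to agree with the totals 70, 60, and 80 quoted in Example 5, since the figure itself is not reproduced here.

```python
def prim(nodes, weights, start):
    """Return the edges e_1, ..., e_n chosen by Prim's algorithm.

    `weights` maps frozenset({u, v}) to the weight of edge {u, v};
    the graph is assumed to be simple and connected.
    """
    in_tree = {start}                   # nodes of T_0
    chosen = []
    while len(in_tree) < len(nodes):
        # E_k: edges with exactly one endpoint in the current tree.
        frontier = [e for e in weights if len(e & in_tree) == 1]
        # e_k: a lightest edge of E_k (ties broken by sorted endpoints).
        e_k = min(frontier, key=lambda e: (weights[e], sorted(e)))
        chosen.append(e_k)
        in_tree |= e_k                  # T_k gains the new endpoint
    return chosen


# Assumed weights, chosen to be consistent with the totals in Example 5.
example_weights = {
    frozenset("ab"): 20,
    frozenset("ac"): 40,
    frozenset("bc"): 30,
    frozenset("bd"): 10,
}
example_tree = prim({"a", "b", "c", "d"}, example_weights, "a")
print([sorted(e) for e in example_tree],
      sum(example_weights[e] for e in example_tree))
```

Recomputing the frontier from scratch at every step keeps the sketch close to the definition of $E_k$; a priority queue would be the usual choice for large graphs.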








Figure 7-37 Illustrating Prim's algorithm.

Example 6 Use Prim’s algorithm to find a minimal spanning tree for the graph
in Figure 7-37.

SOLUTION Let $T_0$ be the tree with one node a and no edges. The remainder
of the algorithm can be traced as shown in Table 7-1. The total weight of the
final tree $T_6$, shown in Figure 7-38, is 2 + 3 + 6 + 7 + 3 + 5 = 26.

Figure 7-38 Solution to Example 6.

Practice Problem 5 In step 4 there was another choice for $e_4$. What minimal
spanning tree do you get if you use that other choice?

We conclude our discussion by proving that this algorithm does, indeed, yield a
minimal spanning tree.


Lemma 9 For each k ≥ 0, $T_k$ is included in a minimal spanning tree of G.


PROOF We prove by induction on k the statement “$T_k$ is included in a minimal
spanning tree of G,” which we denote by P(k). Since $T_0$ consists of only one node
and no edges, it is contained in every spanning tree of G, and hence the statement
P(0) is true. Now let m ≥ 1 be given such that P(0), P(1), ..., P(m − 1) have all
been checked to be true, and consider the tree $T_m$. According to the algorithm, $T_m$
was formed by adding the edge $e_m$ to the tree $T_{m-1}$. By the inductive hypothesis
P(m − 1), we know that the tree $T_{m-1}$ is included in a minimal spanning tree of
G. Let's call this minimal spanning tree T. The edge $e_m$ either is in T or it is not,
so we argue by cases.











Case 1. If it so happens that the edge $e_m$ is actually in T, then it must be the case
that $T_m$ is included in T, which means that statement P(m) is true.














$k$ | $E_k$ | $e_k$ | Nodes of $T_k$
1 lla. b). a. c]. fad) a.b] la, b} 

2 lla. cl. a, d), [b.e]. [^ d]] [b.d] la, b, d} 

3 || Mach Del les di ld, fll elldoel | (del la, b, d,e} 

4 tla, e}. e. d). ld. f). 1d. g). le. el) Ia. f] la, b.d, e, f] 
5 | (ach le, dhe f). Fd. g] [e el (fel le. f) qa, b.c. d.e, f] 
6 Ad. gh [e 81. (fal) ths] la, b. d.e, f. g. c) 





Table 7-1 Tracing Prim's Algorithm 







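Case 2. If $e_m$ is not an edge in T, then we can form the new graph H by adding $e_m$
to T. By Exercise 12, this new graph will have exactly one cycle. This cycle
contains $e_m$, which has one endpoint in $T_{m-1}$ and the other endpoint not in
$T_{m-1}$, so the cycle must contain a second edge f, different from $e_m$, that also
has exactly one endpoint in $T_{m-1}$. Let T' be the graph formed by removing f
from H. By Exercise 13, T' is a spanning tree of G. Since f belongs to $E_m$ and
$e_m$ was chosen as an edge of smallest weight in $E_m$, the weight of $e_m$ is no more
than the weight of f, and it follows that the weight of T' is no more than the
weight of T. Hence, T' is a minimal spanning tree of G. Because f is not an
edge of $T_{m-1}$, the tree T' contains $T_{m-1}$ together with $e_m$, so $T_m$ is included in
T'. This establishes statement P(m) in this case as well, completing the induction.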


Theorem 10 The tree $T_n$ returned by Prim's algorithm is a minimal spanning
tree for the weighted simple, connected graph G on n + 1 nodes.

PROOF Since the algorithm adds one node at each step, it follows that $T_k$ has
k + 1 nodes for all k. This means that $T_n$ has n + 1 nodes, so $T_n$ is a spanning tree
of G. By Lemma 9, $T_n$ is included in a minimal spanning tree of G. Both are
spanning trees of G, so by Theorem 7 both have exactly n edges, and $T_n$ must be
that minimal spanning tree of G.


Solutions to Practice Problems 


1. Proof. The contrapositive can be written, “For every
simple, connected graph G, if there is an edge we can
remove from G and still have a connected graph, then
G has a cycle.”

Let a simple, connected graph G be given such that
there is an edge {a, b} in G whose removal from G creates
a connected graph G'. Since G' is connected, there
is a path P in G' from a to b by definition of “connected.”
This path is also in the graph G as is the edge {a, b}, so
adding the edge {a, b} to P forms a cycle in G.
2. Proof by induction. Let the simple graph G with edge
{v, w} be given, and let P(n) be the statement, “If there is a
circuit of length n containing {v, w} in G, then there is a
cycle containing {v, w} in G.”

A circuit with length 3 containing {v, w} in a simple
graph must look like v, w, x, v, where x is different
from v and w. Any such circuit must be a cycle,
so statement P(3) is true. Now let m ≥ 4 be given such
that statements P(3), ..., P(m − 1) have already been
checked, and we are now considering statement P(m).
Let $C = u_0, u_1, u_2, \ldots, u_{m-1}, u_m$ with $u_0 = u_m$ be a
circuit of length m that contains the edge {v, w}. We can
choose values i and j with j > i such that $u_i = u_j$ and j
is as small as possible. If i = 0 and j = m, then there are
no other repeated vertices in C, which means that C is a
cycle. Since C contains {v, w}, this establishes statement
P(m) for this case.

If i > 0 or j < m, then we can break C into the
following two smaller circuits:

$$C' = u_i, u_{i+1}, \ldots, u_j \qquad \text{and} \qquad C'' = u_0, \ldots, u_i, u_{j+1}, \ldots, u_m$$

(See Exercise 4 for more on this.) Since every edge of the circuit C
appears in one of these new circuits, we know that edge
{v, w} is contained in one of them. The new circuit that
contains {v, w} has length (call it k) less than m, and
we know that statement P(k) has already been checked.
Hence, we conclude that there is a cycle in G that contains
edge {v, w}. This is our desired conclusion for statement
P(m), and so this completes the induction.
3. (a) One possible solution is to (1) remove {8, 1} from
the cycle 1, 2, 3, 4, 5, 6, 7, 8, 1, (2) remove {6, 7} from
the cycle 3, 4, 6, 7, 3, (3) remove {3, 4} from the cycle
3, 4, 8, 2, 3, (4) remove {5, 6} from the cycle 4, 5, 6, 4,
and (5) remove {2, 3} from the cycle 2, 3, 7, 8, 2.
(b) There are seven edges.
(c) There are many ways to do the first part, but they
all leave seven edges in the second part.
4. There are many possible answers. One is to change the
weight of {a, c} in the example to 30.
5. Figure 7-39 shows a minimal spanning tree.






















Figure 7-39 Solution to Practice Problem 5. 







Exercises for Section 7.2 


1. Each of the following statements is false. In each case,
demonstrate this by supplying a counterexample.

(a) In any simple graph G, if W is a closed walk, then
W contains a cycle.

(b) In any connected graph G, if every vertex in G has
degree 3, then G must contain a cycle of length 3.

(c) In any graph G, some pair of vertices must have the
same degree.

2. Fill in the blanks to complete the following proof.

Proposition. When an edge is removed from a cycle
in a connected graph, the result is a graph that is still
connected.

Proof. Let G be a connected graph with the cycle 
[Em 


and let G' be the graph obtained by removing the edge 
n. n] from G. 

To show that G' is connected, we have to show 
that every pair of nodes has a walk between them in 
G^. Let nodes x and y be given, Since i 
there is a walk 


Wena, 





tes 


where consecutive nodes form edges in G. Either edge 
Toy, v] 8 used in walk W or it is not, so we argue in two 





Case 1, If edge [i.v] is not used in walk W, then 





is a walk from x to y with edges in G'. 
Case 2. If edge [v vi] is used in this walk, then 


is a walk from x to y with edges in G', 
Since there is a walk from x to y in G' in either case, we
conclude that G' is connected.

3. (This is identical to Exercise 6 in Section 7.1.) For each
of these walks, find a shorter walk that begins and ends
with the same nodes as the given walk, and that does
not visit any node more than once.

(a) 12,324

(b) 44,562

(c) 9, 3, 2, 7, 8, 2, 4

(d) 1, 2, 3, 4, 5, 6, 7, 4, 8, 3, 9

(e) 3, 5, 6, 5, 4, 8, 9, 3, 2, 8, 7

(f) 8, 9, 2, 4, 6, 8, 11, 4, 2, 5, 7, 3, 2, 10







4. Within each of the given circuits C, identify the circuits
C' and C'' described in the solution to Practice Problem
2. In each case, decide which of the two new circuits
contains the edge {a, b}.
(a) C = a, b, c, d, b, e, a
(b) C = b, c, d, e, b, a, f, b
(c) C =c,b,d.e, f. dig base
(d) C=a,e.b, fie d,a,
5. For each circuit C in Exercise 4, find a cycle that contains
the edge {a, b}.
6. Use Exercise 2 and Theorem 7 to prove that for any
simple, connected graph G, if G has exactly one cycle,
then G has the same number of nodes and edges.
7. Use the pigeonhole principle (Practice Problem 3 of
Section 2.5) to prove that every simple graph with at
least two vertices must have two vertices with the same
degree. (HINT: First explain why a simple graph with
n vertices cannot have both a vertex of degree 0 and a
vertex of degree n − 1.)
8. Prove the following statement using an argument similar
to the proof of Proposition 4:

Let m ≥ 0 be given. In a simple, connected graph
G, if every vertex in G has degree at least m, then
there is a path in G of length at least m.





9. Prove that in a graph G on n nodes, the longest path in
G uses no more than n − 1 edges.

10. Prove that for every connected graph G, if G has no
cycles, then for every pair of vertices a, b in G, there is
only one path from a to b in G. (HINT: Consider the
contrapositive statement.)

11. Prove that if G is a tree with at least one edge, then
deleting any edge from G will result in a graph with
exactly two connected components.

12. Prove that if G is a tree with (two different) nodes u and
w and {u, w} is not an edge in G, then adding edge {u, w}
to G will form a graph with exactly one cycle. (HINT:
Proposition 2 tells us that there is at least one cycle in
this new graph.)

13. Prove that if G is a simple, connected graph with exactly
one cycle and e is an edge on that cycle, then the graph
formed by removing e from G is a tree.

14. Fill in the blanks to complete an alternative proof to
Proposition 7.

Proposition. For every n ≥ 0, any tree with n edges
must have exactly n + 1 nodes.

Proof by induction on n ≥ 0. Let P(n) represent the
statement “Any tree G with n edges has n + 1 nodes.”
The first statement to be checked is P(0), which
states, ______.

The only graph satisfying the hypothesis of this statement
is the one shown below, and this graph does have
one vertex.

Hence, P(0) is true. Now let m ≥ 1 be given such
that P(m) is the first statement that has not yet been
checked. Let a tree G with m edges be given. By
Proposition ______, there is a vertex v in G with degree 1.
Let G' be the graph resulting from removing this vertex
and its single edge from G.

Now G' has ______ edges, so by statement
P(______) (which has already been checked to be
true), we know that G' has ______ nodes. Since
______, G has ______ nodes. This verifies
statement P(m), completing the induction.
15. The induction proof of Proposition 3 involves the statement
P(n) that says, “For every tree T, if T has n edges,
then T has at least two vertices of degree 1.” Verify P(1),
P(2), P(3), and P(4) by drawing every graph of the
appropriate size satisfying the hypothesis, and verifying
the conclusion in each case.

16. Let G be a tree, and let k be the number of vertices in G
whose degree is at least 3. Prove that G has at least k + 2
leaves. (HINT: Use Theorem 3 from Section 7.1.)

17. The induction proof of Theorem 7 involves the statement
P(n) that says, “If G is a tree with n edges, then
G has n + 1 vertices.” Verify P(0), P(1), P(2), and P(3)
by drawing every possible graph of the appropriate size
satisfying the hypothesis, and verifying the conclusion
in each case.

18. Fill in the blanks to complete the proof that goes with
Exercise 22 in Section 7.1. Notice that here the induction
is on the number of vertices, not on the number of
edges in the graph, so the proof looks a bit different.

Proposition. A simple, connected graph with n ≥ 1
vertices must have at least n − 1 edges.

Proof by induction. Let P(n) represent the statement
“Any simple, connected graph G with n vertices has at
least n − 1 edges.” The first statement to be checked is
P(1), which states, ______.

The only graph satisfying the hypothesis of this statement
consists of one vertex and 0 edges. Hence, P(1) is
true. Now let m ≥ 2 be given such that P(m) is the first
statement that has not yet been checked. Let a connected
graph G with m vertices be given. Now either
G has a vertex of degree 1 or else all its vertices have
degree at least 2, so we argue in two cases.







Case 1. If there is a vertex v in G with degree 1, then let
G' be the graph resulting from removing this vertex
and its single edge. Now G' has ______
vertices, so by statement ______ (which
has already been checked to be true), we know
that G' has at least ______ edges. Since
______, G has at least ______ edges.
This verifies statement P(m).

Case 2. If every vertex in G has degree at least 2, then
by Exercise ______ in Section 7.1, G has
at least m edges. This verifies statement P(m)
without even needing to use the induction
hypotheses.

In either case, we have verified statement P(m),
completing the induction.


19. The induction proof in Exercise 18 involves the statement
P(n) that says, “For any simple, connected graph
G, if G has n vertices, then G has at least n − 1 edges.”
Verify P(1), P(2), P(3), and P(4) by drawing every possible
simple, connected graph of the appropriate size
satisfying the hypothesis, and verifying the conclusion
for each graph.

20. Prove that if G is a simple graph with n vertices, k connected
components, and no cycles, then G has n − k
edges. (HINT: Use mathematical induction.)

21. Fill in the missing steps in the following proof of the
correct answer to Exercise 20 in Section 7.1.

Claim. For every simple graph G with n vertices, if G is
not connected, then G has no more than $\frac{(n-1)(n-2)}{2}$ edges.

Proof. Let G be a simple, disconnected graph with n
vertices, and let $H_1$ and $H_2$ be subgraphs of G such that
every vertex in G is in exactly one of $H_1$ or $H_2$, and there
are no edges connecting a vertex in $H_1$ with a vertex in
$H_2$. Let k be the number of vertices in $H_1$, so there are
n − k vertices in $H_2$. By Exercise ______ in Section 7.1, the
maximum number of edges in $H_1$ is $\frac{k(k-1)}{2}$ and the
maximum number of edges in $H_2$ is $\frac{(n-k)(n-k-1)}{2}$. Hence, the
maximum number of edges in G is

$$f(k) = \frac{k(k-1)}{2} + \frac{(n-k)(n-k-1)}{2}, \qquad \text{where } 1 \le k \le n-1.$$

If we treat n as a constant, this function of k is the
parabola

f(k) = ______ k^2 + ______ k + ______

with a minimum value occurring at k = ______
and maximum value occurring at k = ______ and
k = ______.

Hence, the maximum number of edges G can have is ______.













Figure 7-40 Graph for Exercise 22. 


22. This exercise uses terminology from Section 4.5. For a
graph G with vertex set V, define the relation R on V
such that

a R b means there is a walk from a to b in G.

(a) Prove that the relation R is reflexive, symmetric, and
transitive, and hence R is an equivalence relation.
(b) Give the partition of V induced by R for the graph
shown in Figure 7-40.
(c) Describe in general the partition of V induced by R
using the terminology of this chapter.

23. Find all minimal spanning trees in each graph shown in
Figure 7-41.

24. Using Proposition 3, write an induction proof of the
statement “For all n ≥ 0, every connected graph with n
edges has a spanning tree.”

25. Consider the following alternative algorithm (due to
Kruskal) for finding a minimal spanning tree. Use it to
find a minimal spanning tree in each of the graphs in
Exercise 23.
• Given a simple, connected weighted graph G with
at least one edge, let $T_0$ begin as a graph with the
same set of nodes as G but with no edges.
• List the edges of G from lightest to heaviest:
$e_1, e_2, \ldots, e_m$.
• For each i from 1 to m, let $T_i$ be
$T_{i-1} + e_i$ if $e_i$ can be added to $T_{i-1}$ without
creating a cycle, and
$T_{i-1}$ otherwise.
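Kruskal's rule can be tried out with a short program. The following is a sketch under stated assumptions rather than a reference implementation: the name `kruskal` and the dictionary layout are hypothetical, the test "can be added without creating a cycle" is carried out with a simple union-find structure (a standard technique, not something the exercise prescribes), and the sample weights are made up for illustration.

```python
def kruskal(nodes, weights):
    """Return the edges kept by Kruskal's algorithm.

    `weights` maps frozenset({u, v}) to the weight of edge {u, v}.
    """
    parent = {v: v for v in nodes}      # union-find forest over the nodes

    def find(v):
        # Follow parent links to the representative of v's component.
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    tree = []
    # List the edges from lightest to heaviest (ties broken by endpoints).
    for e in sorted(weights, key=lambda e: (weights[e], sorted(e))):
        u, w = tuple(e)
        root_u, root_w = find(u), find(w)
        if root_u != root_w:            # e joins two components: no cycle
            parent[root_u] = root_w
            tree.append(e)
    return tree


# Hypothetical weighted graph on the nodes a, b, c, d.
sample_weights = {
    frozenset("ab"): 20,
    frozenset("ac"): 40,
    frozenset("bc"): 30,
    frozenset("bd"): 10,
}
sample_tree = kruskal({"a", "b", "c", "d"}, sample_weights)
print([sorted(e) for e in sample_tree],
      sum(sample_weights[e] for e in sample_tree))
```

On this sample the edges are examined in the order {b, d}, {a, b}, {b, c}, {a, c}, and only the last one is rejected.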


26. Prove that for any simple, connected weighted graph G,
for each graph $T_i$ produced by Kruskal's algorithm (in
Exercise 25), there is a minimal spanning tree T of G
such that $T_i$ is a subgraph of T.
27. Fill in the details of the following alternate proof of
Proposition 4:

Claim. Every tree has at least one leaf.

Proof. Let T be a tree with n edges. We know from
Proposition 7 that T has ______ nodes. If each of
these nodes has degree greater than 1, then the sum
of the degrees of the nodes is at least ______.
But according to Theorem 3 of Section 7.1, the sum
of the degrees of the nodes in T is ______. Since
______, it is therefore impossible for every node
in T to have degree greater than 1. That is, T has
at least one node of degree 1. This node is a leaf by
definition.

28. Modify the proof in Exercise 27 to show that every tree
has at least two nodes of degree 1.

29. The map in Figure 7-42 shows the main roads in the
Borough of Boatsville, along with the cost of plowing each
street during a heavy snowstorm. Find the cheapest set
of roads that must be plowed in order for everyone to
be able to travel from any point to any other point in
town.

30. Consider the following algorithm on a simple, connected
graph G:

• Let $G_0 = G$.

















Figure 7-41 Graphs for Exercise 23.













• Repeat the following as long as possible: If $G_i$ has a
cycle, let e be the most expensive edge on that cycle,
and let $G_{i+1}$ be the graph obtained by removing e
from $G_i$.
• The resulting graph has no cycles. Return this as the
result T.
Find a minimal spanning tree of the graph in Exercise
29 using this algorithm.
31. For each of the graphs in Exercise 23, find a minimal
spanning tree using the algorithm in Exercise 30.
32. Prove by induction that in the algorithm in Exercise 30,
each $G_i$ is connected. Conclude that the resulting graph
T is a spanning tree of G.
33. Prove by induction that in the algorithm described in
Exercise 30, each $G_i$ contains a minimal spanning tree
of G. Conclude that this algorithm produces a minimal
spanning tree of G.








Figure 7-42 Graph for Exercise 29.


7.3 Isomorphism and Planarity 





In this section, we get back to studying more concepts of graph theory that can be 
applied directly to problems both practical and recreational. Because the proofs of 
some of these concepts are better left to a specialized course in graph theory, “proof”
will play a less prominent role in this section than it did in the last. Before we get to 
the applications, however, we need to address one more abstract property of great 
practical importance. 





Isomorphic Graphs 


Explore more on the Web.

Example 1 Exercise 6 of Section 7.1 asked for an example of a simple graph
with six vertices having degrees 3, 3, 2, 2, 1, and 1, respectively. The solutions in Figure
7-43 are all correct. Which graph is not like the other two?





SOLUTION If we imagine the graphs being constructed of tacks (for nodes)
and elastic string (for edges), then we can see that graphs (a) and (c) would
look exactly the same if we did the following to graph (a):

• Move tacks 1 and 6 so they fall below tacks 2 and 5, respectively.

Figure 7-43 Graphs (a), (b), and (c) for Example 1.







• Rotate the entire picture 90° clockwise.
• Rename tacks 1, 2, 3, 4, 5, 6 as z, y, x, w, v, u, respectively.

On the other hand, this type of moving and relabeling cannot possibly make
graph (b) look like the other two. We will justify this claim a little later.


We will express the notion of “sameness” in this example by saying that the 
graphs (a) and (c) are isomorphic, or equivalently, that there is an isomorphism 
between the two graphs. This concept is crucially important in any discussion of 
graphs, because it makes us come to terms with the question "Are these graphs the 
same?” 

To get the idea, we will give only the formal definition of isomorphic simple
graphs since it is easiest to state, thanks to the shortcut of referring to an edge by the
set of its endpoints.








Definition Simple graphs G and H are called isomorphic if there is a one-to-one
and onto function f from the nodes of G to the nodes of H such that
{v, w} is an edge of G if and only if {f(v), f(w)} is an edge of H. The function
f is called an isomorphism. Hence, an isomorphism is simply a rule associating
nodes that preserves the edges joining the nodes.
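The definition can be checked mechanically for a proposed function f. The sketch below is an illustration only: the function name `is_isomorphism`, the representation of f as a dictionary, and the two small path graphs are all assumptions made for the example, not graphs from the text.

```python
from itertools import combinations


def is_isomorphism(f, nodes_g, edges_g, nodes_h, edges_h):
    """Return True if the dict `f` is an isomorphism from G to H."""
    # f must be defined on every node of G and be onto the nodes of H.
    if set(f) != set(nodes_g) or set(f.values()) != set(nodes_h):
        return False
    # f must be one-to-one: no two nodes of G share an image.
    if len(set(f.values())) != len(f):
        return False
    eg = {frozenset(e) for e in edges_g}
    eh = {frozenset(e) for e in edges_h}
    # {v, w} is an edge of G if and only if {f(v), f(w)} is an edge of H.
    return all((frozenset((v, w)) in eg) == (frozenset((f[v], f[w])) in eh)
               for v, w in combinations(nodes_g, 2))


# Hypothetical graphs: the path 1-2-3 and the path x-y-z.
nodes_g, edges_g = [1, 2, 3], [(1, 2), (2, 3)]
nodes_h, edges_h = ["x", "y", "z"], [("x", "y"), ("y", "z")]

good = {1: "x", 2: "y", 3: "z"}     # sends the middle node to the middle node
bad = {1: "y", 2: "x", 3: "z"}      # sends edge {2, 3} to the non-edge {x, z}
print(is_isomorphism(good, nodes_g, edges_g, nodes_h, edges_h),
      is_isomorphism(bad, nodes_g, edges_g, nodes_h, edges_h))
```

Checking one given function is easy; deciding whether any isomorphism exists may require trying many candidate functions, which is why the invariants discussed later in this section are useful.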





So formally graphs (a) and (c) in Example 1 are isomorphic using the function
f described as

1 ↔ z, 2 ↔ y, 3 ↔ x, 4 ↔ w, 5 ↔ v, 6 ↔ u

This is the correspondence we determined by our thought experiment with tacks and
strings. Table 7-2 gives an explicit list of the edges {a, b} in the first graph and the
corresponding edges {f(a), f(b)} in the second graph.

Explore more on the Web.

Practice Problem 1 The two graphs shown in Figure 7-44 are isomorphic, as can
be seen by the rule


lebe di3era dere, Sr cre f 


(a) Describe a thought experiment with tacks and string that would give this particular
isomorphism.

(b) In the style of Table 7-2, list the edges in the graph on the left and the corresponding
edges in the graph on the right for this particular isomorphism.





Table 7-2 Corresponding Edges for Graphs (a) and (c) in Example 1

Edges in graph (a) | {1, 2} | {2, 3} | {3, 4} | {4, 5} | {2, 5} | {5, 6}
Edges in graph (c) | {z, y} | {y, x} | {x, w} | {w, v} | {y, v} | {v, u}

Figure 7-44 Graphs for Practice Problem 1 (isomorphic graphs).

It is difficult, especially with large examples, to construct an explicit function
between two graphs to demonstrate they are isomorphic. It is easier to look for
evidence that the graphs are not isomorphic. If we cannot find such evidence, this
suggests (but does not prove) that the two graphs are the same. Here is a partial list
of things to look for when deciding if two graphs are really the same or not. The
proof of this proposition is based solely on the formal definition of isomorphism, but
it will not be presented here.





Proposition 1 Two graphs that are isomorphic to one another must have

1. The same number of nodes.
2. The same number of edges.
3. The same number of nodes of any given degree.
4. The same number of cycles.
5. The same number of cycles of any given size.


Example 2 The graphs G and H in Figure 7-45 have the same number of
nodes and the same number of edges, and all the nodes in each are degree 3. However,
the graphs are not isomorphic. Prove it.





SOLUTION. Graph H has cycles of length 3 while graph G does not. Infor- 
mally, this means that H cannot be redrawn to look like G. Formally, we can 
cite the last part of Proposition 1 to conclude that graphs G and H are not 
isomorphic. 
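The kind of evidence used in this solution can be gathered by a short program. The sketch below is only an illustration: since Figure 7-45 is not reproduced here, it uses two stand-in graphs chosen for this purpose (the complete bipartite graph on 3 + 3 nodes and a triangular prism), and the helper names `degree_sequence` and `count_triangles` are hypothetical.

```python
from collections import Counter
from itertools import combinations

def degree_sequence(nodes, edges):
    """List of node degrees, largest first."""
    deg = Counter({v: 0 for v in nodes})
    for a, b in edges:
        deg[a] += 1
        deg[b] += 1
    return sorted(deg.values(), reverse=True)

def count_triangles(nodes, edges):
    """Number of cycles of length 3 in a simple graph."""
    e = {frozenset(x) for x in edges}
    return sum(
        1
        for a, b, c in combinations(nodes, 3)
        if {frozenset((a, b)), frozenset((b, c)), frozenset((a, c))} <= e
    )

nodes = [1, 2, 3, 4, 5, 6]
# Stand-in graph 1: every node in {1, 2, 3} joined to every node in {4, 5, 6}.
k33_edges = [(a, b) for a in (1, 2, 3) for b in (4, 5, 6)]
# Stand-in graph 2: a triangular prism (two triangles joined by three edges).
prism_edges = [(1, 2), (2, 3), (1, 3), (4, 5), (5, 6), (4, 6),
               (1, 4), (2, 5), (3, 6)]

for name, edges in (("K33", k33_edges), ("prism", prism_edges)):
    print(name, len(edges), degree_sequence(nodes, edges),
          count_triangles(nodes, edges))
```

Both stand-in graphs have six nodes, nine edges, and every node of degree 3, yet only the prism contains cycles of length 3, so by the last part of Proposition 1 they cannot be isomorphic.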








Practice Problem 2 Find all isomorphic pairs among the graphs shown in Figure
7-46. If you decide a pair is not isomorphic, say how you know this. If you decide a
pair is isomorphic, give the isomorphism function.








Graph G Graph H 
Figure 7-45 Graphs for Example 2. 





Figure 7-47 Connect letters to numbers without crossing.

Figure 7-46 Graphs for Practice Problem 2.


Planar Graphs 


We have seen on more than one occasion that an appealing feature of graphs is the 
fact that their structure is based on a simple relationship between vertices and edges 
and not on precisely how the graph is drawn. However, there are times when the way 
a graph is drawn is extremely important. 





Example 3 (the utilities puzzle) Given three houses, is there any way
to connect them to their electricity, cable TV, and telephone companies without one
service's wires crossing over or under the others? You may assume that the houses are
arranged any way you like. It is easier to state as “Can one connect each of nodes 1, 2,
3 to each of nodes A, B, C in Figure 7-47 with (not necessarily straight) edges so that
no two edges cross?”


Of course, this new interest in the way a graph can be drawn comes with the 
realization that we do not yet have the tools to answer a question like this. So using 
the lessons of history as a guide, we will try to put the problem into as general a 
framework as possible. 





Definition

1. A simple, connected graph is called planar if there is a way to draw it (on a
plane) so that no edges cross (i.e., the only place two edges can meet is at a
node). We will call such a "drawing" of a graph on a plane surface with no
edge-crossings an embedding of the graph in the plane.

2. A graph is called bipartite if its set of nodes can be partitioned into two
disjoint sets $S_1$ and $S_2$ so that every edge in the graph has one endpoint in
$S_1$ and one endpoint in $S_2$.

3. The complete graph on n nodes, denoted by $K_n$, is the simple graph with
nodes {1, ..., n} and an edge between every pair of distinct nodes.

4. The complete bipartite graph on n, m nodes, denoted by $K_{n,m}$, is the simple
bipartite graph with nodes $S_1 = \{a_1, a_2, \ldots, a_n\}$ and $S_2 = \{b_1, b_2, \ldots, b_m\}$ and
with edges connecting each node in $S_1$ to every node in $S_2$.














Example 4 $K_4$ and $K_{3,2}$ are shown in Figure 7-48. Show that each is planar
by redrawing it with no edge-crossings.











Figure 7-48 $K_4$ and $K_{3,2}$.

SOLUTION To show a graph is planar, we need only draw its embedding in
the plane. Figure 7-49 shows embeddings of $K_4$ and $K_{3,2}$, respectively.


Using this terminology, the utilities puzzle is asking the question "Is $K_{3,3}$ planar?"
This question illustrates in a simplistic way another significant application of
graphs. When designing an electronic circuit with logic gates as we did in Section 3,
there is a set of objects that must be interconnected by "wires" in a specific way. It
is easy to see that this corresponds to a graph. There is a cost attached to creating a
“bridge” of one wire over another, so the number of edge-crossings should be minimized
when designing a circuit. The same sort of issue arises in the design of the
integrated circuits that make your computer work. We will not address the problem
of “minimizing edge-crossings” in this course, but to get a flavor of this area of mathematics,
we will discuss some applications of graphs with no edge-crossings, the planar
graphs.

The next proposition gives two particular examples of small nonplanar graphs.
We will devote the rest of this section to building a framework that will allow us to
show that these two graphs, along with many others, are not planar.





Proposition 2 The graphs $K_5$ and $K_{3,3}$ shown in Figure 7-50 are not planar.





Before getting to the proof of this proposition, we should point out that Kazimierz
Kuratowski (1896-1980) proved in 1930 that these are essentially the only
two nonplanar graphs! Of course, this is not strictly true since a larger graph containing
$K_5$ as a subgraph (for example, $K_6$ or $K_7$) cannot be planar since $K_5$ is not.
Kuratowski actually proved the following.





Figure 7-49 $K_4$ and $K_{3,2}$ are planar.

Figure 7-50 $K_5$ and $K_{3,3}$.

Theorem 3 (Kuratowski's Theorem) A graph G is planar if and only if
it contains no "copies" of $K_5$ or $K_{3,3}$ as subgraphs.

We will not go into what a "copy" of $K_{3,3}$ or $K_5$ is since it has no bearing on the
rest of the course, but it is not too hard to imagine. We will not prove this theorem
here. The proof involves some facts about the "topology" of the plane and is typically
studied in upper-level courses devoted to graph theory.





Practice Problem 3 One of the graphs shown in Figure 7-51 is planar and the
other is nonplanar. Which is which, and how do you know? For the planar graph, show
how to draw it in the plane with no edges crossing.


Euler’s Formula for Planar Graphs 


In order to find an easy proof of Proposition 2, we need to understand the structure 
of planar graphs in general. Once again, we find ourselves face to face with the work 
of Leonhard Euler, who made a discovery about solid geometry that turns out to be 
related to our study of planar graphs. 

The ancient Greeks knew that there were only five types of regular* polyhedra,
but their argument to prove this was based on the measure of the angles at
each vertex. Even though these and other solids had been studied in depth from
the time of the ancient Greeks, in the mid-eighteenth century Euler made a surprising







Graph $G_1$ Graph $G_2$

Figure 7-51 Graphs for Practice Problem 3.





* A polyhedron is regular if its faces are congruent polygons and every vertex is incident with the same 
number of edges. These regular polyhedra are often called the Platonic solids. 








Figure 7-52 The five Platonic solids.

discovery about a relationship between the number of vertices, edges, and faces of
polyhedra.

Example 5 Find the number of vertices, edges, and faces of each of the five
regular polyhedra shown in Figure 7-52.

SOLUTION Table 7-3 shows these values for each of the five Platonic
solids.


Euler observed that for each of these solids, the total number of faces and vertices 
is 2 more than the number of edges. According to [5], Euler wrote about this and 
other observations in a letter to Christian Goldbach in 1750, which concludes, 





I find it surprising that these general results in solid geometry have not previously
been noticed by anyone, so far as I am aware; and furthermore, that the
important ones ... are so difficult that I have not yet been able to prove them
in a satisfactory way.





For many years, Euler was unable to find a satisfactory proof of his observation
because he kept thinking of it as a geometry problem. In 1813, Augustin Cauchy
(1789-1857) published a paper in which he represented the polyhedra as what we 
now call a planar graph. In this setting, Cauchy was able to give an elegant proof. 
Once again, a mathematician looked beyond layers of irrelevant complexity to see 
the essential ingredients of a problem. And once again, the underlying structure can 
be studied using graph theory. 





























Solid        | Faces | Vertices | Edges
Tetrahedron  | 4     | 4        | 6
Cube         | 6     | 8        | 12
Octahedron   | 8     | 6        | 12
Dodecahedron | 12    | 20       | 30
Icosahedron  | 20    | 12       | 30

Table 7-3 Vertices, Edges, and Faces of the Platonic Solids
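Euler's observation about these five solids is easy to confirm by direct arithmetic. The following short sketch simply re-enters the standard face, vertex, and edge counts of the Platonic solids and tests whether faces plus vertices equals edges plus 2; the names `platonic` and `satisfies_euler` are illustrative only.

```python
# (faces, vertices, edges) for each Platonic solid
platonic = {
    "Tetrahedron": (4, 4, 6),
    "Cube": (6, 8, 12),
    "Octahedron": (8, 6, 12),
    "Dodecahedron": (12, 20, 30),
    "Icosahedron": (20, 12, 30),
}

def satisfies_euler(faces, vertices, edges):
    """True when faces + vertices is exactly 2 more than edges."""
    return faces + vertices == edges + 2

results = {name: satisfies_euler(*counts) for name, counts in platonic.items()}
for name, ok in results.items():
    print(name, ok)
```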













Figure 7-53 The planar graph of the dodecahedron, 


Example 6 Create a graph for the dodecahedron where the vertices and edges
for the graph are the vertices and edges (respectively) of the solid, but so that the edges
do not cross.

SOLUTION Once again, imagine that the vertices and edges are made of
tacks and elastic string. In Figure 7-53, if you pick one face of the dodecahedron
on the left and stretch it out, all the other vertices and edges will lie in the interior
as shown in the figure on the right.


Practice Problem 4 Create a graph for the octahedron where the vertices and 
edges for the graph are the vertices and edges (respectively) of the solid, but so that the 
edges do not cross. 


Once this transformation is made, we have a planar graph to work with. Cauchy 
proved that Euler's formula works on the transformed pictures in the plane, and so 
it must work for the original solids. In order to give a proof for any planar graph, 
we must make sure that the notion of a “face” is carefully defined, since not every 
planar graph arises from transforming a polyhedron. 





Definition For a planar graph G embedded in the plane, a face of the
graph is a region of the plane created by the drawing. Since the plane is an
unbounded surface, every embedding of a finite planar graph will have exactly
one unbounded face.





It is easy to forget about the unbounded face in a planar graph. In Example 6, it 
might have bothered you if you counted the faces in the planar graph representation 
and only came up with 11 faces. The cycle 16, 17, 18, 19, 20, 16 is the boundary for 
the unbounded face in this embedding of the graph. 





Example 7 In Figure 7-54, the same planar graph is drawn in two
different ways. For each drawing, identify the faces by giving the cycle that creates each
face, and highlight the unbounded face. Note that the number of faces is the same in
each case, but the cycles that border the faces are different.











Figure 7-54 Graphs for Example 7.

SOLUTION We describe each face by giving the cycle that "borders" it in
Table 7-4.


It is not at all obvious that the number of faces of a planar graph does not depend
on a particular embedding of the graph in the plane. Euler's formula establishes that
the number of faces is determined by the number of edges and vertices, which are
obviously not dependent on how the graph is drawn.

To prove Euler's theorem, we need a couple of facts about embeddings of graphs
in the plane. These are the results that depend on the structure of the plane, which
we will not prove here. See Exercise 18 to see the role of a "plane surface" in these
facts that seem so obvious.











Lemma 4 Given an embedding of a graph in the plane, 


1. Removing any edge from a cycle in the graph decreases by 1 the number of faces 
in the graph; and 
2. Every edge on a cycle in the graph borders exactly two faces. 


Theorem 5 (Euler's Formula for Planar Graphs) For any connected 
planar graph G embedded in the plane with V vertices, E edges, and F faces, it must 
be the case that 


$$V + F = E + 2$$


Face      | Left drawing | Right drawing
Bounded   | 1,2,4,1      | 1,2,4,1
Bounded   | 1,3,5,1      | 1,3,5,1
Bounded   | 2,3,4,2      | 2,3,4,2
Bounded   | 1,2,3,1      | 1,3,4,1
Unbounded | 1,4,3,5,1    | 1,2,3,5,1

Table 7-4 Cycles Bordering Faces in Example 7
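The face lists in Table 7-4 can be used to check the second part of Lemma 4 and Euler's formula numerically. The sketch below is only an illustration under stated assumptions: the cycles are entered as we read them from the table (without repeating the starting vertex), every edge of this graph lies on a cycle, and the helper names `cycle_edges` and `check_embedding` are hypothetical.

```python
from collections import Counter

def cycle_edges(cycle):
    """Edges (as endpoint sets) along a cycle given by its vertices in order."""
    n = len(cycle)
    return [frozenset((cycle[i], cycle[(i + 1) % n])) for i in range(n)]

def check_embedding(faces):
    """Return (every edge borders exactly two faces, V + F == E + 2)."""
    counts = Counter(e for face in faces for e in cycle_edges(face))
    vertices = {v for face in faces for v in face}
    V, E, F = len(vertices), len(counts), len(faces)
    return all(c == 2 for c in counts.values()), V + F == E + 2

# Face cycles as read from Table 7-4; the last entry is the unbounded face.
left = [(1, 2, 4), (1, 3, 5), (2, 3, 4), (1, 2, 3), (1, 4, 3, 5)]
right = [(1, 2, 4), (1, 3, 5), (2, 3, 4), (1, 3, 4), (1, 2, 3, 5)]

print(check_embedding(left))
print(check_embedding(right))
```

Both drawings describe the same eight edges on five vertices, and in each drawing the five faces account for every edge exactly twice, which is the counting idea behind Lemma 6 later in this section.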





PROOF (By induction.) Let P(n) be the statement “For every embedding of
a connected planar graph with n edges, V vertices, and F faces, V + F = n + 2.”
The first statement P(0) concerns connected graphs with no edges, of which the
only example is the graph consisting of a single vertex.

In this graph there is one vertex, 0 edges, and one (unbounded) face. Since the
equation 1 + 1 = 0 + 2 is true, we can conclude that statement P(0) is true.

Now let m ≥ 1 be given such that P(m) is the first statement yet to be checked.
Let G be a connected planar graph with m edges and V vertices with F faces as
embedded in the plane. Either G has a cycle or it does not, so we will argue in two
cases.


Case 1. If G has no cycles, then G is a tree. Since there are no cycles, G has only the
one (unbounded) face. By Theorem 7 of Section 7.2, V = m + 1. Hence, we
have V + F − 2 = (m + 1) + 1 − 2 = m. Thus, statement P(m) is true in this
case.

Case 2. If G has a cycle, then form the graph G' by removing an edge from this
cycle. By Proposition 3 of Section 7.2, G' is still connected. Clearly, the number
of edges in G' is m − 1, and by Lemma 4, the number of faces in G' is F − 1.
Using the fact that statement P(m − 1) has already been checked to be true,
we know that the formula in statement P(m − 1) can be applied to graph G'.
Specifically,

$$V + (F - 1) = (m - 1) + 2$$

Adding 1 to both sides makes this equation say, “V + F = m + 2,” which is
statement P(m).


This result implies that the number of faces of a planar graph does not depend 
on the particular embedding of that graph in a plane! Hence, we can unambiguously 
talk about the “number of faces of a planar graph” without mentioning the way the 
graph is drawn. 


The Utilities Puzzle Resolved 


After a considerable digression, we now have enough tools to prove that $K_{3,3}$ is not a
planar graph and thereby establish that the utilities puzzle has no solution. Because
Euler's formula provides a relationship between the number of faces and the number
of edges in a planar graph, we can use it to show that some graphs have too many
edges to be planar. This is an effective tool for graphs, like $K_{3,3}$, for which every edge
is on a cycle.








Definition Let G be a planar graph in which every edge is on a cycle, 
and let f be a face created by an embedding of G in the plane. The size of 
f is defined to be the number of edges on the cycle in G that makes up the 
boundary of f. 










Lemma 6 In any planar graph G in which every edge is on a cycle, the sum of the 
face sizes is twice the number of edges in the graph. 


PROOF This follows directly from the second part of Lemma 4 in a manner very
similar to the proof of Theorem 3 of Section 7.1.

Theorem 7 In any planar graph G in which every edge is on a cycle, if k is the
size of the smallest cycle, then the number of edges in G is at least k/2 times the number
of faces in G.

PROOF If the smallest cycle has length k, then when we sum the face sizes, we
get a result that is at least k · F, where F is the number of faces in G. By Lemma
6, the sum of the face sizes is 2m, where m is the number of edges in G. Putting
together these two facts yields the inequality

$$2m \ge k \cdot F$$

from which the desired result follows.


Proposition 2 (revisited) The graphs $K_5$ and $K_{3,3}$ are not planar.

PROOF We will show that $K_{3,3}$ (see Figure 7-55) is not planar, and leave $K_5$ for
the practice problem below. We will argue by contradiction, assuming that $K_{3,3}$ is
planar and deriving a contradictory statement from this assumption. To use the
tools available to us, we need to take a quick inventory of the structure of $K_{3,3}$:

Figure 7-55 $K_{3,3}$ is not planar.

• $K_{3,3}$ has six vertices and nine edges.
• Every edge in $K_{3,3}$ is on a cycle.
• The shortest cycle in $K_{3,3}$ has length 4.

If $K_{3,3}$ is planar, then by Euler's formula, it must have exactly five faces. Since
the smallest length of a cycle in $K_{3,3}$ is 4, then by Theorem 7, the number of edges
is at least 2 times the number of faces. That is, 9 ≥ 2 · 5. Since this false statement
logically follows from the assumption that $K_{3,3}$ is planar, we can conclude that
$K_{3,3}$ is not planar.
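The arithmetic in this argument can be packaged as a small test. The sketch below is an illustration rather than a planarity algorithm: the function name `passes_edge_face_test` is hypothetical, it assumes a connected graph in which every edge is on a cycle, and a result of True does not prove that a graph is planar. It only reports whether Euler's formula together with Theorem 7 leads to a contradiction.

```python
def passes_edge_face_test(V, E, k):
    """Necessary condition for planarity from Euler's formula and Theorem 7.

    V, E: numbers of vertices and edges; k: length of the shortest cycle.
    Assumes the graph is connected and every edge is on a cycle.
    """
    F = E - V + 2          # number of faces forced by V + F = E + 2
    return 2 * E >= k * F  # Theorem 7: the sum of face sizes 2E is at least k*F

# Complete bipartite graph on 3 + 3 nodes: 6 vertices, 9 edges, shortest cycle 4.
print(passes_edge_face_test(6, 9, 4))
# Complete graph on 5 nodes: 5 vertices, 10 edges, shortest cycle 3.
print(passes_edge_face_test(5, 10, 3))
# Complete graph on 4 nodes: 4 vertices, 6 edges, shortest cycle 3.
print(passes_edge_face_test(4, 6, 3))
```

The first two calls fail the test, matching Proposition 2, while the complete graph on four nodes passes, which is consistent with the planar drawing found in Example 4.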








Practice Problem 5 Complete the proof of Proposition 2 by explaining why $K_5$
is not planar.


Solutions to Practice Problems 





1 (a) Exchange tack 2 with tack 5.

(b)
Left graph:  {1,3} {1,5} {2,4} {2,5} {2,6} {3,5} {4,6}
Right graph: {b,a} {b,c} {d,e} {d,c} {d,f} {a,c} {e,f}

2 Nonmatching edge count shows these pairs are not isomorphic:
(a) and (b), (a) and (e), (b) and (c), (c) and
(d), (c) and (e). Other isomorphisms are ruled out using
degrees. Graph (a) has a node of degree 3 and (c) does
not. (d) has a node of degree 1 and (b) does not. (d) has a
node of degree 1 and (e) does not. The only possibilities
left are (a) to (d) and (b) to (e). We find that (a) and (d)
are isomorphic using the correspondence 1 ↔ 1, 2 ↔ 4,
3 ↔ 3, 4 ↔ 2. Graphs (b) and (e) are isomorphic using
the correspondence 1 ↔ 2, 2 ↔ 1, 3 ↔ 3, 4 ↔ 4.

3 In Figure 7-51, the second graph is nonplanar since it
contains a copy of $K_{3,3}$. The first graph is planar, and can
be redrawn without edge-crossings as shown in Figure
7-56.





Figure 7-56 Solution to Practice Problem 3.

4 The graph is shown in Figure 7-57.

Figure 7-57 Solution to Practice Problem 4.

5 Proof by contradiction. Suppose $K_5$ is planar. Since $K_5$
has 5 vertices and 10 edges, Euler's formula tells us that
$K_5$ will have seven faces when embedded in the plane.
However, every edge in $K_5$ is on a cycle and the shortest
cycle in $K_5$ has length 3, so by Theorem 7, the number of
edges in $K_5$ will be at least 3/2 times the number of faces.
This means that

$$10 \ge \frac{3}{2} \cdot 7 = 10.5$$

which is clearly false. Hence, the original assumption
must be wrong, so $K_5$ is not planar.

Exercises for Section 7.3

1. See Figure 7-58. Explain why graphs $G_1$ and $G_2$ are not
isomorphic. Explain why graphs $H_1$ and $H_2$ are not isomorphic.

(a) Graph $G_1$ Graph $G_2$
(b) Graph $H_1$ Graph $H_2$

Figure 7-58 Graphs for Exercise 1.


2. Explain why graphs (a) and (b) in Example 1 are not
isomorphic.

3. Two of the graphs shown in Figure 7-59 are isomorphic
to each other. Which ones?

4. The degree sequence of a graph is the list of degrees
of the nodes of the graph, listed from largest degree to
smallest. For example, the degree sequence of the graph
$G_2$ shown above is 5, 4, 3, 3, 3, 3, 2, 1.

(a) Construct two connected, simple graphs with degree
sequence 3, 3, 2, 2, 2, and explain why your two
graphs are not isomorphic.
(b) Construct two connected, simple graphs with de- 
Breesequence3, 2, ind explain why your two 
graphs are not isomorphic. 

(c) Construct two connected, simple graphs with degree
sequence 3, 3, 2, 2, 2, 2, and explain why your
two graphs are not isomorphic.

5. Find two simple graphs, each with six nodes, with every
node of degree 3, but not isomorphic to each other.
(Hint: Use $K_{3,3}$ as one of your graphs, and note that
$K_{3,3}$ has no cycles of length 3.)

6. Give two examples of a simple, connected graph with
eight nodes, each node having degree 4, that are not
isomorphic to each other (and explain why your graphs
are not isomorphic). (Hint: Make one graph planar
and the other nonplanar.)


















Figure 7-59 Graphs for Problem 3. 


7. Give three examples of a simple, connected graph with
10 nodes, each node having degree 5. Explain why no
two are isomorphic.

8. Show that each of five houses can be connected to two
utilities without lines crossing. How many houses would
be too many to connect each to two utilities without
lines crossing? Explain your answer completely.

9. In the induction proof of Theorem 5, we used the
statement P(n): "For every embedding of a connected
planar graph with n edges, V vertices, and F faces,
V + F = n + 2." Verify statements P(1), P(2), P(3), and
P(4) by drawing every possible graph satisfying the hypothesis
and showing that the conclusion is true in each
case.

10. Show that the two graphs shown in Figure 7-60 are
planar by redrawing each in the plane with no edge-crossings.

11. Use Theorem 7 and Euler's formula to prove that
each of the graphs shown in Figure 7-61 is not planar.
(Note: The first graph is a cube with every face diagonal
added, the second is a cube with every internal
diagonal added, and the last graph is called the Petersen
graph, which arises in many different contexts in graph
theory.)









Figure 7-60 Graphs for Problem 10. 





12. Decide if each of the graphs in Figure 7-62 is planar
or nonplanar. If it is planar, draw an embedding of the
graph in the plane. If it is nonplanar, explain how you
know.

13. For each of the following families of nonregular polyhedra,
find the number of vertices, edges, and faces, and
verify that Euler's formula holds for each of them. (Examples
of each are given in Figure 7-63.)

(a) A prism whose base is a regular k-sided polygon
and whose sides are squares. (The first example in
the figure shows this prism with k = 6.)

(b) An “antiprism” whose top and bottom are regular
k-sided polygons and whose sides are alternating
isosceles triangles. (The middle example in the figure
shows this antiprism with k = 6.)

(c) A pyramid whose base is a regular k-sided polygon.
(The third example in the figure shows this pyramid
with k = 4.)

14. For each of the three sample polyhedra in the previous
problem, draw the planar graph representation.

15. Let G be a simple, connected planar graph in which every
edge is on a cycle. Prove that if G has n vertices and
m edges, then

$$m \le 3n - 6$$

(Hint: The smallest cycle a simple graph can have has
length 3.)

16. Let G be a simple, connected planar graph in which every
edge is on a cycle. Prove that if every vertex of G
has degree d and every face is bound by b edges, then

$$b \cdot F = 2 \cdot E = d \cdot V$$

and so Euler's formula tells us that

$$\frac{2}{d} + \frac{2}{b} - 1 = \frac{2}{E}$$

Show that there are only five possible choices of d ≥ 3
and b ≥ 3 for which 2/d + 2/b − 1 has a positive value. Decide
which choice of (d, b) is associated with each of the
five Platonic solids.






Figure 7-61 Graphs for Problem 11.

Figure 7-62 Graphs for Problem 12.

Figure 7-63 Examples for Problem 13.





17. Demonstrate that the utilities puzzle can be solved if
you lived on the surface of a doughnut-shaped planet.

18. Demonstrate that each fact in Lemma 4 is false for
graphs drawn on the surface of a doughnut.


7.4 Connections to Matrices and Relations 





In the first three sections of this chapter, we have represented graphs primarily as
pictures, using points to indicate the vertices and arcs for the edges. This is certainly
appropriate considering the origins of the subject and the types of applications we
have studied. One big drawback to this representation is that it does not lend itself to
manipulation by a computer. In this section, we consider a particular representation
that is more amenable to the use of a computer as a tool for answering questions
about graphs. In the process, we shall discover some interesting connections with
other topics in discrete mathematics.


Adjacency Matrices 


For any graph, we can store information about the number of edges connecting each
pair of vertices in a rectangular grid by using the rule “In row i, column j of the grid,
write the number of edges connecting node i and node j.”

Consider the example shown in Figure 7-64. In row 1 of the grid, we place a
1 in column 2 since there is one edge connecting nodes 1 and 2, and we place a
2 in column 5 since there are two edges connecting nodes 1 and 5. Of course, we
place zeros in columns 1, 3, and 4, because there are no edges joining node 1 to
nodes 1, 3, or 4. The following definition formalizes this idea and introduces some
notation.











Definition Given a graph G with vertex set $V = \{v_1, v_2, \ldots, v_n\}$ and edge
set E, we define the adjacency matrix of G as follows. The matrix M is an n × n
array of natural numbers, which we imagine having rows and columns labeled
by the vertices $v_1, v_2, \ldots, v_n$ in order.

The entry in row i, column j (referred to as the (i, j)-entry of M or,
more concisely, $M_{ij}$) is defined

$$M_{ij} = \text{the number of edges connecting } v_i \text{ and } v_j \text{ in } G$$











The name “adjacency matrix” comes from the fact that a pair of vertices connected
by an edge are said to be adjacent. A subtle feature of the definition is that
the adjacency matrix for a graph depends on how the nodes have been labeled, so it
is a good practice to specify the labeling of vertices in addition to the corresponding
matrix, especially as you try to compare answers with others.











          Columns 1  2  3  4  5
Row 1 →           0  1  0  0  2
Row 2 →           1  0  1  1  0
Row 3 →           0  1  0  0  0
Row 4 →           0  1  0  1  0
Row 5 →           2  0  0  0  0

Figure 7-64 The adjacency matrix of a graph.
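The rule for filling in the grid translates directly into a few lines of code. This is a minimal sketch under stated assumptions: the edge list is our reading of the graph in Figure 7-64 (one edge between 1 and 2, two edges between 1 and 5, edges from 2 to 3 and 2 to 4, and a loop at node 4), a loop is counted once in its diagonal entry to match the matrix as printed, and the function name `adjacency_matrix` is illustrative.

```python
def adjacency_matrix(n, edges):
    """Adjacency matrix of an undirected graph on nodes 1..n.

    edges is a list of (i, j) pairs; repeated pairs are parallel edges.
    """
    M = [[0] * n for _ in range(n)]
    for i, j in edges:
        M[i - 1][j - 1] += 1
        if i != j:               # an edge joins i to j and also j to i
            M[j - 1][i - 1] += 1
    return M

# Edge list as read from Figure 7-64 (an assumption of this sketch).
edges = [(1, 2), (1, 5), (1, 5), (2, 3), (2, 4), (4, 4)]
M = adjacency_matrix(5, edges)
for row in M:
    print(row)
```

For an ordinary (undirected) graph the result is always symmetric, since an edge connecting node i and node j also connects node j and node i.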








Graph H Graph G
Figure 7-66 The graphs for Practice Problem 1.

Example 1 Figure 7-65 shows a graph with vertex set {a, b, c, d, e}. Use the
"obvious" labeling strategy (a as $v_1$, b as $v_2$, etc.) to form the adjacency matrix of this
graph.

SOLUTION We simply let $M_{ij}$ represent the number of edges joining vertices
i and j, resulting in the matrix shown below. Note that the labels on the rows
and columns help us keep track of which row and column correspond to which
vertex; they are not part of the adjacency matrix itself.











Figure 7-65 The 
graph for Example 1 abede 
030020 


aanse 
-2 





Practice Problem 1 Give the adjacency matrices for the two graphs shown in 
Figure 7-66. For each matrix, identify the values of My, My, Mas, and Mas. 


Directed Graphs and Matrix Multiplication 


The most important generalization for the remainder of this section concerns directed
graphs. We saw enough in Section 7.1 to know that when drawing a picture of a
directed graph, we use arrows rather than line segments to draw the edges. Even
in this case, we can still use an adjacency matrix to store information about those
edges. We think of $M_{ij}$ as indicating the number of arrows from vertex i to vertex j.
Of course, if there are no parallel edges, this count will be either 0 or 1.

We begin by giving a careful definition of a directed graph and its adjacency
matrix.











Definition

1. A directed graph, like a graph, consists of a set V of vertices and a set E
of edges. Each edge is associated with an ordered pair of vertices called its
endpoints. In other words, a directed graph is the same as a graph, but the
edges are described as ordered pairs rather than unordered pairs.

2. If the endpoints for edge e are a and b in that order, we say e is an edge from
a to b, and in the diagram we draw the edge as a straight or curved arrow
from a to b.

3. For a directed graph, we use (a, b) rather than {a, b} to indicate an edge
from a to b. This emphasizes that the edge is an ordered pair, by utilizing
the usual notation for ordered pairs.

4. A walk in a directed graph is a sequence $v_1 e_1 v_2 e_2 \ldots v_n e_n v_{n+1}$ of alternating
vertices and edges that begins and ends with a vertex, and where each edge
in the list lies between its endpoints in the proper order. (That is, $e_1$ is an
edge from $v_1$ to $v_2$, $e_2$ is an edge from $v_2$ to $v_3$, and so on.) If there is no
chance of confusion, we omit the edges when we describe a walk.

5. The adjacency matrix for a directed graph with vertices $\{v_1, v_2, \ldots, v_n\}$ is the
n × n matrix where $M_{ij}$ (the entry in row i, column j) is the number of edges
from vertex $v_i$ to vertex $v_j$.














Example 2 In Section 7.5 we will see that graphs can be used to represent
games. As a simple example, consider a two-player game where there is a single pile of
10 stones and each player may remove one or two stones at a time on his or her turn.
The graph given in Figure 7-67 models this game. The labels on the nodes represent
the number of stones left at the end of a turn, and each edge represents a legal move.
For example, the existence of the edge from node 6 to node 4 indicates there is a legal
move that takes the pile from six stones to four stones.
Write the set of edges in set-builder notation. Give a walk of length 1 in the graph.
Give a walk of length 7 from 10 to 0 in the graph.

SOLUTION The set of vertices is V = {0, 1, 2, 3, ..., 10} and the set of edges
is

$$E = \{(x, y) \in V \times V : x = y + 1 \text{ or } x = y + 2\}$$

That is, the set of edges is

$$E = \{(10,9), (10,8), (9,8), (9,7), \ldots, (3,2), (3,1), (2,1), (2,0), (1,0)\}$$

A walk of length 1 just corresponds to an edge like, for example, 9, 7.

We can describe one possible walk of length 7, listing only the vertices, as
10, 8, 7, 5, 4, 3, 1, 0. This walk corresponds to a sequence of moves making up
a complete game.
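The set-builder description of E can be written almost word for word as a set comprehension. The sketch below builds the edge set for the 10-stone game and checks that the walk given in the solution really follows legal moves; the helper name `is_walk` is an illustrative choice, not notation from the text.

```python
V = range(11)  # vertices 0, 1, ..., 10 (stones left in the pile)

# Edge (x, y) means a legal move from x stones to y stones.
E = {(x, y) for x in V for y in V if x == y + 1 or x == y + 2}

def is_walk(vertices, edges):
    """True if each consecutive pair of vertices is a directed edge."""
    return all((a, b) in edges for a, b in zip(vertices, vertices[1:]))

walk = [10, 8, 7, 5, 4, 3, 1, 0]
print(len(E))               # number of legal moves in the game graph
print(is_walk(walk, E))     # the walk from the solution
print(len(walk) - 1)        # its length (number of edges used)
print(is_walk([7, 9], E))   # moves cannot add stones back to the pile
```

The last line illustrates why direction matters here: (9, 7) is an edge but (7, 9) is not.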





Figure 7-67 The graph for Example 2.

Practice Problem 2 Consider the graph of Example 2.

(a) Explain why using a directed graph is important for modeling the game.
(b) If you label the vertices in the order 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 0 (so that row 1 and
column 1 correspond to vertex 1, row and column 2 to vertex 2, ..., row and
column 11 to vertex 0), give the value of these entries of the adjacency matrix: Msz.
Mss, Maa, Mas, Mu, Mis. Ms- 
(c) Explain why Mss ≠ Mas for the adjacency matrix. Could that happen for an
ordinary graph?

The representation of directed graphs with matrices is not just for bookkeeping
purposes. We will see that there are many properties of directed graphs that can be
formally checked using mathematical operations on the corresponding matrices. To
do this, we need to first figure out the meaning of specific matrix operations for the
underlying graph. For example, given a matrix M for a graph G, what does $M^2$ tell
us about G? We will take the investigation of this question as our starting point for
the discussion. The student who wishes to review some basics of matrix operations
should consult Appendix B of the text before reading on.





Example 3 Consider the directed graph G in Figure 7-68 with vertices
{1, 2, 3, 4, 5, 6}, and its adjacency matrix M. Look carefully at the way the product
$M \cdot M = M^2$ is defined, and see what it tells us about G.

SOLUTION We know, for example, that the entry in row 2, column 3 of
M · M comes from multiplying row 2 of M by column 3 of M. That is,

$$\begin{bmatrix} 1 & 0 & 0 & 0 & 1 & 0 \end{bmatrix} \cdot \begin{bmatrix} 1 \\ 0 \\ 0 \\ 0 \\ 1 \\ 1 \end{bmatrix} = (1)(1) + (0)(0) + (0)(0) + (0)(0) + (1)(1) + (0)(1) = 2$$

So we get the answer 2 as the row 2, column 3 entry of $M^2$ specifically because
of the two (1)(1) terms in the above sum. We analyze each in turn:

• The first (1)(1) term comes from the product $M_{21} \cdot M_{13}$, so it corresponds
to the fact that in M, both the row 2, column 1 entry ($M_{21}$) and the row 1,
column 3 entry ($M_{13}$) are 1's. This means there is an edge from 2 to 1, and
also an edge from 1 to 3. So there is the length-2 walk 2, 1, 3 from node 2 to
node 3.

$$M = \begin{bmatrix} 0 & 0 & 1 & 1 & 0 & 0 \\ 1 & 0 & 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 1 & 0 & 0 \\ 0 & 1 & 0 & 0 & 1 & 1 \\ 1 & 0 & 1 & 0 & 0 & 1 \\ 1 & 1 & 1 & 0 & 0 & 0 \end{bmatrix}$$

Figure 7-68 The graph for Example 3.





a MERO aa Ct 


74 Connections to Matrices and Relations 551 Í 


• Similarly, the second (1)(1) term comes from the product $M_{25} \cdot M_{53}$, so it
corresponds to the fact that both $M_{25}$ and $M_{53}$ are 1. This means there is
the length-2 walk 2, 5, 3 from node 2 to node 3.


It is easy to check by inspecting the graph that, in fact, these are the only
length-2 walks from 2 to 3. Hence, we conjecture that the row i, column j entry
in $M^2$ actually counts the number of walks from node i to node j that are exactly
of length 2 (i.e., which use two edges).
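The conjecture can be checked mechanically. The following is a minimal sketch, assuming the adjacency matrix of Figure 7-68 is transcribed correctly below; the helper names `mat_mul` and `length2_walks` are illustrative and do not come from the text.

```python
# Adjacency matrix M of the directed graph in Example 3, as transcribed
# from Figure 7-68 (row i, column j is 1 when there is an edge i -> j).
M = [
    [0, 0, 1, 1, 0, 0],
    [1, 0, 0, 0, 1, 0],
    [0, 1, 0, 1, 0, 0],
    [0, 1, 0, 0, 1, 1],
    [1, 0, 1, 0, 0, 1],
    [1, 1, 1, 0, 0, 0],
]


def mat_mul(A, B):
    """Ordinary matrix product of two square matrices given as lists of rows."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def length2_walks(M, i, j):
    """List the walks i, b, j (1-based labels) in a graph with 0/1 entries."""
    n = len(M)
    return [(i, b, j) for b in range(1, n + 1)
            if M[i - 1][b - 1] == 1 and M[b - 1][j - 1] == 1]


M2 = mat_mul(M, M)
entry = M2[2 - 1][3 - 1]        # row 2, column 3 of M^2
walks = length2_walks(M, 2, 3)  # the walks that this entry should count
print(entry, walks)
```

Comparing `M2[i-1][j-1]` with `len(length2_walks(M, i, j))` for every pair of nodes gives a brute-force check of the conjecture on this one graph; it is evidence, not a proof.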


Practice Problem 3 There were no parallel edges in the previous example. The
directed graph whose adjacency matrix appears below extends the graph in Example
3. Draw in the additional edges, and use this picture to decide if the entry in row 2,
column 3 of $M \cdot M$ still counts the number of length-2 walks from node 2 to node 3:

\[
M = \begin{bmatrix}
0 & 0 & 4 & 1 & 0 & 0 \\
2 & 0 & 0 & 0 & 3 & 0 \\
0 & 1 & 0 & 1 & 0 & 0 \\
0 & 1 & 0 & 0 & 1 & 1 \\
1 & 0 & 1 & 0 & 0 & 1 \\
1 & 1 & 5 & 0 & 0 & 0
\end{bmatrix}
\]


Careful attention to the interpretation of matrix multiplication in terms of the
adjacency matrix for a directed graph G allows us to prove the following theorem.
The proof uses induction on the length of the walks. The details are left as an exercise
at the end of this section.


Theorem 1 Let M be the adjacency matrix of a directed graph G with vertex set
{1, 2, 3, ..., m}. The row i, column j entry of $M^k$ counts the number of k-step walks
from node i to node j in the graph G. Consequently, the row i, column j entry of

\[ M + M^2 + M^3 + \cdots + M^k \]

counts the number of walks from node i to node j of length 1 or 2 or 3 or ... or k, that
is, nontrivial walks of length k or less.


PROOF See Exercise 13 at the end of this section.


Example 4 This problem revisits the directed graph of Example 3, repeated
in Figure 7-69. Compute matrices M, $M^2$, $M^3$, and $M + M^2 + M^3$, and use them to
answer the following questions about the graph G:


1. How many walks of length 3 are there from node 5 to node 3? 
2. How many walks are there from node 5 to node 3 of length less than or equal to 3? 
3. Give a complete list of those walks. 





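Figure 7-69 The graph for Example 4.

SOLUTION

\[
M = \begin{bmatrix}
0 & 0 & 1 & 1 & 0 & 0 \\
1 & 0 & 0 & 0 & 1 & 0 \\
0 & 1 & 0 & 1 & 0 & 0 \\
0 & 1 & 0 & 0 & 1 & 1 \\
1 & 0 & 1 & 0 & 0 & 1 \\
1 & 1 & 1 & 0 & 0 & 0
\end{bmatrix}
\quad
M^2 = \begin{bmatrix}
0 & 2 & 0 & 1 & 1 & 1 \\
1 & 0 & 2 & 1 & 0 & 1 \\
1 & 1 & 0 & 0 & 2 & 1 \\
3 & 1 & 2 & 0 & 1 & 1 \\
1 & 2 & 2 & 2 & 0 & 0 \\
1 & 1 & 1 & 2 & 1 & 0
\end{bmatrix}
\quad
M^3 = \begin{bmatrix}
4 & 2 & 2 & 0 & 3 & 2 \\
1 & 4 & 2 & 3 & 1 & 1 \\
4 & 1 & 4 & 1 & 1 & 2 \\
3 & 3 & 5 & 5 & 1 & 1 \\
2 & 4 & 1 & 3 & 4 & 2 \\
2 & 3 & 2 & 2 & 3 & 3
\end{bmatrix}
\]

\[
M + M^2 + M^3 = \begin{bmatrix}
4 & 4 & 3 & 2 & 4 & 3 \\
3 & 4 & 4 & 4 & 2 & 2 \\
5 & 3 & 4 & 2 & 3 & 3 \\
6 & 5 & 7 & 5 & 3 & 3 \\
4 & 6 & 4 & 5 & 4 & 3 \\
4 & 5 & 4 & 4 & 4 & 3
\end{bmatrix}
\]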


1. Since $(M^3)_{53} = 1$, there is only one walk of length 3 from node 5 to node 3.

2. Since the row 5, column 3 entry in $M + M^2 + M^3$ is 4, there are four walks
from node 5 to node 3 of length 3 or less in G.

3. The walk of length 1 is 5, 3; the length-2 walks are 5, 6, 3 and 5, 1, 3; and the
length-3 walk is 5, 6, 1, 3.
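The three answers above can be reproduced with a short script. This is a sketch under the assumption that the matrix below matches Figure 7-69; the functions `mat_mul`, `mat_add`, and `walks` are hypothetical helpers written for this illustration.

```python
# Adjacency matrix of the directed graph of Examples 3 and 4 (transcribed).
M = [
    [0, 0, 1, 1, 0, 0],
    [1, 0, 0, 0, 1, 0],
    [0, 1, 0, 1, 0, 0],
    [0, 1, 0, 0, 1, 1],
    [1, 0, 1, 0, 0, 1],
    [1, 1, 1, 0, 0, 0],
]


def mat_mul(A, B):
    """Ordinary matrix product of two square matrices."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def mat_add(A, B):
    """Entrywise sum of two matrices of the same size."""
    return [[a + b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]


def walks(M, start, end, length):
    """Enumerate walks with exactly `length` edges (1-based labels, 0/1 matrix)."""
    if length == 0:
        return [(start,)] if start == end else []
    found = []
    for nxt in range(1, len(M) + 1):
        if M[start - 1][nxt - 1]:
            for tail in walks(M, nxt, end, length - 1):
                found.append((start,) + tail)
    return found


M2 = mat_mul(M, M)
M3 = mat_mul(M2, M)
total = mat_add(mat_add(M, M2), M3)

count_len3 = M3[5 - 1][3 - 1]      # question 1
count_upto3 = total[5 - 1][3 - 1]  # question 2
listed = [w for k in (1, 2, 3) for w in walks(M, 5, 3, k)]  # question 3
print(count_len3, count_upto3, listed)
```

The recursive enumeration is exponential in the walk length, so it is only useful as a cross-check on small graphs; the matrix powers are what scale.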


Practice Problem 4 Answer the three questions in Example 4 for walks from 
node 6 to node 2. Answer the same three questions for walks from node 1 to node 1. 


Connections to Binary Relations 


One of the most fascinating things about mathematics is that ideas that seem to be 
unrelated turn out to have connections between them, and that those connections 
sometimes make hard problems easier to solve. In this brief subsection, we examine
the connections among directed graphs, adjacency matrices, and the binary relations
we studied in Chapter 4.

Recall from Section 4.1 that a binary relation on a set A is simply a subset of
A × A. Compare this to the notion of a directed graph, consisting of a set V of vertices
and a set E of edges, where each edge is associated with an ordered pair of vertices.
Provided there are no parallel edges in the graph, each edge in E can be thought of
as the ordered pair $(v_1, v_2)$, where $v_1$ and $v_2$ are the endpoints of the edge. In this
case, the edge set of a directed graph with no parallel edges is a binary relation on the
vertex set!

Because of this connection between directed graphs and binary relations, we 
can extend some of our terminology. 





Definition


• Given a relation R on a set A, we can form the directed graph (A, R). We
will refer to this as the graph of the relation R. Note that the picture of such a
graph is exactly what we called the arrow diagram of the relation in Chapter 4.

• The adjacency matrix for a relation is simply the adjacency matrix for the
corresponding directed graph. Note that the adjacency matrix for a relation
has only 0's and 1's as entries.











This correspondence gives us three ways to think of relations. First, a relation 
is a set of ordered pairs. Second, we can obtain a useful visualization by thinking 
of the relation as a directed graph. Third, we can obtain a form useful for calculations
by considering the adjacency matrix. These connections can be formalized as
follows:





Binary Relations, Directed Graphs, and Adjacency Matrices
For a relation R on the set A = {1, 2, 3, ..., n}, the following statements are
equivalent for any a, b ∈ A:

1. (a, b) ∈ R (which we write sometimes as aRb).
2. There is a directed edge from node a to node b in the graph of R.
3. There is a 1 in the row a, column b entry of the adjacency matrix for R.











Example 5


1. Let A = {1, 2, 3, 4}, and let R be the relation {(1, 2), (1, 4), (3, 3), (2, 1), (4, 2)}.
Determine the corresponding graph and adjacency matrix.


2. For the adjacency matrix

\[
\begin{bmatrix}
0 & 1 & 0 & 1 & 0 \\
0 & 0 & 1 & 0 & 0 \\
1 & 0 & 0 & 1 & 1 \\
0 & 0 & 0 & 0 & 1 \\
0 & 0 & 0 & 0 & 0
\end{bmatrix}
\]

write the corresponding relation S on the set {1, 2, 3, 4, 5} and give its graph.








\[
\begin{bmatrix}
0 & 1 & 0 & 1 \\
1 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 \\
0 & 1 & 0 & 0
\end{bmatrix}
\]

Figure 7-70 The graph and adjacency matrix for R.

Figure 7-71. The graph for relation S.


SOLUTION 





1. The relation has an ordered pair (1, 2), indicating an edge from node 1 to
node 2 in the graph and a 1 in the (1, 2) entry of the matrix. The complete
graph and the adjacency matrix for the relation R are given in Figure 7-70.

2. The fact that $M_{31} = 1$ indicates that (3, 1) should be in the relation, and
that there is an edge from vertex 3 to vertex 1 in the graph. Reasoning
in this way for each entry of 1 in the matrix, we obtain the relation
S = {(1, 2), (1, 4), (2, 3), (3, 1), (3, 4), (3, 5), (4, 5)}. The graph is shown in
Figure 7-71.
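The translation between a relation and its adjacency matrix in Example 5 is mechanical, so it can be sketched in a few lines. The function names `relation_to_matrix` and `matrix_to_relation` are assumptions made for this illustration, and the set A is taken to be {1, 2, ..., n}.

```python
def relation_to_matrix(R, n):
    """Adjacency matrix of a relation R on {1, ..., n}: entry (a, b) is 1 iff (a, b) in R."""
    return [[1 if (a, b) in R else 0 for b in range(1, n + 1)]
            for a in range(1, n + 1)]


def matrix_to_relation(M):
    """Set of ordered pairs (a, b), 1-based, for which the (a, b) entry of M is 1."""
    n = len(M)
    return {(a, b) for a in range(1, n + 1) for b in range(1, n + 1)
            if M[a - 1][b - 1] == 1}


# Part 1 of Example 5: from the relation R to its matrix.
R = {(1, 2), (1, 4), (3, 3), (2, 1), (4, 2)}
matrix_R = relation_to_matrix(R, 4)

# Part 2 of Example 5: from a matrix back to the relation S.
matrix_S = [
    [0, 1, 0, 1, 0],
    [0, 0, 1, 0, 0],
    [1, 0, 0, 1, 1],
    [0, 0, 0, 0, 1],
    [0, 0, 0, 0, 0],
]
S = matrix_to_relation(matrix_S)
print(matrix_R)
print(sorted(S))
```

The edges of the graph of the relation are exactly the pairs in the set, so the same set of pairs serves as the edge list.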








These connections will allow us to apply our understanding of any one of these
structures to better understand the other two. Let's look at an example of this
process.








Boolean Operations and Composition of Relations 


One of the benefits of having these alternative ways to look at relations comes 
from a connection between relation composition and matrix multiplication. To 
develop this idea, we recall the definition of relation composition, applied to 
the situation of forming the composition of a relation with itself, and then we 
will translate this definition into the language of directed graphs and adjacency
matrices.














Definition Let R be a relation on the set A. The composition R ∘ R is the
relation on A given by the rule "(a, c) ∈ R ∘ R if there is an element b ∈ A such
that (a, b) ∈ R and (b, c) ∈ R."














To see the connection between this definition and graphs and adjacency matrices,
consider what this statement says for a relation R on the set A = {1, 2, ..., n}.

• If G is the graph of R and H is the graph of R ∘ R, then there is an edge from a to
c in H if there is an element b ∈ A such that there are edges in G from a to b and
from b to c.

• If M is the adjacency matrix for the relation R and N is the adjacency matrix for
R ∘ R, then "$N_{ac} = 1$ if for some b, $M_{ab} = 1$ and $M_{bc} = 1$."




















a | b || a · b | a + b
--+---++-------+------
0 | 0 ||   0   |   0
0 | 1 ||   0   |   1
1 | 0 ||   0   |   1
1 | 1 ||   1   |   1


























Table 7-5 Boolean Arithmetic 


The first correspondence means that R ∘ R captures information about the existence
of walks of length 2 in the graph of R, but what does the second correspondence
mean? The answer comes from realizing that in an adjacency matrix, an (i, j) entry
of 1 or 0 can be interpreted as values "true" or "false," respectively, for the statement
"There is an edge from i to j in the graph of R."





Since we have already seen that matrix multiplication counts the number of
walks in a graph, and all we want is a "true or false" answer, the clever idea here is to
use matrix multiplication to compute the adjacency matrix for R ∘ R, but to modify
the operation to only provide true or false answers. Fortunately, we already know of
a system of arithmetic that applies in situations where only true or false answers are
desired. We saw these operations in Section 3.4 when we studied Boolean algebra.
Table 7-5 shows the relevant interpretation of "plus" and "times" that need to carry
over from that section. For an adjacency matrix A, the idea here is to compute
the matrix product $A \cdot A$ using the Boolean arithmetic operations on the entries.
We will call this the Boolean product of A with itself, and we will denote the result
$A^{(2)}$ to distinguish it from the ordinary matrix product $A^2$. Let's compare these two
operations in an example, and see how the Boolean operation calculates exactly what
we need to know about the relation R ∘ R.


Example 6 Consider the relation R on the set {1, 2, 3, 4} whose graph G and
adjacency matrix A are given in Figure 7-72. Using $B = A^2$ and $C = A^{(2)}$, compare
$b_{31}$ to $c_{31}$ and $b_{23}$ to $c_{23}$.

\[
A = \begin{bmatrix}
1 & 1 & 0 & 0 \\
0 & 0 & 1 & 1 \\
1 & 0 & 0 & 1 \\
1 & 0 & 0 & 0
\end{bmatrix}
\]

Figure 7-72. The graph and adjacency matrix for Example 6.


SOLUTION


1. In the matrix $B = A^2$, row 3 × column 1 looks like

\[ b_{31} = 1 \cdot 1 + 0 \cdot 0 + 0 \cdot 1 + 1 \cdot 1 = 1 + 0 + 0 + 1 = 2 \]








We interpret this as "There is an edge from 3 to 1 and an edge from 1 to 1,
and there is an edge from 3 to 4 and from 4 to 1; hence 3, 1, 1 is a walk in G
of length 2 from node 3 to node 1, and 3, 4, 1 is another walk in G of length 2
from node 3 to node 1, so there is a total of two walks in G of length 2 from
node 3 to node 1."

In the matrix $C = A^{(2)}$, row 3 × column 1 looks the same, except we use
Boolean arithmetic to evaluate the final answer:

\[ c_{31} = 1 \cdot 1 + 0 \cdot 0 + 0 \cdot 1 + 1 \cdot 1 = 1 + 0 + 0 + 1 = 1 \]

We can think of this as "There is a walk in G of length 2 from 3 to 1 to 1, and
there is a walk in G of length 2 from 3 to 4 to 1, so there is certainly at least
one walk in G of length 2 from node 3 to node 1."


2. Similar calculations for $b_{23}$ and $c_{23}$ yield

\[ b_{23} = 0 \cdot 0 + 0 \cdot 1 + 1 \cdot 0 + 1 \cdot 0 = 0 + 0 + 0 + 0 = 0 \]
\[ c_{23} = 0 \cdot 0 + 0 \cdot 1 + 1 \cdot 0 + 1 \cdot 0 = 0 + 0 + 0 + 0 = 0 \]

In this case, the answers are the same because an answer of 0 to the question
"How many walks in G of length 2 are there from node 2 to node 3?" is the
same as a 0 (meaning "false") answer to the question "Is there a walk in G
of length 2 from node 2 to node 3?"


We can now look at the entire matrix $A^{(2)}$ and see its connection to the composite
relation R ∘ R.


Example 7 Let R = {(1, 1), (1, 2), (2, 3), (2, 4), (3, 1), (3, 4), (4, 1)} be the
relation from Example 6 with adjacency matrix

\[
A = \begin{bmatrix}
1 & 1 & 0 & 0 \\
0 & 0 & 1 & 1 \\
1 & 0 & 0 & 1 \\
1 & 0 & 0 & 0
\end{bmatrix}
\]

Find the relation R ∘ R, and compute the matrix $A^{(2)}$.


SOLUTION Using the Boolean product operation, we have

\[
A^{(2)} = \begin{bmatrix}
1 & 1 & 1 & 1 \\
1 & 0 & 0 & 1 \\
1 & 1 & 0 & 0 \\
1 & 1 & 0 & 0
\end{bmatrix}
\]

To find R ∘ R, we can either use the definition of composition or we can resort
to the two-set arrow diagrams from Section 4.2 as illustrated in Figure 7-73 to
find that

R ∘ R = {(1, 1), (1, 2), (1, 3), (1, 4), (2, 1), (2, 4), (3, 1), (3, 2), (4, 1), (4, 2)}


Notice that $A^{(2)}$ is the adjacency matrix for R ∘ R, just as we expected.
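The agreement observed in Example 7 can be reproduced directly. The sketch below assumes the Boolean arithmetic of Table 7-5 (so "plus" acts as logical or and "times" as logical and); the names `boolean_product`, `compose`, and `matrix_to_relation` are illustrative helpers, not notation from the text.

```python
def boolean_product(A, B):
    """Boolean product: entry (i, j) is 1 iff A[i][k] = B[k][j] = 1 for some k."""
    n = len(A)
    return [[int(any(A[i][k] and B[k][j] for k in range(n))) for j in range(n)]
            for i in range(n)]


def compose(R):
    """R o R: all pairs (a, c) with (a, b) in R and (b, c) in R for some b."""
    return {(a, c) for (a, b) in R for (b2, c) in R if b == b2}


def matrix_to_relation(M):
    """Ordered pairs (a, b), 1-based, whose matrix entry is 1."""
    n = len(M)
    return {(a, b) for a in range(1, n + 1) for b in range(1, n + 1)
            if M[a - 1][b - 1] == 1}


# The relation and adjacency matrix of Examples 6 and 7.
R = {(1, 1), (1, 2), (2, 3), (2, 4), (3, 1), (3, 4), (4, 1)}
A = [
    [1, 1, 0, 0],
    [0, 0, 1, 1],
    [1, 0, 0, 1],
    [1, 0, 0, 0],
]

A_bool2 = boolean_product(A, A)  # the matrix written A^(2) in the text
RR = compose(R)                  # the composite relation R o R
print(A_bool2)
print(sorted(RR))
```

Computing the composite both ways, once from the set of pairs and once from the matrix, is a cheap consistency check when working exercises by hand.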








Figure 7-73 The solution to Example 7.


This example illustrates the direct connection between R ∘ R and $M^{(2)}$, where M
is the adjacency matrix for R. We can formally state this connection as the following
theorem. The proof of this theorem is based directly on the ideas from this section,
but we will not include it here.


Theorem 2 If R is a binary relation on a set A with adjacency matrix M, then the
matrix $M^{(2)}$ is the adjacency matrix for the relation R ∘ R on the set A.


Practice Problem 5 For the relation R = {(1, 4), (2, 3), (1, 5), (2, 2), (3, 2), (5, 3)}
on the set {1, 2, 3, 4, 5, 6}, draw the directed graph of R, and use it to find R ∘ R as a set
of ordered pairs. Find the adjacency matrix M of R, and use it to compute $M^{(2)}$. Verify
that $M^{(2)}$ is the adjacency matrix for R ∘ R.


Application to Transitivity 

One application of composition explored in Exercises 21 and 22 of Section 4.4 gave 
a simple characterization of a transitive relation. This will give us, in turn, a simple 
test for transitivity of a binary relation using the adjacency matrix of the relation. We 
will learn about this test after a brief review of the concept of transitivity. 








Definition Let R be a relation on the set A, that is, R is a subset of A × A. R
is transitive if whenever (a, b) ∈ R and (b, c) ∈ R, it is also true that (a, c) ∈ R.











Example 8 Here are three relations on A = {1, 2, 3, 4, 5}. Which one is not
transitive?


1. Define $R_1$ by saying that $a R_1 b$ if a and b have the same number of letters in their
English-language spelling. For example, $4 R_1 5$ since the words four and five each
have four letters.

2. $R_2 = \{(a, b) : a^2 - b^2 \le 5\}$

3. $R_3$ = {(1, 1), (1, 4), (4, 1), (1, 2), (4, 2), (5, 5)}


SOLUTION


1. Not only is $R_1$ transitive, but it is also reflexive and symmetric, so this is an
equivalence relation and it partitions the set A into the subsets {1, 2}, {3},
and {4, 5}.





(a, b), (b, c)      Resulting (a, c)    Is (a, c) ∈ R3?
(1, 1), (1, 1)      (1, 1)              Yes
(1, 1), (1, 4)      (1, 4)              Yes
(1, 1), (1, 2)      (1, 2)              Yes
(1, 4), (4, 1)      (1, 1)              Yes
(1, 4), (4, 2)      (1, 2)              Yes
(4, 1), (1, 1)      (4, 1)              Yes
(4, 1), (1, 2)      (4, 2)              Yes


Table 7-6 Transitive Relation in Example 8


2. Relation $R_2$ is not transitive since (3, 2) ∈ $R_2$ and (2, 1) ∈ $R_2$ but (3, 1) ∉ $R_2$.
3. Relation $R_3$ is transitive, as shown by the complete list of pairs (a, b) and
(b, c) ∈ $R_3$ shown in Table 7-6.


Practice Problem 6 For A = {1, 2, 3} decide which of the following relations on
A is transitive:


(a) $R_1$ = {(1, 1), (2, 2), (3, 3), (1, 2), (2, 3)}
(b) $R_2$ = {(1, 3), (3, 1), (1, 1), (3, 3)}
(c) $R_3$ = {(1, 3), (3, 1)}








We have seen that checking a relation for transitivity can be fairly tedious, but if 
we write the definition of transitivity in terms of relations, graphs, and matrices, we 
will be able to take advantage of seeing the situation from several points of view. 





Proposition 3 Let a transitive relation R on a set A be given along with its associated
directed graph G and adjacency matrix M. Then for all a, b, c ∈ A, the following
are true:


• If aRb and bRc, then aRc.

• In the graph G, if there is an edge from a to b and an edge from b to c, then there is
an edge from a to c. We can rephrase this as "In the graph G, if there is a length-2
walk from a to c, then there is an edge from a to c."

• If $M_{ab} = 1$ and $M_{bc} = 1$, then $M_{ac} = 1$.


PROOF This is just a translation of the transitive property of binary relations
into the language of the other structures.


Example 9 Give the graph and adjacency matrix for relation $R_3$ of Example
8, and use them to discuss the transitivity of $R_3$.

\[
M = \begin{bmatrix}
1 & 1 & 0 & 1 & 0 \\
0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 \\
1 & 1 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 1
\end{bmatrix}
\]

Figure 7-74 The solution to Example 9.


SOLUTION The matrix and graph are given in Figure 7-74. The easiest way
to check transitivity is to use


11010 
00000 
M?-|jooo0o00 
11000 
00000 


which locates length-2 walks. When we compare this to M, we see that whenever
there is a length-2 walk, there is an edge, so the relation is transitive.


A matrix-based test of this kind is especially
useful if you have some form of technology aid* for the matrix calculations. To make
this test easy to state, we need one more piece of notation.





Definition Suppose M and N are two adjacency matrices of the same size.
We write M ≤ N to mean that, for every pair of indices i and j, $M_{ij} \le N_{ij}$. (That
is, $N_{ij} = 1$ whenever $M_{ij} = 1$.)


Proposition 4 If M is the adjacency matrix for a binary relation R, then R is
transitive if and only if $M^{(2)} \le M$.

PROOF This statement combines the statement of transitivity in terms of directed
graphs in Proposition 3 with the interpretation of $M^{(2)}$ given in Theorem 5. It also
follows directly from Exercises 22 and 23 of Section 4.4.
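Proposition 4 turns the transitivity check into one Boolean product and one entrywise comparison. The following sketch assumes 0/1 adjacency matrices stored as lists of rows; the function names and the two test relations (strict "less than" on {1, 2, 3, 4}, and the relation R of Example 7) are chosen here for illustration only.

```python
def boolean_product(A, B):
    """Boolean product of two square 0/1 matrices."""
    n = len(A)
    return [[int(any(A[i][k] and B[k][j] for k in range(n))) for j in range(n)]
            for i in range(n)]


def leq(M, N):
    """M <= N in the sense of the definition: N[i][j] is 1 whenever M[i][j] is 1."""
    return all(m <= x for rm, rn in zip(M, N) for m, x in zip(rm, rn))


def is_transitive(M):
    """Test of Proposition 4: the relation is transitive iff M^(2) <= M."""
    return leq(boolean_product(M, M), M)


# Illustrative relation 1: a < b on {1, 2, 3, 4}, which is transitive.
less_than = [
    [0, 1, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 0, 1],
    [0, 0, 0, 0],
]

# Illustrative relation 2: the relation R of Example 7. It contains (1, 2)
# and (2, 3) but not (1, 3), so it is not transitive.
example7 = [
    [1, 1, 0, 0],
    [0, 0, 1, 1],
    [1, 0, 0, 1],
    [1, 0, 0, 0],
]

print(is_transitive(less_than), is_transitive(example7))
```

When the test fails, any position where $M^{(2)}$ has a 1 and M has a 0 names a pair (a, c) that witnesses the failure, which is often more useful than the bare yes or no.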


Practice Problem 7 Verify that Proposition 4 is correct for each of these relations
on A = {1, 2, 3} from Practice Problem 6:


(a) $R_1$ = {(1, 1), (2, 2), (3, 3), (1, 2), (2, 3)}
(b) $R_2$ = {(1, 3), (3, 1), (1, 1), (3, 3)}
(c) $R_3$ = {(1, 3), (3, 1)}








* Appendix B explains how to use a TI-83 calculator for matrix arithmetic 







Similar matrix conditions can be found for checking that a relation satisfies 
other properties (reflexive, symmetric, etc.) of interest from Chapter 4. These will be 
explored more in the exercises. 


Application to Connectivity 


This section began with an application of matrices to the problem of counting the
number of walks in a graph. There is a variation of this idea that can answer a related
question. Although it might be interesting to know how many walks there are in a
graph, the existence of walks is often all you really need to know about. An example
of this would be a test to see if a graph is connected. This would certainly be a nice
thing to have a computer check for us, so we should try to understand this property
of graphs in terms of operations on the corresponding adjacency matrices. At this
point, it should come as no surprise that the answer to the yes or no question "Is there
a walk from node i to node j?" is related to the Boolean product of the adjacency
matrix. To make this precise, we will need slightly more general notation than we
have used before.





Definition For an adjacency matrix M, we will write

\[ M^{(k)} = \underbrace{M \cdot M \cdots M}_{k \text{ times}} \]

where the · operation on the right is the Boolean product.














This notation allows us to restate Theorem 1 in terms of the existence of walks. 


Theorem 5 Let M be the adjacency matrix of a graph G with nodes
{1, 2, 3, ..., n}. The row i, column j entry of $M^{(k)}$ is 1 if and only if there exists a
k-step walk in G from node i to node j.


PROOF See Exercise 14.


Example 10 Consider the graph G with its adjacency matrix A and its Boolean
adjacency matrix B shown in Figure 7-75. Compare $A^3$ and $B^{(3)}$, and give an
interpretation of each.

\[
A = \begin{bmatrix}
1 & 2 & 0 & 1 \\
2 & 0 & 1 & 1 \\
0 & 1 & 0 & 0 \\
1 & 1 & 0 & 0
\end{bmatrix}
\qquad
B = \begin{bmatrix}
1 & 1 & 0 & 1 \\
1 & 0 & 1 & 1 \\
0 & 1 & 0 & 0 \\
1 & 1 & 0 & 0
\end{bmatrix}
\]

Figure 7-75 The Boolean adjacency matrix for Example 10.


SOLUTION We compute

\[
A^3 = \begin{bmatrix}
15 & 17 & 3 & 9 \\
17 & 8 & 6 & 9 \\
3 & 6 & 0 & 2 \\
9 & 9 & 2 & 5
\end{bmatrix}
\qquad
B^{(3)} = \begin{bmatrix}
1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1 \\
1 & 1 & 0 & 1 \\
1 & 1 & 1 & 1
\end{bmatrix}
\]

As we have seen before, the entries in $A^3$ count walks of length 3 between
various pairs of nodes. We can interpret $B^{(3)}$ as providing "existence" information
(instead of a count) about walks of length 3. Hence, we can tell that node 3 is
not connected to itself by a length-3 walk, but every other pair of nodes is. Note
that this information is also readily available in $A^3$.
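The comparison in Example 10 can be sketched as follows. The matrices are those of Figure 7-75; the helper names `mat_mul`, `boolean_product`, and `to_boolean` are assumptions made for this illustration.

```python
def mat_mul(A, B):
    """Ordinary matrix product (counts walks)."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def boolean_product(A, B):
    """Boolean product (records only whether a walk exists)."""
    n = len(A)
    return [[int(any(A[i][k] and B[k][j] for k in range(n))) for j in range(n)]
            for i in range(n)]


def to_boolean(A):
    """Replace every nonzero entry by 1."""
    return [[1 if x else 0 for x in row] for row in A]


# Adjacency matrix A of Example 10 (parallel edges give entries larger than 1).
A = [
    [1, 2, 0, 1],
    [2, 0, 1, 1],
    [0, 1, 0, 0],
    [1, 1, 0, 0],
]
B = to_boolean(A)

A3 = mat_mul(mat_mul(A, A), A)                  # ordinary cube: walk counts
B3 = boolean_product(boolean_product(B, B), B)  # Boolean cube: walk existence
print(A3)
print(B3)
```

Since the entries of A are nonnegative, replacing each nonzero entry of $A^3$ by 1 gives exactly $B^{(3)}$, which is the sense in which the existence information is "readily available" in $A^3$.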


To check for connectivity in a graph, we do not care as much about the length of
a walk in a graph as about its existence. For a given pair of nodes i and j, we really
want to know if there is a walk from i to j of length 1, one of length 2, or one of
length 3, and so on. Since we are already using Boolean arithmetic with the entries of
the matrices, the use of the logical "or" in this statement can easily be incorporated
into those calculations with one more piece of special notation.











Definition If M and N are adjacency matrices of the same size, the matrix
$M \lor N$ is the adjacency matrix whose (i, j) entry is $M_{ij} + N_{ij}$, using the "plus"
operation from Boolean arithmetic.














Using this operation, it is easy to build a matrix that tells us about the existence 
of walks of length k or less. 


Corollary 6 Let M be the adjacency matrix of a graph G with nodes
{1, 2, 3, ..., n}. The row i, column j entry of

\[ M^{(1)} \lor M^{(2)} \lor M^{(3)} \lor \cdots \lor M^{(k)} \]

is 1 if and only if there exists a walk from node i to node j of length 1, 2, 3, ..., or k,
that is, a nontrivial walk of length k or less.





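Figure 7-76 The graph for Example 11.

Example 11 Form the Boolean adjacency matrix of the graph G shown in
Figure 7-76, calculate $M^{(1)}$, $M^{(2)}$, $M^{(3)}$, and explain the meaning of entries in
$M^{(1)} \lor M^{(2)} \lor M^{(3)}$ in an English sentence.


SOLUTION

\[
M^{(1)} = \begin{bmatrix}
0 & 1 & 0 & 0 & 1 & 0 \\
1 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 0 & 1 & 1 \\
0 & 1 & 0 & 0 & 1 & 0 \\
1 & 0 & 1 & 1 & 0 & 1 \\
0 & 0 & 1 & 0 & 1 & 0
\end{bmatrix}
\quad
M^{(2)} = \begin{bmatrix}
1 & 0 & 1 & 1 & 0 & 1 \\
0 & 1 & 0 & 0 & 1 & 0 \\
1 & 0 & 1 & 1 & 1 & 1 \\
1 & 0 & 1 & 1 & 0 & 1 \\
0 & 1 & 1 & 0 & 1 & 1 \\
1 & 0 & 1 & 1 & 1 & 1
\end{bmatrix}
\quad
M^{(3)} = \begin{bmatrix}
0 & 1 & 1 & 0 & 1 & 1 \\
1 & 0 & 1 & 1 & 0 & 1 \\
1 & 1 & 1 & 1 & 1 & 1 \\
0 & 1 & 1 & 0 & 1 & 1 \\
1 & 0 & 1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1 & 1 & 1
\end{bmatrix}
\]

\[
M^{(1)} \lor M^{(2)} \lor M^{(3)} = \begin{bmatrix}
1 & 1 & 1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1 & 1 & 1
\end{bmatrix}
\]

The final result is a matrix where every entry is 1. This tells us that every vertex
in the graph with adjacency matrix M can be reached from any vertex (including
itself) in one, two, or three steps.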








Practice Problem 8 In the graph of Example 11, list all the pairs of vertices that do
not have walks of length 1 or 2 from one to the other, by considering just $M^{(1)} \lor M^{(2)}$.


Following the example and practice problem above and realizing that the longest 
path in a graph on n vertices has a length of at most n — 1 (as proved in Exercise 7 
of Section 7.2), we have the following corollary to Theorem 5. 





Corollary 7 If G is a graph on n vertices with adjacency matrix M, then G is 
connected if and only if the matrix 


MU) y MO) v MOD y... y Mtn 


consists of all 1's.


We should note that this is not the most efficient way to check if a graph is
connected, but it is a very easy test to implement in any computer language in which
basic matrix operations are available.
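As an illustration of how little code the test needs, here is a minimal sketch. It is not the text's own implementation: the function names are hypothetical, the graph is assumed to be undirected with at least two vertices, and the sketch deliberately combines the Boolean powers up to length n (including extra powers can only add walks that really exist, so this choice is safe).

```python
def boolean_product(A, B):
    """Boolean product of two square 0/1 matrices."""
    n = len(A)
    return [[int(any(A[i][k] and B[k][j] for k in range(n))) for j in range(n)]
            for i in range(n)]


def boolean_or(A, B):
    """Entrywise Boolean 'plus' of two 0/1 matrices of the same size."""
    return [[a | b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]


def reach_within(M, k):
    """M^(1) v M^(2) v ... v M^(k): existence of nontrivial walks of length <= k."""
    power = [row[:] for row in M]
    result = [row[:] for row in M]
    for _ in range(k - 1):
        power = boolean_product(power, M)
        result = boolean_or(result, power)
    return result


def is_connected(M):
    """Connectivity test for an undirected graph on n >= 2 vertices (assumption)."""
    n = len(M)
    return all(all(row) for row in reach_within(M, n))


# Boolean adjacency matrix of the graph in Example 11.
M = [
    [0, 1, 0, 0, 1, 0],
    [1, 0, 0, 1, 0, 0],
    [0, 0, 0, 0, 1, 1],
    [0, 1, 0, 0, 1, 0],
    [1, 0, 1, 1, 0, 1],
    [0, 0, 1, 0, 1, 0],
]

# A made-up graph with two separate edges {1, 2} and {3, 4}: not connected.
two_pieces = [
    [0, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 1],
    [0, 0, 1, 0],
]

print(is_connected(M), is_connected(two_pieces))
```

Each Boolean product costs on the order of n^3 operations, so the whole test is roughly n^4; a breadth-first search from one vertex answers the same question far faster, which is the inefficiency the text alludes to.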





Solutions to Practice Problems 


) The adjacency matrix for G is gal, but the move from 8 to 10 is illegal, so we want 


an edge in the former direction only. 





03001 
30000 (9) 1.1.0.0,0,1,0 
m=|00120 (c) There is an edge from 4 to 3 but not from 3 t04, In 
002190 an ordinary graph, since edges are not directed, we 
10000 will always have My = My. 
for which Ma = 0, Ma = 0, Ma 3 The multiplication gives (2)(4) + (0)(0) + (00) + 





(0)(0) + GNI) (045) =8+3=11. The (Q4) is 





jacency matrix for H is 


apo 0 10 
blot 
Mk 01 
di0 0 1.0 

for which Mis = 1, My, = 1, Mu = 0, and Mss does not 





exist since there is no fifth node, 
2 (a) In this game you are not allowed to put stones back 
in the pile. For example, the move from 10 to 8 is le- 


$M_{21} \cdot M_{13}$, so it corresponds to the fact that in G there
are two edges from node 2 to node 1, and also four edges
from node 1 to node 3. If we apply the counting techniques
of Chapter 5, there are 2 · 4 = 8 length-2 walks
from 2 to 1 to 3. Likewise, there are 3 · 1 = 3 such walks
from 2 to 5 to 3. Since you can get from node 2 to node 3
by way of either node 5 or node 1, there are 8 + 3 = 11
total walks of length 2. The matrix product does still
count the walks.














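4. For walks from node 6 to node 2: $M_{62} = 1$, indicating the walk 6, 2;
$(M^2)_{62} = 1$, indicating the length-2 walk 6, 3, 2; and $(M^3)_{62} = 3$, indicating the
three walks 6, 1, 3, 2; 6, 1, 4, 2; and 6, 3, 4, 2. Similarly, for walks from node 1 to
node 1, $M_{11} = 0$ and $(M^2)_{11} = 0$, indicating no walks of length 1 or 2, but
$(M^3)_{11} = 4$, indicating the walks 1, 4, 2, 1; 1, 4, 5, 1; 1, 4, 6, 1; and 1, 3, 2, 1.

5. The graph and adjacency matrix are given in Figure 7-77.

\[
M = \begin{bmatrix}
0 & 0 & 0 & 1 & 1 & 0 \\
0 & 1 & 1 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0
\end{bmatrix}
\]

Figure 7-77 Solution to Practice Problem 5.

Using the Boolean product, we compute

\[
M^{(2)} = M \cdot M = \begin{bmatrix}
0 & 0 & 1 & 0 & 0 & 0 \\
0 & 1 & 1 & 0 & 0 & 0 \\
0 & 1 & 1 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0
\end{bmatrix}
\]

Using the graph of R we can find R ∘ R =
{(1, 3), (2, 2), (2, 3), (3, 2), (3, 3), (5, 2)}. This is indeed
the relation for which $M^{(2)}$ is the adjacency matrix.

6. Only relation $R_2$ is transitive, as Table 7-7 shows. As
for the other relations, (1, 2) ∈ $R_1$ and (2, 3) ∈ $R_1$ but
(1, 3) ∉ $R_1$, and (1, 3) ∈ $R_3$ and (3, 1) ∈ $R_3$ but (1, 1) ∉ $R_3$.

(a, b), (b, c)      Resulting (a, c)    Is (a, c) ∈ R2?
(1, 3), (3, 1)      (1, 1)              Yes
(1, 3), (3, 3)      (1, 3)              Yes
(3, 1), (1, 1)      (3, 1)              Yes
(3, 1), (1, 3)      (3, 3)              Yes
(1, 1), (1, 1)      (1, 1)              Yes
(1, 1), (1, 3)      (1, 3)              Yes
(3, 3), (3, 1)      (3, 1)              Yes
(3, 3), (3, 3)      (3, 3)              Yes

Table 7-7 Table for Practice Problem 6.

7. Letting $M_i$ be the adjacency matrix for $R_i$, we have

\[
M_1 = \begin{bmatrix} 1 & 1 & 0 \\ 0 & 1 & 1 \\ 0 & 0 & 1 \end{bmatrix}, \quad
M_2 = \begin{bmatrix} 1 & 0 & 1 \\ 0 & 0 & 0 \\ 1 & 0 & 1 \end{bmatrix}, \quad
M_3 = \begin{bmatrix} 0 & 0 & 1 \\ 0 & 0 & 0 \\ 1 & 0 & 0 \end{bmatrix}
\]

\[
M_1^{(2)} = \begin{bmatrix} 1 & 1 & 1 \\ 0 & 1 & 1 \\ 0 & 0 & 1 \end{bmatrix}, \quad
M_2^{(2)} = \begin{bmatrix} 1 & 0 & 1 \\ 0 & 0 & 0 \\ 1 & 0 & 1 \end{bmatrix}, \quad
M_3^{(2)} = \begin{bmatrix} 1 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 1 \end{bmatrix}
\]

To test the relations for transitivity, we can see that
$M_1^{(2)} \not\le M_1$ since the (1, 3) entry in $M_1^{(2)}$ is 1, while the
(1, 3) entry in $M_1$ is 0, so $R_1$ is not transitive. A similar
remark applies to the (1, 1) entry to demonstrate that
$M_3^{(2)} \not\le M_3$, showing relation $R_3$ is not transitive. However,
it is the case that $M_2^{(2)} \le M_2$, so $R_2$ is a transitive
relation.

8. Since

\[
M^{(1)} \lor M^{(2)} = \begin{bmatrix}
1 & 1 & 1 & 1 & 1 & 1 \\
1 & 1 & 0 & 1 & 1 & 0 \\
1 & 0 & 1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1 & 1 & 1 \\
1 & 0 & 1 & 1 & 1 & 1
\end{bmatrix}
\]

there are no length 1 or length 2 walks from node 2 to
node 3 and there are no length 1 or length 2 walks from
node 2 to node 6.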





Figure 7-78 Graphs for Problem 1.


Exercises for Section 7.4


1. Give the adjacency matrix for each graph in Figure 7-78. 
2. Give the adjacency matrix for these graphs:
(a) The bridges of Königsberg (page 506)
(b) The envelope of Figure 7-14 (page 513)
(c) $K_5$, the complete (simple) graph with vertex set
{1, 2, 3, 4, 5}

3. Explain how to use the adjacency matrix to calculate. 
the degree of a node for these situations: 

(a) A simple graph (no parallel edges, no loops). 
(b) Loops are allowed but not parallel edges. 
(c) Parallel edges are allowed but not loops.

(d) Both loops and parallel edges are allowed. 

4. For a directed graph, the out-degree of a node is the 
number of edges beginning at the node, and the in- 
degree is the number of edges ending at the node. Ex- 
plain how to use the adjacency matrix of a directed 
graph to calculate the in-degree and out-degree of a 
node for these situations: 

(a) No parallel edges, no loops. 

(b) Loops are allowed but not parallel edges. 
(c) Parallel edges are allowed but not loops.
(d) Both loops and parallel edges are allowed. 

5. Use Theorem 1 to count the number of walks of length
3 or less from node 1 to node 6 in each graph shown
in Figure 7-79. Check each answer by directly counting
the walks in the graph.

6. For each graph in Problem 5, give the corresponding
relation on the set {1, 2, 3, 4, 5, 6} as a set of ordered
pairs.

7. In solving Exercise 5, you calculated $M^3$ for the adjacency
matrices of the graphs.

(a) Use this to determine $M^{(3)}$, the matrix that would
result using Boolean operations rather than ordinary
arithmetic operations.

(b) Use the result of part (a) to determine R ∘ R ∘ R,
where R is the relation corresponding to the graph.


Figure 7-79 Graphs for Problem 5.

8. For each given relation on the set A = (1,2,3,4, 5,6], 

draw the graph and give the adjacency matrices for re- 
lations R and 

(1.2).2.3).0,4.(4.5).6.6)) 

© R= (1.3) G.1).(,5). 5.1). .5) (5,3) (1.1), 
0.3.6.5) 

9. Suppose 














eee SSS 


Noe 1 2 3 
Number as ae as 


(0) Using Theorem 5, we see that the Boolean ex- 
pression M'? v M® v M?! determines all pairs of 
nodes between which there is a walk of length 3 
or less. What does MO v M2 v MO! v Mv MO? 
determine? 

(b) If we want to determine the existence of nontrivial
walks of length 7 or less, what calculation could we
do?

(c) If we want to determine the existence of nontrivial
walks of length 9 or less, what calculation could we
do?

(d) Suppose G has 10 nodes. Using Theorem 5, what
calculation could we do to determine the existence
of nontrivial walks of all possible lengths?

(e) Generalize your answer for part (d) to a graph with
n nodes.

10. This problem develops some ideas necessary to prove
Theorem 1. The formal proof can be found in Exercise
13.

Suppose a graph has nine nodes labeled 1, 2, 3, ..., 9.
In Table 7-8, the left-hand table gives the number of one-step
walks from node 6 to each of the nodes in the graph,
and the right-hand table gives the number of four-step
walks from various nodes to node 3.

(a) Notice that there are two one-step walks from node
6 to node 1, and five four-step walks from node 1 to
node 3. How many five-step walks are there from
node 6 to node 3 that begin by going to node 1?

















Table 7-8 Tables for Exercise 10


Table 7-9 Table for Exercise 11


(b) How many five-step walks are there from node 6 to
node 3 that begin by going to node 2?

(c) How many five-step walks are there from node 6 to
node 3 that begin by going to node 8?

(d) How many five-step walks are there from node 6 to
node 3 that begin by going to node 9?

(e) How many five-step walks are there from node 6 to
node 3?

11. This problem continues the development of ideas for
Theorem 1.

(a) Repeat what you did in the previous problem, but
using Table 7-9, where symbols have replaced the
numbers.

(b) The notation in part (a) uses $a_r$ to indicate the number
of one-step walks from node 6 to node r, and
$b_r$ for the number of four-step walks from node r to
node 3. Using the similar notation of $a_r$ to indicate
the number of one-step walks from node 8 to node
r, and $b_r$ for the number of four-step walks from
node r to node 2, write a formula for the number of
five-step walks from node 8 to node 2.

(c) Generalize to obtain a formula for the number of
five-step walks from node i to node j.

(d) Fill in the blanks: If M is the adjacency matrix for the
graph, the first table in part (a) gives row ____
of M, and the second table gives column ____
of $M^4$.


12. This problem continues the development of ideas for
Theorem 1.
(a) We will use induction on k, where k is the power
to which we are raising the matrix M. Let P(k) be
the statement "For every pair of integers i and j,
the row i, column j entry of $M^k$ counts the number
of k-step walks from node i to node j." Write P(1)
carefully.
(b) Explain why P(1) is true. (HINT: A one-step walk
is the same as what?)
(c) Write P(k − 1) carefully.
13. Prove Theorem 1 by induction on k ≥ 1. In the induction
step you should connect the fact that $M^k = M \cdot M^{k-1}$
with a recursive counting model for the number of walks
of length k between a fixed pair of nodes. The previous
two problems contain related ideas.
14. Explain how Theorem 5 follows from Theorem 1.
15. For each of the following sets A and relations R on A,
give the graph and the adjacency matrix for the relation:
(a) A = {1, 2, 3, 4}, R = {(1, 1), (2, 2), (3, 3), (4, 4)}











(b) A = (1,2,3,4), R= ((1,1), 2.3). (3.2). (4.1). 
(4.4) 

(©) A = (1,2,3, 41, R= (12). (2.3). (1,3). 1). 
8.2). 0] 

(® A = (1,2,3,4), R = ((0,1).(2,2). 3,3). (4.4). 
(1.3) B. 1). (1.4), (2.49) 


(6) A- 
(0.1. 
(D A = (1,2,3,4, S), R = IC. 3), (2. 4). (3.5), (4.1). 
(5:4). 63. 1). 2,2). 0,5)] 
4p A = (1.2.3.4, 5), R = (C, 2), (2,3). (3,4). (4.5). 
(5,4).(4.3)..2). 2. D) 

16. Using the notation of Proposition 4, we can test a relation
with adjacency matrix M for being reflexive by
checking if I ≤ M, where I is the matrix* of the same
size as M with 1's on the main diagonal and 0's elsewhere.
For each relation in Exercise 15, use this test to
determine if the relation is reflexive.

17. Prove that for any relation R with adjacency matrix M,
the matrix $I \lor M$ is the adjacency matrix of a reflexive
relation. This new relation is called the reflexive closure
of R. It is the smallest reflexive relation extending R.

18. Find the adjacency matrix, graph, and set of ordered
pairs descriptions of the reflexive closure of each relation
in Exercise 15.

19. We can test a relation with adjacency matrix M for being
symmetric by checking if $M = M^T$, where $M^T$ is the

(1,2). 2. 0), (1,3). 1). 














a. 


matrix! formed by transposing the rows and columns 
of M. For each relation in Exercise 15, use this test to 
determine if the relation is symmetric. 

Prove that for any relation R with adjacency matrix M, 
the matrix M v M is the adjacency matrix of a sym- 
metric relation. This new relation is called the symmet- 
rie closure of R. It is the smallest symmetric relation 
extending R. 

Find the adjacency matrix, graph, and set of ordered 
pairs descriptions of the symmetric closure of each re- 
lation in Exercise 15. 

22. Use the test given in Proposition 4 to determine if each relation in
Exercise 15 is transitive. For those that are not, fill in the blanks in
this sentence: "___ R ___ and ___ R ___, but not ___ R ___."

23. The smallest transitive relation that extends a given relation R is
called the transitive closure of R. If M is the adjacency matrix for a
relation R, then the adjacency matrix for the transitive closure of R is

    M^(1) ∨ M^(2) ∨ M^(3) ∨ ... ∨ M^(K)

where K is the length of the longest path in the graph of R. We will see
how this procedure works by finding the transitive closure of the
following relation on the set A = {1, 2, 3, 4, 5, 6} with the arrow
diagram shown in Figure 7-80.


Re(y)eAxAu1-y 


(a) Use the graph to make a complete list of pairs of nodes (a, b) for
which there is a walk from node a to b.

(b) Use the result of part (a) to write the transitive closure of the
corresponding relation.

(c) Write the adjacency matrix M for the graph.


Figure 7-80 Diagram for Problem 23.


* This matrix is called the identity matrix. See Appendix B for more on matrices.

† This matrix is called the transpose of M. See Appendix B for more on matrices.








24. Let the relation R on the set C = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10} be
given by

    R = {(a, b) : There is a direct flight from city a to city b}

where the flights are described by the directed graph in Figure 7-81.
Find the matrix for the transitive closure of R. Write an English
sentence explaining the meaning of the transitive closure in terms of
flights and cities.

Figure 7-81 Diagram for Problem 24.

25. Find the adjacency matrix, graph, and set of ordered pairs
descriptions of the transitive closure of each relation in Exercise 15.


26. For a transitive relation, we know that M^(2) ≤ M: if there is a 1
in row a, column b of M^(2), then there must be a 1 in row a, column b
of M. A friend has conjectured that, in fact, M^(2) = M for transitive
relations. Either prove this statement is true, or find a
counterexample. (Hint: What does this conjecture say in terms of walks
in a graph?)





(d) Calculate M^(1) ∨ M^(2) ∨ M^(3) ∨ M^(4) ∨ M^(5) to determine the
transitive closure of the relation. Compare your result to what you
found in part (b).


7.5 Graphs in Puzzles and Games





Throughout this chapter, we have seen important ideas in graph theory motivated 
by problems about puzzles. We will turn the tables in this section, taking some of the 
basic graph structures we have developed and applying them to puzzles and games 
we have not seen before. 





Wolves, Goats, and Cabbages 


The following version of a truly ancient puzzle is generally attributed to Alcuin of 
York (735-804), a friend of Charlemagne. 


Example 1 A traveler has three possessions, a wolf, a goat, and a
cabbage, which he must transport across a river. The catch is that, if
left alone, the wolf will eat the goat or the goat will eat the cabbage,
and naturally the boat can hold only the traveler with one possession at
a time.

The first thing that comes to mind when considering this puzzle is
probably "That must be a pretty big cabbage." Hopefully, the second
thing that comes to mind is how this puzzle might be modeled using a
graph.


SOLUTION We will represent a "legal" state of this puzzle (that is, one
in which nothing is being eaten) as a pair in which the first entry
tells who or what is on the departing shore and the second entry tells
who or what is on the arriving shore. We use Ø to stand for "no one."
For example, the pair (WC, TG) means that the wolf and the cabbage are
on the departing shore and the traveler and the goat are on the arriving
shore. The pair (WCTG, Ø) means that everything is on the departing
shore.

We can make a complete list of the legal states, since it turns out
there are only 10 of them. We organize them by listing first those
states with three items on the departing shore, then those states with
two items on the departing shore, and so on.

    (WCTG, Ø)   (WCT, G)   (WTG, C)   (CGT, W)   (WC, TG)
    (GT, WC)    (W, CTG)   (C, WGT)   (G, WTC)   (Ø, WGTC)

Figure 7-82 A wolf, a goat, and a (really big) cabbage.


We then draw a graph in which the legal states form the nodes, and we
draw an edge between two states if it is possible to get from one to the
other in a single crossing. Thus, we have the graph in Figure 7-82.

Notice that a solution to the puzzle is simply a path from (WCTG, Ø) to
(Ø, WGTC).





From this point of view, we can see that there are two solutions to the
puzzle, and they take the same number of crossings to get across the
river. The graph model not only helps us find a solution to the puzzle,
but it also allows us to look at all solutions, a valuable attribute of
graph theoretic solutions.
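The state graph of Example 1 is small enough to build and search by
machine. The following is a minimal Python sketch, not the book's own
method. It assumes a state can be stored as a pair (items on the
departing shore, whether the traveler is there); the names ITEMS,
is_legal, neighbors, legal_states, and all_solutions are hypothetical
helpers introduced only for this illustration.

```python
from itertools import combinations

ITEMS = frozenset("WGC")          # wolf, goat, cabbage
START = (ITEMS, True)             # (items on departing shore, traveler there?)
GOAL = (frozenset(), False)


def is_legal(state):
    """Legal means the shore without the traveler has no eater/eaten pair."""
    departing, traveler_here = state
    unattended = ITEMS - departing if traveler_here else departing
    return not ({"W", "G"} <= unattended or {"G", "C"} <= unattended)


def legal_states():
    """Every legal (departing shore, traveler side) combination."""
    subsets = [frozenset(s) for r in range(4) for s in combinations("WGC", r)]
    return [(s, t) for s in subsets for t in (True, False) if is_legal((s, t))]


def neighbors(state):
    """States reachable in one crossing: traveler alone or with one item."""
    departing, traveler_here = state
    near_shore = departing if traveler_here else ITEMS - departing
    for cargo in [None, *sorted(near_shore)]:
        moved = frozenset() if cargo is None else frozenset(cargo)
        new_departing = departing - moved if traveler_here else departing | moved
        candidate = (new_departing, not traveler_here)
        if is_legal(candidate):
            yield candidate


def all_solutions(path=None):
    """Depth-first enumeration of every simple path from START to GOAL."""
    path = path or [START]
    if path[-1] == GOAL:
        yield list(path)
        return
    for nxt in neighbors(path[-1]):
        if nxt not in path:            # never revisit a state
            yield from all_solutions(path + [nxt])


solutions = list(all_solutions())
print(len(legal_states()), "legal states")
print(len(solutions), "solutions;",
      sorted({len(p) - 1 for p in solutions}), "crossings each")
```

Enumerating simple paths rather than only one shortest path mirrors the
point made above: the graph lets us inspect every solution, not just
find one.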








Practice Problem 1 Suppose the traveler is actually traveling by ferry boat and is 
charged an extra $5 fee each time he crosses the river with a live animal. How should 
he get all his items across the river for as little money as possible? 





Instant Insanity 


The previous problem illustrates a fundamental idea in the modeling of
puzzles and games using graphs: each node represents a "state" of the
puzzle, and edges connect states that can be reached one from the other
in one "move." When this happens, the crucial structure in the graph
model is a path from the beginning state to the end state. This type of
model will come up again in the context of two-player games at the end
of this section.

The Instant Insanity™* puzzle consists of four cubes with faces colored
green, red, blue, or white. The goal is to stack the cubes into a tower
so that each of the four colors is represented on each of the four sides
of the tower. Figure 7-83 shows how the four cubes are colored. On cube
1, for example, there is a white face in front, a white face to the
right, and a red face on top, while there is a blue face to the left, a
green face on back, and a green face on the bottom.















* INSTANT INSANITY is a trademark of Winning Moves, Inc. © 2004 Winning
Moves, Inc. However, variations of the game have been around for over a
century. It appears to have been patented in the U.K. in various forms
in the early twentieth century, and it was marketed in the U.S. in the
1960s, in the form we consider here, by Parker Brothers. It has recently
been re-released by Winning Moves, Inc.







Figure 7-83. Instant Insanity cubes. 





Figure 7-84 The graph of the puzzle. 





The puzzle can be modeled with a graph in the following way. Label four
nodes of a graph Red, Blue, Green, and White. Now connect two nodes with
an edge labeled i if cube i has a pair of opposite faces with the colors
of the nodes. For example, since cube 4 has a white face opposite a red
face, there is an edge from White to Red labeled with a 4. This process
forms the graph with four nodes and twelve labeled edges shown in
Figure 7-84.

Now that we have a simple representation of the puzzle, we need to form
a strategy for solving it. The strategy requires a little bit of
ingenuity, but it is easy to understand once you try it. We will first
solve the easier problem of getting all four colors on just the front
and back columns while ignoring the other pair of columns. We will call
this a partial solution to the puzzle. In order to get the other columns
into position without disturbing our "solved" columns, each cube can be
held by its front and back faces and rotated to change the exposed faces
on the left and right columns until the left and right sides are solved
as well.

To formalize this strategy, we need to know how to use the graph to find
a partial solution to the puzzle.
































Definition In the graph G of an Instant Insanity™ puzzle, a good
subgraph of G is one using all four nodes and one edge with each label,
such that each node has degree 2.














For any good subgraph H, we can create a partial solution to the puzzle
by stacking the cubes in numerical order so cube i is placed in such a
way that the forward/backward-facing colors match the endpoints of the
edge labeled i in H. Since every node in H has degree 2, every color
will be represented on exactly two cubes, so we can easily arrange them
in order that each color appears on the front column once and on the
rear column once. Figure 7-85 shows a good subgraph and its accompanying
partial solution of the puzzle.

Figure 7-85 A good subgraph yields a partial solution.

At this point, we might start rotating each cube (fixing its front and back faces) to 
try all possibilities. This would not take an entirely unreasonable amount of time, but 
we can do much better by simply using the graph again. If we can find a second good 
subgraph that does not use any of the same edges that our first one did, then we can 
rotate each cube as we mentioned before until the partial solution corresponding to 
the second subgraph occurs in the left and right faces of the stack. This will completely
solve the puzzle.

If we cannot find a second good subgraph in this way, however, we can only 
conclude that our first good subgraph was a poor choice, not necessarily that there 
is no solution. The first subgraph we already found for the puzzle at hand illustrates 
this shortcoming very nicely. 











Example 2 In order to find a second good subgraph that does not use any
of the edges from the first, we can simply delete the edges used by the
first and look for a good subgraph in the graph, shown in Figure 7-86,
that remains. Explain why this new graph has no good subgraph.


SOLUTION To get a good subgraph, we must choose exactly one of the edges
labeled 1. But once we do that, it is impossible for the Green node to
have an even degree. (The degree will be 1 if we use the edge labeled 2
from Blue to White, and the degree will be 3 if we use the loop labeled
2 at Green.) We conclude that there is no good subgraph.

Figure 7-86 This solution cannot be completed.

















Figure 7-89 The solved puzzle.
















Figure 7-87 Another good subgraph and partial solution.


Figure 7-88 Finding a second good subgraph.


Unfortunately, not just any choice of a first good subgraph will
necessarily lead to a complete solution. However, if we look at the
original graph again, we can find another candidate for the "first good
subgraph," given in Figure 7-87 with its corresponding partial solution.
As we will see, this one works out better for us in the end.





Now applying our strategy again, we see that the leftmost graph in Figure 7-88 
shows the original graph with the edges from the new good subgraph removed, and 
the rightmost graph in the same figure shows the second good subgraph we were
hoping to find to complete the solution.

To solve the puzzle now, stack the cubes in the partial solution shown in Figure 
7-87, and rotate cube 1 (keeping the front and back faces fixed) until the red and 
green faces are to the left and right, respectively. Repeat this for each of the other 
cubes using the good subgraph in Figure 7-88 to determine which faces to put to the 
left and right. This gives the full solution to the puzzle shown in Figure 7-89. 

In hindsight, there was no need to even distinguish the two subgraphs. A com- 
plete solution corresponds simply to two partial solutions that can be fit together. In 
the language of graphs, we can state the following. 





Theorem 1 An Instant Insanity puzzle has a solution if and only if the graph of
the puzzle contains two good subgraphs that do not share any edges.


Since the graph of such a puzzle has only 4 nodes and 12 edges in the
first place, this condition has a huge advantage over "trial and error"
for solving the puzzle. It is also much easier to give an explanation in
terms of graphs when an Instant Insanity puzzle has no solution. This
additional point is investigated in the exercises at the end of this
section.
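The condition in Theorem 1 can be checked by a short exhaustive search.
The sketch below is only an illustration under stated assumptions: each
cube is given as its three pairs of opposite face colors, and the cube
data in HYPOTHETICAL_CUBES is invented for the example (it is not the
coloring of Figure 7-83, which is not fully recoverable here). The
function names are likewise hypothetical.

```python
from collections import Counter
from itertools import product

COLORS = "RGBW"

# Invented cube data: cube i is a list of its three opposite-face color pairs.
HYPOTHETICAL_CUBES = [
    [("R", "G"), ("B", "W"), ("R", "R")],
    [("G", "B"), ("W", "R"), ("G", "G")],
    [("B", "W"), ("R", "G"), ("B", "B")],
    [("W", "R"), ("G", "B"), ("W", "W")],
]


def good_subgraphs(cubes):
    """Yield one pair index per cube such that every color has degree 2."""
    for choice in product(range(3), repeat=len(cubes)):
        degree = Counter()
        for cube, k in zip(cubes, choice):
            a, b = cube[k]
            degree[a] += 1
            degree[b] += 1            # a loop (a == b) adds 2 to that node
        if all(degree[c] == 2 for c in COLORS):
            yield choice


def disjoint_good_pair(cubes):
    """Return two good subgraphs sharing no edge, or None if none exist."""
    goods = list(good_subgraphs(cubes))
    for first in goods:
        for second in goods:
            # Edge-disjoint: each cube contributes a different pair to each.
            if all(f != s for f, s in zip(first, second)):
                return first, second
    return None


print(disjoint_good_pair(HYPOTHETICAL_CUBES))
```

With three pairs per cube there are only 3^4 = 81 candidate subgraphs,
so checking all of them is far cheaper than trying physical orientations
of the cubes.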







Graphs in Games 


We now turn our attention to analyzing strategies for some two-player
games. The basic premise is that each player removes stones from the
playing board until none are left. Since each move must remove
something, the game will eventually end. We will consider different
rules for how the stones are to be removed and analyze the resulting
play based on the starting configuration.

These games are just special cases of a class of games called finitely
progressive games that are characterized by the finiteness of moves at
any time and a guarantee that the game must end in a win for one of the
players. It is a fact (that we will not prove) that all such games have
a winning strategy for one of the players. That is, for one of the
players there will be a system with which that player will always win,
no matter how the other player plays. Therefore, the best thing to do is
to be the player with the winning strategy and play for lots and lots of
money.





Example 3 Two people play a game in which there is a single pile of 10
stones and each player may remove one or two stones on his or her move.
Determine a winning strategy for one of the players.





SOLUTION We begin by creating a directed graph to represent the play of
this game. In the graph of Figure 7-90, node i represents the game board
with i stones on it, and an edge from i to j (with i > j) means that
with one move the game board can change from i stones to j stones. For
this game, we have an edge from i to j if i − j = 1 or 2.

We can picture the game as being played on the graph itself with two
players taking turns moving to adjacent nodes in order to build a walk
from node 10 to node 0. We will call a node "good" if moving to that
node results in an inevitable win for the player, and "bad" if moving to
that node results in a loss.

It is useful in this type of game to do an "end-game" analysis. That is,
we begin by focusing on what will happen in the last few moves. To win
the game, a player (call her Sue) must move to node 0. For that to
happen, her opponent must move to either node 2 or node 1. Hence, the
winning node 0 is a "good" node, and nodes 1 and 2 are "bad" nodes.

Now, how can Sue force her opponent to move to either node 2 or node 
1? She must move to node 3, since from node 3, the only moves lead to the bad 
nodes 1 and 2. Hence, node 3 is also a good node. And since Sue can guarantee 
that she can reach node 3 by forcing her opponent to first move to either node 
4 or node 5, we conclude that nodes 4 and 5 are bad nodes. 

Repeating this argument, we can determine that nodes 6 and 9 are good
nodes, whereas nodes 7, 8, and 10 are bad. Since the game starts at the
bad node 10, Sue wants to be the player to move first, and she begins
the game by removing one stone, thus moving to the good node 9. From
there, no matter what her opponent does he will be at a bad node, and
Sue can move to the good node 6. On her next move she can reach the good
node 3 and then finally the winning node 0.

Note that if the game begins with only nine stones, then it would be the
player to go second with the winning strategy for precisely the same
reason!

Figure 7-90 The graph of the simple stone removal game.
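The end-game analysis of Example 3 can be replayed mechanically by
working upward from node 0. The sketch below is an illustration only;
the function name classify_nodes and its max_take parameter are
assumptions made for this example.

```python
def classify_nodes(stones, max_take=2):
    """Map each node i to True ("good") or False ("bad").

    A node is good when moving *to* it guarantees a win for the mover,
    which happens exactly when every move out of it leads to a bad node.
    """
    good = {0: True}                       # reaching node 0 wins outright
    for i in range(1, stones + 1):
        reachable = [i - t for t in range(1, max_take + 1) if i - t >= 0]
        # If the opponent can reach a good node from i, then i is bad.
        good[i] = not any(good[j] for j in reachable)
    return good


labels = classify_nodes(10)
print("good nodes:", [i for i, g in labels.items() if g])
print("first player wins from 10:", not labels[10])
```

Because the starting node 10 is bad, the player who moves first can step
to a good node and keep doing so; starting from 9 the roles reverse.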


For any two-player game, we can repeat this process on its underlying
graph structure.





Definition Given any game with a finite number of states, the graph of
the game is a directed graph with nodes representing each possible state
of the game board and with an edge from node i to node j if it is
possible for the board to change from state i to state j as the result
of one move by either player. In the graph of a finitely progressive
game, there are no cycles and every ending state (node) represents a win
for one of the players.














In terms of this graph, we may always think of the play of a game as
simply constructing a walk from the starting configuration to the
winning configuration where the players alternate choosing the "next"
node to visit. In the previous example, note that the key to either
player's winning is to move to one of the "good" nodes in the set
K = {9, 6, 3, 0} as shown in Figure 7-91. Once a player moves to one of
these nodes, he or she can apply the outlined strategy to eventually win
the game. Finding a set of nodes like this will be the key to winning
any game. We will call such a set of nodes the kernel of the game.








Definition In the directed graph of any game, a set K of nodes with the
following properties is called the kernel of the graph:

(i) The winning node (in this case node 0) is in K.
(ii) From any node not in K, there is an edge to a node in K.
(iii) There are no edges from nodes in K to other nodes in K.

We will often refer to the kernel as the set of good nodes in the graph.











Figure 7-91 Good positions in the simple stone removal game.

Figure 7-92 Kernel of the game in Example 4.

The three properties of a kernel actually dictate an algorithm for
finding the kernel from the graph. The following process will result in
all the good nodes circled and all the bad nodes crossed out.





1. Start with the winning node circled. 


2. Repeat the following two steps as long as there are nodes that have not been 
crossed out or circled: 


(a) Cross out every node that points to a circled node. 
(b) Circle every node that only points to crossed-out nodes. 
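The circle-and-cross-out steps above translate almost line for line into
code. The following Python sketch assumes the graph is supplied as a
dictionary mapping each node to the list of nodes it points to, and that
the winning node is the only node with no outgoing edges; find_kernel is
a hypothetical name. Since the graph of Figure 7-92 is not reproduced
here, the usage example applies it to a 20-stone pile with one, two, or
three stones removed per move.

```python
def find_kernel(graph, winning):
    """Return the set of circled (good) nodes of an acyclic game graph."""
    circled = {winning}                    # step 1
    crossed = set()
    while len(circled) + len(crossed) < len(graph):   # step 2
        unmarked = [v for v in graph if v not in circled and v not in crossed]
        # (a) cross out every node that points to a circled node
        newly_crossed = {v for v in unmarked
                         if any(w in circled for w in graph[v])}
        crossed |= newly_crossed
        # (b) circle every node that points only to crossed-out nodes
        newly_circled = {v for v in unmarked
                         if v not in crossed
                         and all(w in crossed for w in graph[v])}
        circled |= newly_circled
        if not newly_crossed and not newly_circled:
            raise ValueError("no progress: is the graph really acyclic?")
    return circled


# Node i points to i-1, i-2, i-3 (when those are nonnegative).
pile_graph = {i: [i - t for t in (1, 2, 3) if i - t >= 0] for i in range(21)}
print(sorted(find_kernel(pile_graph, winning=0)))
```

The loop always makes progress on an acyclic graph with a single ending
node, because some unmarked node has all of its successors already
marked.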










Example 4 Figure 7-92 shows the graph of a game in which the winner is
the first to reach node d. Find the kernel of this game using the steps
above. Which player has a winning strategy?

SOLUTION We obtain good and bad states in this order: (1) d is good;
(2) c, g, and j are bad since they have edges to d; (3) f and i are good
since their only edges lead to bad states; (4) a, b, and e are bad since
they have edges to f or i; (5) h is good since its only edges lead to
bad states. The kernel is {d, f, h, i}.


Practice Problem 2 Suppose two players play a game with a pile of 20
stones, and use the rule that on each move, either one, two, or three
stones may be removed. Find the kernel of the graph for this game. Which
player has a winning strategy?


The following theorem can be proved using an induction argument that is
similar to the algorithm above. It is proven in precisely this setting
in Alan Tucker's book on applied combinatorics (see [47]) for those who
are interested in reading more about it.


Theorem 2 The graph of any finitely progressive game has a unique kernel. 


Combining this fact with the strategy we outlined above gives us the following
solution to who has the winning strategy in any finitely progressive game.


Corollary 3 Given a game with kernel K, if the starting configuration of the game 
is in K, then Player 2 has a winning strategy. If the starting configuration is not in K, 
then Player 1 has a winning strategy. 


More Stone Removal Games 


We will continue with the theme of stone removal games because they are among 
the easiest games to learn how to play. In each of these games, there are two players, 
and the person to remove the last stone wins the game. 


Example 5 We begin with two piles of stones, with four stones in the
first pile and three in the second. On each turn, a player chooses a
pile and removes any number of stones from it. Find the kernel of this
game.


SOLUTION Even in this small example, the graph of this game is too
complicated to draw easily. In Figure 7-93, we label a node (a, b) for
the state of the game when there are a stones in the first pile and b
stones in the second pile, but we do not draw any of the edges. Instead,
we will just have to mentally visualize that the edges lead from each
node to all the other nodes to the right on the same row, and to all the
nodes below it in the same column.

    (4,3)  (3,3)  (2,3)  (1,3)  (0,3)
    (4,2)  (3,2)  (2,2)  (1,2)  (0,2)
    (4,1)  (3,1)  (2,1)  (1,1)  (0,1)
    (4,0)  (3,0)  (2,0)  (1,0)  (0,0)

Figure 7-93 Kernel for Example 5.

Since the winning node is labeled (0, 0), we can use the algorithm we
developed to separate the good nodes from the bad ones in the following
order:





1. The winning node (0, 0) is circled.

2. All nodes with edges leading to (0, 0) are crossed out. Specifically,
this means (0, 3), (0, 2), (0, 1), (4, 0), (3, 0), (2, 0), and (1, 0)
are bad positions.

3. There is only one node that has all its edges leading only to
crossed-out nodes. We circle the node (1, 1), which represents two piles
of one stone each.

4. All nodes with edges leading to (1, 1) are crossed out. Specifically,
positions (1, 3), (1, 2), (4, 1), (3, 1), and (2, 1) are all bad.

5. Continuing this process, we will end up with positions (2, 2) and
(3, 3) also circled and all other positions crossed out.

Therefore, the kernel for this small game is
K = {(0, 0), (1, 1), (2, 2), (3, 3)}.


This game might seem too small to be of interest, but it is not hard to
generalize the kernel to be correct for a game with the same rules but
more stones to start with. In making this generalization, we will also
take the opportunity to give a proof that a set of positions satisfies
the three defining properties of a kernel.





Proposition 4 In the stone removal game of Example 5, if the game starts
with piles of size m ≥ n > 0, then the kernel is

    K = {(a, b) ∈ {0, 1, 2, ..., m} × {0, 1, 2, ..., n} : a = b}

That is, a good state is one in which the two piles are of equal size.


PROOF We will check that K satisfies properties (i), (ii), and (iii) in
the definition of "kernel." We will do this by thinking of the game
itself.

(i) The winning position is when there are no stones in either pile.
This node is labeled (0, 0) in the graph, and this certainly satisfies
the definition of K.

(ii) Let a position in the game be given that is not in K. By the
definition of K, this position must consist of two piles of unequal
sizes, say, x and y with x > y ≥ 0. A single move of removing x − y
stones from the larger pile will result in game position (y, y), which
is in K.

(iii) Suppose we have a game position that is in K which is not the
winning configuration. That is, the game position is labeled (k, k) for
some k > 0. Any move from this position has to change the number of
stones in one and only one of the piles, resulting in a game position
not in K.

Therefore, K is the kernel of this game.


This means that the player who can leave two equal piles of stones at
the end of his or her turn will win the game. Therefore, if the game
starts with two piles of equal size, then Player 2 can apply the winning
strategy, and if the game starts with two piles of unequal size, then
Player 1 can apply the winning strategy.
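The proof of Proposition 4 checks the three kernel properties one at a
time, and the same check can be run by brute force on a small board. The
sketch below is illustrative only: is_kernel and two_pile_moves are
hypothetical helper names, and the board size (4, 3) is taken from
Example 5.

```python
def two_pile_moves(state):
    """All positions reachable by removing stones from exactly one pile."""
    a, b = state
    for k in range(a):
        yield (k, b)
    for k in range(b):
        yield (a, k)


def is_kernel(candidate, nodes, moves, winning):
    """Check properties (i), (ii), and (iii) of a kernel directly."""
    K = set(candidate)
    has_winning = winning in K                                   # (i)
    can_enter = all(any(n in K for n in moves(s))                # (ii)
                    for s in nodes if s not in K)
    cannot_stay = all(n not in K for s in K for n in moves(s))   # (iii)
    return has_winning and can_enter and cannot_stay


m, n = 4, 3
nodes = [(a, b) for a in range(m + 1) for b in range(n + 1)]
equal_piles = {(a, b) for (a, b) in nodes if a == b}
print(is_kernel(equal_piles, nodes, two_pile_moves, winning=(0, 0)))
```

A check like this does not replace the proof, which covers every m and
n, but it is a quick way to test a conjectured kernel before trying to
prove it.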











Practice Problem 3 Suppose a game starts with two piles of stones, one with four 
stones and the other with three stones, and each player can remove one or two stones 
from a single pile on each turn. Use the algorithm on the picture in Figure 7-94 to find 
the kernel of this game. 


The Game of Nim 


Example 5 is the simplest version of a classic game called Nim, which
made its mathematical debut in a 1902 article (see [7]) by Charles
Bouton. In Nim, there are many piles of stones of various sizes, and
players alternate removing stones from the board. The only rule for
removal is that on a single move, a player cannot remove stones from
more than one pile. There is no limit on the number; indeed, a whole
pile could be removed in one move. As before, the winner is the player
to remove the last stone.

The problem with Nim is that the graph of the game can be quite large.
For example, if the game starts with piles of size 5, 6, and 8, there
are 6 · 7 · 9 = 378 game board states possible, so the graph would have
378 nodes. Although we will not draw these large graphs, we will think
about them as we try to uncover the strategy that we know must exist
from the previous discussion.

    (4,3)  (3,3)  (2,3)  (1,3)  (0,3)
    (4,2)  (3,2)  (2,2)  (1,2)  (0,2)
    (4,1)  (3,1)  (2,1)  (1,1)  (0,1)
    (4,0)  (3,0)  (2,0)  (1,0)  (0,0)

Figure 7-94 Kernel for Practice Problem 3.

For the general game of Nim, we will not attempt to "discover" the
kernel as we did in the previous examples. We will just describe the
kernel first, and then check to make sure it satisfies the three
defining properties of a kernel. We will also see that it is a natural
generalization of the kernel of the two-pile Nim game in Example 5. In
order to describe the kernel of the general game, we will first need
some facts about binary notation from Section 2.6.

    5 =          0 1 0 1
    6 =          0 1 1 0
    8 =          1 0 0 0
    Direct sum   1 0 1 1

Table 7-10 Direct Sum of 5, 6, and 8

Recall that every nonnegative integer has a unique representation in
base two. For example, 23 = 2^4 + 2^2 + 2^1 + 2^0, so the decimal
numeral 23 can be represented as 10111 in base two. Note that if we have
two base two numerals, say, 10111 and 101, we can consider them both to
be the same length by rewriting the second one as 00101. In general, if
we have any number of base two numbers, we can consider them all having
the same number of digits by adding leading 0's to any that are too
short.

Now given a set of base two numbers all of the same length, we can
construct a table using one base two number for each row. For example,
if the numbers are {5, 6, 8}, they are represented in base two by
{0101, 0110, 1000} and put into the following tabular form:

    5 = 0101
    6 = 0110
    8 = 1000


We complete the table by adding one more row consisting of a four-digit
base two number constructed in the following way: The first digit is a 1
if there are an odd number of 1's above it (i.e., among the first digits
in the other three rows) and a 0 otherwise; the second, third, and
fourth digits are constructed using this same rule applied to the number
of 1's above each of them, respectively. In the above example, we would
add the four digits 1 0 1 1. We will call the last row constructed in
this way the direct sum of the previous rows. In fact, we will call this
row the direct sum of the original numbers themselves. In our example,
we have the binary number 1011 as the direct sum of 5, 6, and 8, as
shown in Table 7-10.
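The column-parity rule for the direct sum is easy to express in code.
The sketch below follows the table construction literally (pad to equal
length, then count the 1's in each column); the function name direct_sum
is an assumption for this illustration. The final line also compares the
result with Python's bitwise exclusive-or operator, which computes the
same column-by-column parity.

```python
def direct_sum(numbers):
    """Return the direct sum of nonnegative integers as a binary string."""
    width = max(n.bit_length() for n in numbers) or 1
    rows = [format(n, f"0{width}b") for n in numbers]   # pad with leading 0's
    digits = []
    for column in range(width):
        ones_above = sum(row[column] == "1" for row in rows)
        digits.append("1" if ones_above % 2 == 1 else "0")
    return "".join(digits)


print(direct_sum([5, 6, 8]))
print(int(direct_sum([5, 6, 8]), 2) == 5 ^ 6 ^ 8)
```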











Practice Problem 4 What is the direct sum of the numbers 3, 12, and 13? 





What does all this have to do with the game? We can use the idea of direct sum 
to describe what the kernel of the general Nim game looks like. 





Definition Given a game position with k piles with s1, s2, ..., sk
stones, respectively, find the direct sum of s1, s2, ..., sk. Call this
the binary number of the game position.














Theorem 5 For the game of Nim with k piles of stones, the set

    K = {game positions that have a binary number of 000...0}

is the kernel of the game.

PROOF We will check that K satisfies the defining properties (i), (ii),
and (iii) of a kernel.







(i) The winning position in the game consists of all piles of size 0,
and the direct sum of any number of 0's is 000...0.

(ii) Let a game position not in K be given. That is, the binary number
of the game position contains at least one 1. We must find a move in the
game that results in a position which is in K. To do this, locate the
first 1 (from left to right) in the binary number of the configuration;
let's say it is in column j. Now there must be a row, let's say it's row
i, in which a 1 also occurs in column j. Remove the appropriate number
of stones from pile i so that every binary digit in row i that is above
a 1 in the bottom row is changed. (So a 1 is changed to a 0 and a 0 is
changed to a 1.) This is a legal move because the leftmost digit changed
in row i goes from 1 to 0, so the new pile is smaller than the old one.
Now the bottom row will be 000...0, meaning this new game position is
in K.

(iii) Let a game position in K other than the winning position be given.
By the definition of K, the binary number of this position is 000...0,
so any change in a single row will have to introduce at least one 1 in
the bottom row (in the same positions as the digit changes in the
changed row), leading to a game position not in K.








It is rather difficult to put the winning strategy into words, so let us look at a
particular play of the game instead to see the strategy in action.


Example 6 Analyze the game that starts with three piles with five, four,
and two stones, respectively.

SOLUTION The corresponding table of binary numbers with direct sum is
shown in Table 7-11. So the initial position of the game is not in the
kernel. This means that Player 1 will have a winning strategy. Player 1
looks for the first 1 in the bottom row and finds it in the second
column. He looks above to find a 1 in the second column of one of the
rows and finds it in row 3. By removing one stone from the third pile,
the 010 in the third row becomes a 001 (i.e., the second and third
digits are changed), and so the direct sum will now look like it does in
Table 7-12. This configuration is in the kernel, so Player 1 is well on
his way to winning the game.

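Step (ii) of the proof of Theorem 5 is constructive, so it can be turned
into a routine that finds the move used in Example 6. The sketch below
is one way to do it, under the assumption that the direct sum is
computed with bitwise exclusive-or (which matches the column-parity
rule); winning_move is a hypothetical name.

```python
from functools import reduce
from operator import xor


def winning_move(piles):
    """Return (pile index, new pile size) for a move into the kernel.

    Returns None when the position is already in the kernel, in which
    case every available move leaves the kernel.
    """
    total = reduce(xor, piles, 0)          # the binary number of the position
    if total == 0:
        return None
    leftmost_one = 1 << (total.bit_length() - 1)
    for index, size in enumerate(piles):
        if size & leftmost_one:            # this row has a 1 in that column
            # Flip every digit of this row that sits above a 1 in the sum.
            return index, size ^ total


print(winning_move([5, 4, 2]))
```

For the piles 5, 4, and 2 this selects the third pile (index 2) and
reduces it to one stone, which is the move described in the solution.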




Practice Problem 5 What is the correct first move for Player 1 in a Nim
game that starts with piles of size 3, 12, and 13?


    1st pile (5)   1 0 1
    2nd pile (4)   1 0 0
    3rd pile (2)   0 1 0
    Direct sum     0 1 1

Table 7-11 Direct Sum in Example 6

    1st pile (5)   1 0 1
    2nd pile (4)   1 0 0
    3rd pile (1)   0 0 1
    Direct sum     0 0 0

Table 7-12 Direct Sum After the Move

Solutions to Practice Problems 


1 Referring to the graph in Figure 7-82,

• The solution corresponding to the walk

    (WCTG, Ø) → (WC, TG) → (WCT, G) → (W, CTG)
    → (WTG, C) → (G, WCT) → (TG, WC)
    → (Ø, WCTG)

requires an additional $25 animal-crossing fee.

• The solution corresponding to the walk

    (WCTG, Ø) → (WC, TG) → (WCT, G) → (C, WTG)
    → (CTG, W) → (G, WCT) → (TG, WC)
    → (Ø, WCTG)

requires an additional $20 animal-crossing fee.

Hence, the second solution is cheaper.

2 The kernel states are determined by this analysis. The winning state 0
is in the kernel. States 1, 2, and 3 have edges leading to 0, so they
are bad. From state 4 the only edges lead to bad states, so it is good
and goes into the kernel. If we continue in this fashion, the kernel is

    {0, 4, 8, 12, 16, 20}

Since the game begins in a kernel state (with 20 stones), Player 2 will
be able to successfully move to each kernel state and eventually win the
game.

3 Referring to the picture in the problem: 





Exercises for Section 7.5 


1. Two friends have 2 gallons (8 quarts) of water in a pail.
They also have two (empty) jars, one holding 5 quarts
and the other 3. Using just these measuring devices how
can they split the water so that 4 quarts are in the larger
jar and 4 quarts remain in the pail? Produce a complete
graph model for this puzzle similar to the one in the
wolf, goat, and cabbage puzzle, and find all solutions to
the water puzzle in terms of the properties of the graph.

2. Suppose the pail in the previous problem holds 12 quarts
of water, the two jars hold 7 quarts and 5 quarts, and
the goal is still to split the water evenly with 6 quarts in
both the pail and the larger jar. Produce a complete








the Move. 


(a) The winning node (0, 0) is circled.

(b) Nodes (1, 0), (2, 0), (0, 1), (0, 2) are crossed out since
they all would have edges pointing to (0, 0).

(c) Nodes (3, 0), (0, 3), and (1, 1) are circled since they
only lead to crossed-out positions.

(d) Nodes (4, 0), (3, 1), (3, 2), (2, 3), (2, 1), (1, 3), (1, 2)
are crossed out since each leads to some circled
node.

(e) Nodes (4, 1), (3, 3), and (2, 2) are circled since each
leads only to crossed-out nodes.

(f) Nodes (4, 2) and (4, 3) are crossed out.

Thus, K = {(0, 0), (3, 0), (0, 3), (1, 1), (4, 1), (2, 2), (3, 3)}
is the kernel.

4 In binary, 3 = 11, 12 = 1100, and 13 = 1101, so the direct
sum computation is shown in Table 7-13.

5 Since the direct sum is 0010, the correct first move for
Player 1 is to remove two stones from the first pile, taking
it from three stones to one stone.




















3             0 0 1 1
12            1 1 0 0
13            1 1 0 1
Direct sum    0 0 1 0

Table 7-13 Direct Sum for Practice Problem 4
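
The direct sum in Table 7-13 adds the binary digits column by column without carrying, which is the same as a bitwise exclusive-or of the pile sizes. As a minimal sketch under that reading, the Python below computes the direct sum and looks for a move that leaves a direct sum of 0. The function names are hypothetical.

```python
from functools import reduce
from operator import xor


def direct_sum(piles):
    """Column-by-column binary sum without carrying (bitwise XOR)."""
    return reduce(xor, piles, 0)


def winning_move(piles):
    """Return (pile_index, new_size) leaving direct sum 0, or None."""
    total = direct_sum(piles)
    for index, size in enumerate(piles):
        target = size ^ total  # size this pile must shrink to
        if target < size:
            return index, target
    return None  # direct sum is already 0: no such move exists


piles = [3, 12, 13]
print(format(direct_sum(piles), "04b"))  # 0010
print(winning_move(piles))               # (0, 1)
```

The result `(0, 1)` says to reduce the first pile (index 0) to one stone, matching the solution to Practice Problem 5.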


graph model for this puzzle, and find all solutions to the
water puzzle in terms of the properties of the graph.

3. Suppose the pail in the previous problem holds 10 quarts
of water, the two jars hold 6 quarts and 4 quarts, and
the goal is still to split the water evenly with 5 quarts
in both the pail and the larger jar. Produce a complete
graph model for this puzzle, and explain in terms of the
properties of the graph why it has no solution.

4. Puzzles like the wolf, goat, and cabbage puzzle exist
in many cultures (see [2]). A slightly different version
comes from the Kabylie region of Algeria. This puzzle
involves a man traveling with a jackal, a goat, and a

580 Chapter 7 / Graphs and Trees 


[Figure 7-95. Puzzle for Problem 5.]


bundle of hay, but in this version, the man can take up to
two of these things with him on each crossing. Describe
the complete graph model so that you can show all the
solutions to this puzzle in terms of your graph. Is there
a solution that seems to be better than others? Explain
why.

5. Draw the (unfolded) cubes for an Instant Insanity™
puzzle with the graph in Figure 7-95.

6. Give an example of an Instant Insanity™ puzzle that
does not even have one partial solution. You can draw
the cubes or just the graph model.

7. Solve each of the Instant Insanity™ puzzles shown in
Figure 7-96 or explain in terms of the graph model why
no solution is possible.

8. Suppose two players play a game with two piles of stones
(initially with seven stones in each pile) and the rule
that on a given turn it is permissible to (a) choose a pile
and remove any number of stones from it, or (b) remove
exactly one stone from each pile. As usual, the person
to take the last stone wins. Find the kernel of this game,




















[Figure 7-96 Two puzzles for Problem 7.]





and decide which player has a winning strategy in this game.

9. Suppose two players play a game with two piles of stones
(initially with five stones in one pile and seven stones
in the other) and the rule that on a given turn it is
permissible to (a) choose a pile and remove any number
of stones from it, or (b) remove an equal number of
stones from each pile. As usual, the person to take the
last stone wins. Find the kernel of this game, and decide
which player has a winning strategy in this game.

10. Suppose two players play a game with two piles of stones
(initially with seven stones in each pile) and the rule that
on a given turn it is permissible to remove up to three
stones total on each turn, but at least one stone must
be removed from each pile if possible. Find the kernel,
and decide which player has a winning strategy in this
game.

11. In [27], Martin Gardner presents a game played on a
chessboard with a queen. The queen is placed on the
chessboard anywhere in the topmost row or leftmost
column. Two players alternate moving the queen legally
(i.e., horizontally, vertically, or diagonally) but never to
the left or up, until someone puts the queen on the bottom
right square. The player who does this is the winner.
(a) Which player has a winning strategy? (Be specific.
It might depend on the initial placement of the
queen.) Find a “stone removal” game that is
equivalent to this game.

(b) What if the same game is played but with a piece
that has the combined moving abilities of a queen
and a knight, again restricted so that leftward or
upward play is illegal?

12. This game is played on a chessboard with a single rook
(which can move any number of spaces horizontally or
vertically). The rook starts on the top row of the
chessboard, and players alternate legally moving the rook
only down or to the right. The winner is the player who
moves the rook to the bottom rightmost square. Find
the kernel of this game, and decide which player has a
winning strategy for each possible starting position (in
the top row) for the rook.

13. You are playing Nim and the current game position
consists of three piles with 3, 4, and 7 stones, respectively.
It is your turn. What is the correct move?

14. You are playing Nim and the current game position
consists of three piles with 13, 44, and 17 stones, respectively.
It is your turn. What is the correct move?

15. Suppose your little brother challenges you to a game
of Nim using the 20 pennies from his piggy bank. How
can you divide these into three piles at the outset of the
game so that you, as Player 2, have a winning strategy?





16. Suppose two people play Nim with seven piles of stones
with 1, 2, 3, 4, 5, 6, and 7 stones in the respective piles.
What is the correct first move for Player 1, or will
she eventually lose no matter what she does (assuming
Player 2 has read this section)?

17. What if in the above scenario, there is an eighth pile
containing 11 stones?

18. Explain why the kernel for the two-pile Nim game
outlined in Example 5 is a consequence of the general
description given in Theorem 5.

19. Consider a game that begins with three piles with two,
five, and seven stones, and has the rule that on any turn
a player may remove up to two stones from any one pile.
Find the kernel of this game, and decide which player
has the winning strategy.

20. Play this game with a friend. Start with the 4 x 4 grid
shown in Figure 7-97. Players alternate putting X's in
the squares with the limitation that X's must be placed
in only one row or column on each move, but any number
may be inserted. The person to X the last available
space wins. What is the strategy for the second player to
win this game? (Hint: Consider the symmetry of the
grid.)


































Figure 7-97 The grid game in Problem 20.


21. Draw a picture of a clock face and play the following
game with a friend. On each move, a player can cross
out any number not already crossed out or she can cross
out two numbers if they are consecutive numbers (like
5 and 6). The player who crosses out the last number
wins. Find the kernel of this game, and determine which
player has the winning strategy. (Hint: Consider the
symmetry of the clock face.)

22. Investigate some of the stone removal games we have
studied if the rules are changed so that the player who
removes the last stone loses the game.





7.6 Excursion: Binary Trees





Trees are a special type of graph first discussed formally in Section 7.2. In the current
section, we will study a more specific graph structure called a binary tree. Although
binary trees have properties in common with general trees, their most important
features stem from their additional structure. This section emphasizes that additional
structure and its applications. It is not necessary to have worked through the material
in the earlier section to understand this one.

Even before Section 7.2, we saw “trees” in several parts of this text, primarily
as a visualization or organizational aid. One important type was the decision
tree used for counting and in probability. The first example like this came early in
the text.





[Figure 7-98 Game tree for a “best-of-three” series.]

Example 1 The tree in Figure 7-98 is the game tree for a “best-of-three sets”
tennis match that we first saw in the introductory section of Chapter 1.


Another familiar context for trees is the idea of a family tree. For example,
Figure 7-99 shows a small family tree giving Brenda's descendants for several
generations. An important third example comes from our experience with arithmetic and
calculators. In particular, a tree can represent the way arithmetic calculations are
performed.








[Figure 7-100 Representing simple arithmetic expressions: the tree for 2 + 3 and the tree for 17 − 28.]


[Figure 7-99 Brenda's family tree.]

Example 2 The simple examples in Figure 7-100 illustrate how a tree can represent
an arithmetic calculation. How can these examples be extended to
represent more complicated expressions?

SOLUTION As in Figure 7-100, we place the numbers at the bottom, and
the operation at the top, with edges joining the operation to the two numbers
it applies to. For more complicated expressions, we simply repeat the process.
For example, for the expression (2 + 3) × (17 − 28) we place another node
above these two trees, with the × operation, and with edges to the two smaller
trees that represent the subexpressions. This construction is shown on the left
in Figure 7-101. If one of the operands is a subexpression and the other is a
number, as in (2 + 3) × 7, we simply connect the operation to a tree for the
subexpression and a node containing the number, as shown in the tree on the
right.
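
One way to make this construction concrete is to build the trees as nested data and evaluate them from the leaves up. The sketch below is only an illustration: representing an operation node as a tuple `(symbol, left, right)` and a number as a bare value is an assumption made here, and `*` stands in for the × symbol.

```python
import operator

# Hypothetical representation: a number is a leaf; an operation node is
# a tuple (symbol, left_subtree, right_subtree).
OPERATIONS = {"+": operator.add, "-": operator.sub, "*": operator.mul}


def evaluate(tree):
    """Evaluate an expression tree from the leaves up."""
    if not isinstance(tree, tuple):
        return tree  # a leaf holds a number
    symbol, left, right = tree
    return OPERATIONS[symbol](evaluate(left), evaluate(right))


small_sum = ("+", 2, 3)                       # tree for 2 + 3
small_difference = ("-", 17, 28)              # tree for 17 - 28
product = ("*", small_sum, small_difference)  # (2 + 3) x (17 - 28)
mixed = ("*", small_sum, 7)                   # (2 + 3) x 7

print(evaluate(product))  # -55
print(evaluate(mixed))    # 35
```

The larger trees reuse the smaller ones as subtrees, which mirrors placing a new operation node above two existing trees.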


Practice Problem 1 Write the tree representation of the arithmetic expression
(2 − 8 × 6) − 4.


All these examples share several important properties. As graphs, they are
connected and have no cycles. Unlike many graphs, however, the way in which they are
drawn is important. We could draw the family tree with Brenda at the left (as in the
decision tree), or perhaps on the right, or perhaps even at the bottom, as shown in
Figure 7-102 on the left. However, drawing the family tree like the diagram on the
right would totally destroy the meaning of the tree.


[Figure 7-101. More complicated arithmetic expressions.]














[Figure 7-102 Other ways to draw Brenda's family tree.]


Notice that the tree on the left in Figure 7-102 looks more like what we would
call a tree (or at least a bush) in ordinary life. Much of the terminology used in
connection with trees comes from the context of either family trees or trees in nature.
For example, we say that Brenda is at the root of the tree, and that Tom, Sam, Jason,
Anne, Bob, and Carl are at the leaves. We also say that the nodes labeled Joe, Sam,
and Sue are children of the Brenda node, and that Mary and Carl are siblings. This
is obviously based on the family tree analogy.

In all our examples there is a special node, called the root of the tree, that
constitutes a “starting point” for applications. Among these rooted trees is a type
with particular importance known as a binary tree. Although it would be possible to
give a formal definition of binary tree based on graph theory, it is customary instead to
give a recursive definition, because so many of the important applications are based
on the recursive nature of the tree.














Basic Definitions 


The terminology used in connection with binary trees is extensive and (just as with
graphs) not universally agreed on. Since this is not a computer science text, we will not
try to be exhaustive in our formal definitions. We will emphasize the mathematical
properties of trees and restrict ourselves to definitions that will be involved in these
mathematical discussions. Other terms that arise will be addressed more informally.








Definition 


1. A binary tree T is a structure that is either:
(a) empty, or
(b) consists of one or more nodes. In this case, one of those nodes is
designated as the root node, and the remaining nodes are separated into
two disjoint sets that are in turn (possibly empty) binary trees. One of
these subtrees is designated the left subtree of T and the other the right
subtree of T.
2. A nonempty binary tree that has empty left and right subtrees is called a
leaf.














[Figure 7-103 The tree for the expression (2 + 3) × 7.]

[Figure 7-104 The subtrees (left subtree and right subtree) for the tree in Figure 7-103.]

[Figure 7-105 Similar but different binary trees.]


This is a typical recursive definition. There is a “base case”: the tree may be
empty. There is a “recursive case,” where the concept is defined in terms of a smaller
version of itself. The definition might seem circular, but because the subtrees are
always at least one node smaller than the original tree, eventually you reach the base
case.








Example 3 Show that the tree for the expression (2 + 3) × 7 fits the definition
of binary tree.

SOLUTION The tree for this expression is shown in Figure 7-103. The root
node is the node labeled with the × symbol. The remaining nodes are divided
into two disjoint sets: the left subtree $T_L$ containing the nodes +, 2, and 3, and
the right subtree $T_R$ containing the node 7. We show these subtrees in Figure
7-104.

We must now show that each of these is in turn a binary tree. The right
subtree is a leaf since its root is 7 and both its subtrees are empty. The left subtree
has root +, left subtree 2, and right subtree 3. Finally, both 2 and 3 are leaves
since each of their subtrees is empty. Notice that in an expression tree such as
this, the numbers are at leaf nodes and the operations are at nonleaf nodes.











In the formal definition of a binary tree, nodes are mentioned, but there is no
mention of edges. As you can see in the examples, when we draw a binary tree as
a graph, we place an edge from each node to its children (i.e., to the roots of that
node's subtrees). In the preceding example, there is an edge from the root node × to
the nodes + and 7. In the left subtree, there is an edge from the subtree's root node
+ to the nodes 2 and 3.

A subtlety in the definition of binary trees is the fact that the left and right
subtrees are distinguished as part of the definition of the structure of the tree. As an
example, consider the binary trees shown in Figure 7-105. They appear very similar,
but they are, in fact, different. The tree $T_1$ has a root a with an empty left subtree
and a nonempty right subtree, while the tree $T_2$ has a root a with a nonempty left
subtree and an empty right subtree.





Practice Problem 2 


(a) For the tree you drew in Practice Problem 1, list the root and sketch its two subtrees.
Then list all the leaves of the left subtree, list all the leaves of the right subtree, and
list all the leaves for the entire tree. Finally, list all the nonleaves (sometimes called
internal nodes) for the entire tree.

(b) The family tree shown in Figure 7-99 is not a binary tree since some nodes have too
many children, but we can still make sense of most of the terms we have introduced.
List the root and the leaves. List the children of the Mary node. List the siblings
of the Mary node. What node do you think we would call the parent of the Mary
node?


Levels and Height 


When we draw a binary tree as a graph, with the tree and all subtrees throughout 
the graph having their roots at the top, the tree appears to have “levels.” Informally, 





[Figure 7-106 U.S. Presidents born in Virginia.]


we can identify a node's level based on how far away it is from the root, which we
consider to be at level 1.

Example 4 Consider the tree in Figure 7-106, where the nodes are labeled with
the names of presidents born in Virginia. Starting at the top with level 1, partition the
presidents by level in the tree.

SOLUTION Table 7-14 shows all presidents at each level of the tree.

In order to be able to exploit the notion of “level” in induction proofs a bit later,
it will be helpful to have a recursive definition that meshes with our earlier recursive
definition of a binary tree.








Definition For a node v in a (nonempty) binary tree T, v is either the root
of T, or v is a node in the left or right subtree of T (called the subtree $T_v$).

• If v is the root of T, then the level of v in T is 1.
• If v is not the root of T, then the level of v in T is 1 more than the level of v
in the subtree $T_v$.
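
The recursive definition of level can be followed literally in code. In this sketch a tree is either `None` (empty) or a tuple `(label, left, right)`; that representation, the helper `contains`, and the small sample tree are all assumptions made for illustration, and labels are assumed to be distinct.

```python
def contains(tree, label):
    """True when some node of the (possibly empty) tree carries the label."""
    if tree is None:
        return False
    root, left, right = tree
    return root == label or contains(left, label) or contains(right, label)


def level(tree, label):
    """Level of the labeled node, following the recursive definition.

    Assumes labels are distinct and the label occurs in the tree.
    """
    root, left, right = tree
    if root == label:
        return 1
    # Pick the subtree that contains the node, then add 1 to its level there.
    subtree = left if contains(left, label) else right
    return 1 + level(subtree, label)


# Hypothetical three-node tree: a at the root, b below it, c below b.
tree = ("a", ("b", None, ("c", None, None)), None)
print(level(tree, "a"), level(tree, "b"), level(tree, "c"))  # 1 2 3
```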








Example 5 Verify that this definition yields the anticipated result for the
“Monroe” node in Figure 7-106.

SOLUTION

1. Since Monroe is not the root of the entire Madison tree, we must first
calculate its level in the Washington subtree.







Level 1    Madison
Level 2    Jefferson, Washington
Level 3    Harrison, Monroe, Wilson
Level 4    Tyler
Level 5    Taylor

Table 7-14 Table for Example 4




2. Since Monroe is not the root of the Washington subtree, we must first
calculate its level in the Monroe subtree.

3. In the Monroe subtree, Monroe is the root, so its level there is 1.

4. According to the definition, its level in the Washington subtree is 1 more
than its level in the Monroe subtree, so in the Washington subtree it is at
level 2.

5. According to the definition, its level in the Madison subtree is 1 more than
its level in the Washington subtree, so in the Madison subtree it is at level 3.
This agrees with our earlier answer.


Practice Problem 3 Intuitively, at what level is Taylor in the Washington tree? At 
what level is it in the entire Madison tree? Does this match the formal definition? 


Once we understand this notion of level, it is possible to define the height of 
a tree T as the maximum level of the nodes in T. However, in order to illustrate 
the way these recursive definitions are used, we will define “height” recursively and 
prove the relationship between height and level. 





Definition An empty binary tree has height 0. For a nonempty binary tree 
T, the height of T is 1 more than the larger of the heights of the left and right 
subtrees of T. 
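
As a small illustration of the recursive definition of height, the sketch below computes the height of the tree for (2 + 3) × 7 and, separately, the largest level of any of its nodes. The tuple representation `(label, left, right)` with `None` for the empty tree is an assumption made here for brevity.

```python
def height(tree):
    """Height per the recursive definition; None is the empty tree."""
    if tree is None:
        return 0
    _, left, right = tree
    return 1 + max(height(left), height(right))


def levels(tree, current=1):
    """Yield the level of every node, with the root at level 1."""
    if tree is not None:
        _, left, right = tree
        yield current
        yield from levels(left, current + 1)
        yield from levels(right, current + 1)


# The tree for (2 + 3) x 7 as (label, left, right) tuples.
tree = ("x", ("+", ("2", None, None), ("3", None, None)), ("7", None, None))
print(height(tree))       # 3
print(max(levels(tree)))  # 3
```

For this tree the two numbers agree, which is the relationship between height and level that the text goes on to examine.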











Proposition 1 For all n ≥ 0, in any binary tree T with height n, the level of any
node in T is less than or equal to n.


PROOF (By induction on n.) Let P(n) be the statement “In a binary tree T with
height n, the level of any node in T is less than or equal to n.” The first statement
P(0) states, “In a binary tree T with height 0, the level of any node in T is less
than or equal to 0.” According to the definition of “height,” the only binary tree
of height 0 is an empty binary tree T. Since an empty binary tree has no nodes,
the statement “The level of any node in T is less than or equal to 0” is vacuously
true. Hence, statement P(0) is true.

Now let m ≥ 1 be given such that the statements P(0), ..., P(m − 1) have
been checked, and consider the statement P(m), the next to be checked, which
states, “In a binary tree T with height m, the level of any node in T is less than
or equal to m.” Let a binary tree T of height m be given, and let v be any node in T.
Either v is the root of T, or v is in either the left subtree of T or the right subtree
of T.








Case 1: If v is the root of T, then by the definition of “level,” the level of v in T is
1. Since we know m ≥ 1, we can conclude that “the level of v in T is less than
or equal to m.”

Case 2: If v is not the root of T, then we can let $T_v$ be the subtree of T that contains
v. By the recursive definition of “height,” the height k of $T_v$ is less than or
equal to m − 1. So we know that P(k) has already been checked to be true,
and hence we know that the level of v in $T_v$ is less than or equal to k. Since (by
the recursive definition of “level”) the level of v in T is 1 more than the level
of v in $T_v$, and since the level of v in $T_v$ is less than or equal to k, we conclude
that the level of v in T is less than or equal to k + 1, which in turn is less than
or equal to m.

In either case, we have established that the level of v in T is less than or equal
to m, completing the induction.


This establishes one relationship between height and level. Exercise 14 at the
end of this section explores another. The main point here is that the recursive
definitions make induction proofs about binary trees very straightforward. We end this
discussion with one more example of this.





Proposition 2 For all n ≥ 1, any nonempty binary tree of height n has no more
than $2^{n-1}$ leaves.

PROOF (By induction on n.) When n = 1, the above statement says, “A
nonempty binary tree of height 1 has no more than $2^0$ leaves.” According to the
definition of “height,” a binary tree of height 1 must consist of a root node with
empty left and right subtrees. Hence, the only node in such a binary tree is a leaf,
so the given statement is true when n = 1.

Now let m ≥ 2 be given such that the statement in the proposition has been
checked for n = 1, ..., m − 1, and consider the next statement, “A binary tree
of height m has no more than $2^{m-1}$ leaves.” Let a binary tree T of height m
be given. Since m ≥ 2, we know that T is nonempty, and so we can identify the
left subtree and right subtree of T by $T_L$ and $T_R$, respectively. Let i and j be the
heights of these subtrees with i ≥ j. By the recursive definition of height, we know
that i = m − 1, so both subtrees have smaller heights than T, and hence we know
that the statement of the proposition has already been checked for binary trees
of these heights. This means that there are at most $2^{i-1} + 2^{j-1}$ leaves combined in
the left and right subtrees of T. Since j − 1 ≤ i − 1 = m − 2, it follows that


$2^{i-1} + 2^{j-1} \le 2^{m-2} + 2^{m-2} = 2 \cdot 2^{m-2} = 2^{m-1},$




















so there are at most $2^{m-1}$ leaves in the binary tree T.


Practice Problem 4 What is the maximum number of total nodes in a binary tree 
of height n? 


Searching Lists with Binary Trees 


An important application of binary trees in computer science is the storing, retrieving,
and sorting of data. A binary search tree is one way to organize a list of numbers so
that retrieval is very efficient.








Definition A binary search tree is a binary tree where every nonleaf has
these properties:

• The label on that node is greater than all the labels on the nodes (if any) of
its left subtree.

• The label on that node is less than all the labels on the nodes (if any) of its
right subtree.

[Figure 7-107. The binary search tree for Example 6.]







Example 6 The numbers in the list L = [3, 5, 1, 9, 11] can be stored in the
binary search tree T shown in Figure 7-107.


The advantage of this structure is that it is very efficient to determine if a value 
is in the tree. For example, to search for 8 in the tree, we know that 8 must be either 
the label of the root, or it is in the left subtree if 8 is less than the root, or it is in 
the right subtree if 8 is greater than the root, Based on this idea, we carry out the 
following recursive search algorithm: 





• Is T empty? If so, there is no 8 in the tree.
• If T is not empty, then
    Is 8 the root of T? If so, there is an 8 in the tree.
    If 8 is not the root of T, then
        If 8 is less than the root of T, then search the left subtree of T.
        If 8 is greater than the root of T, then search the right subtree of T.





For the particular tree in Figure 7-107, we compare 8 to 3. Since 8 is greater than
3, we search the right subtree. In that right subtree, we compare 8 to 9. Since 8 is less
than 9, we search 9's left subtree. In that left subtree, we compare 8 to 5. Since 8 is
greater than 5, we search 5's right subtree. But since this right subtree is empty, we
conclude that there is no 8 in the tree.
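
The recursive search just traced can be sketched in Python as follows. A tree is written as `None` (empty) or a tuple `(label, left, right)`, which is a representation chosen here for illustration. The tree shape is inferred from the walk-through (3 at the root, 9 to its right, 5 as the left child of 9); the positions of 1 and 11 follow from the binary search tree properties but are not stated explicitly in the text.

```python
def search(tree, target, comparisons=0):
    """Return (found, comparisons) for a search in a binary search tree."""
    if tree is None:
        return False, comparisons  # empty tree: the target is absent
    label, left, right = tree
    if target == label:
        return True, comparisons + 1
    if target < label:
        return search(left, target, comparisons + 1)
    return search(right, target, comparisons + 1)


# Inferred shape of the tree for the list [3, 5, 1, 9, 11].
tree = (3, (1, None, None), (9, (5, None, None), (11, None, None)))
print(search(tree, 8))   # (False, 3)
print(search(tree, 11))  # (True, 3)
```

The count returned is the number of nodes whose labels were compared with the target, which is three for the unsuccessful search for 8.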

Observe that we made three comparisons, one at each level of the original tree.
By contrast, to search in the list L itself, we would have to make five comparisons,
one at each member of the list. The difference between three comparisons and five
comparisons is not worth worrying about, but if we have a list with 1,000 items
organized into a tree with, say, 13 levels, the difference between 1,000 comparisons
and 13 comparisons becomes quite significant.

In general, to sequentially search for an element in an n-element list requires as 
many as n comparisons, whereas the number of comparisons in searching a binary 
search tree is in the worst case just the height of the tree. Therefore, the relationship 
between the number of nodes in a binary tree and the height of the binary tree 
established in Practice Problem 4 becomes the central issue in using binary trees for 
searching data. 














Proposition 3 If a list of n distinct numbers is stored in a binary search tree T,
then the height of T is at least $\lceil \log_2 n \rceil$.


PROOF Let h be the height of a binary search tree T with nodes labeled with
the n numbers in L. By Practice Problem 4 in Exercise 13, we know that the
total number of nodes in T is at most $2^h$. Since the nodes of T are labeled with
the n elements in L, this means that $n \le 2^h$. By the definition of logarithms, this
is the same thing as saying $h \ge \log_2 n$. Since h is an integer value, it follows that
$h \ge \lceil \log_2 n \rceil$, as desired.

[Figure 7-108 The revised binary search tree T′.]





An interesting feature of this representation is that adding a number to the
binary search tree consists of one additional step following an unsuccessful search
for that number. In the previous example, we realized that 8 was not in the tree when
we reached the empty right subtree of the 5 node. Suppose we now insert 8 into the
tree as the right subtree of that 5 node. The revised tree T′ is shown in Figure 7-108.
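
A minimal sketch of this naive insertion, using the same assumed tuple representation `(label, left, right)` with `None` for the empty tree: the value is placed exactly where an unsuccessful search would end. The function names are illustrative.

```python
def insert(tree, value):
    """Return a new tree with value placed where a failed search ends."""
    if tree is None:
        return (value, None, None)
    label, left, right = tree
    if value < label:
        return (label, insert(left, value), right)
    if value > label:
        return (label, left, insert(right, value))
    return tree  # value is already present


def height(tree):
    """Height per the recursive definition; the empty tree has height 0."""
    return 0 if tree is None else 1 + max(height(tree[1]), height(tree[2]))


tree = (3, (1, None, None), (9, (5, None, None), (11, None, None)))
revised = insert(tree, 8)
print(revised[2][1])                  # (5, None, (8, None, None))
print(height(tree), height(revised))  # 3 4
```

The printed subtree shows 8 as the right child of the 5 node, and the height grows from 3 to 4.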

In this example, the naive insertion strategy increased the height of the tree,
making future searches take longer. In a computer science course, one might study
algorithms for doing insertions in a manner that keeps the height from growing
more than necessary. According to Proposition 3, the height must always be at least
$\lceil \log_2 n \rceil$. The goal of these algorithms is to keep the height as close to the theoretical
bound $\lceil \log_2 n \rceil$ as possible, without causing the insertion algorithm to be excessively
slow. The details of this topic are not addressed in this text.











Sorting Lists 


We will next show how binary trees tell us something of significant theoretical
importance about sorting lists of numbers. It is very easy to capture the essence of ordering
a list of numbers with a tree structure.





Example 7 Given the set of three distinct numbers {a, b, c}, the binary tree in
Figure 7-109 illustrates one strategy for determining their order from smallest to largest
by comparing two numbers at a time.


The order of the questions in the above binary tree could certainly be changed
to yield a different tree structure, but note that there is no way for a binary tree of
height less than 3 to contain the six leaves corresponding to the 3! different possible
orderings of a, b, and c. This is really a simple consequence of Proposition 2.





Theorem 4 Given a list of n distinct numbers, any binary tree with leaves labeled
with the n! possible orderings of these numbers has a height of at least $1 + \lceil \log_2(n!) \rceil$.


PROOF By the contrapositive of Proposition 2, a binary tree with n! leaves
must have height h such that $2^{h-1} \ge n!$. This means that $h - 1 \ge \log_2(n!)$, or
$h \ge 1 + \log_2(n!)$. Since h must be an integer value, this means that, in practice,
$h \ge 1 + \lceil \log_2(n!) \rceil$.
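
To get a feel for the size of this bound, the following short Python sketch evaluates $1 + \lceil \log_2(n!) \rceil$ for a few small n. The function name is hypothetical, and the use of floating-point `log2` is a convenience that is adequate only for small values of n.

```python
from math import ceil, factorial, log2


def sorting_tree_height_bound(n):
    """Lower bound 1 + ceil(log2(n!)) on the height of the tree."""
    return 1 + ceil(log2(factorial(n)))


for n in (3, 4, 5):
    print(n, sorting_tree_height_bound(n))
# 3 4
# 4 6
# 5 8
```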








[Figure 7-109. Tree for sorting a list.]


The significance of the previous result is that it provides a theoretical lower 
bound on the number of comparisons that must be made in an algorithm intended 
to sort a set of data. 


Corollary 5 Any algorithm that uses a binary comparison relation to sort a list
of n ≥ 3 distinct values must use at least $\frac{n}{2} \log_2 n$ comparisons.


PROOF Any such algorithm can be represented by a binary tree with the n!
possible orderings as the leaves. The number of comparisons (in the worst case) is
precisely the height of this tree. By Theorem 4, this height h satisfies the following:

$h \ge \log_2(n!) \ge \log_2(n^{n/2}) = \frac{n}{2} \log_2 n$

Since the second inequality is an algebraic property of “factorial,” we defer the
proof of that inequality to Exercise 19.


There are many algorithms for sorting data that use essentially this number of
comparisons, and so this result tells us that these algorithms are the best possible for
generic sets of data.


Traversing Binary Trees 


A traversal of a binary tree is any algorithm that visits each of the nodes of the tree. 
The term "visits" can mean any number of things. One simple type of visit consists 
of printing the data stored in the node. There are three common traversals known as 
preorder, inorder, and postorder, each defined recursively. Rather than give detailed 
formal algorithms, we will simply use examples to try to understand the differences 
between these traversals. 





Preorder traversal 


In the preorder traversal of a binary tree, we visit the root first, then we visit each 
of the nodes of the left subtree, and finally we visit each of the nodes of the right 
subtree. To visit the nodes of the subtree, we do a preorder traversal of that subtree. 
Hence, the definition is recursive—preorder traversal of a tree is defined in terms of 
preorder traversal of two smaller subtrees. 


Example 8 Give a preorder print of the first two trees in Figure 7-110.

[Figure 7-110 Preorder prints.]








[Figure 7-111 Tree for Example 9.]


SOLUTION For each subtree including the whole tree, we print the root,
then the entire left subtree (if any), then the entire right subtree (if any). For
the first tree, this gives us the straightforward result A, B, C. When we reach
a leaf, we simply print that leaf since both of its subtrees are empty. For the
second tree, it is obvious that the root (M) should come first. Then we print the
W subtree in preorder, first the root (W), then the empty left subtree (nothing),
then the right subtree (T). The final result is M, W, T.





Practice Problem 5 Do a similar analysis to determine the preorder print for the 
third tree (on the far right) in Figure 7-110. 


Example 9 Give the preorder traversal for the tree in Figure 7-111.


SOLUTION The solution consists of the root P, followed by the preorder
traversal for the left subtree, followed by the preorder traversal for the right
subtree. Example 8 tells us that the left subtree prints in the order M, W, T, and
Practice Problem 5 tells us that the right subtree prints in the order R, X, L, Y.
Therefore, the complete print order is P, M, W, T, R, X, L, Y. This illustrates
how the answer for the whole tree is recursively built up from the answers for
the subtrees.
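
The recursive build-up described in this solution can be sketched directly. In the Python below a tree is `None` (empty) or a tuple `(label, left, right)`; the tree shapes are inferred from the print orders quoted in the text rather than read from the figures, so they should be treated as assumptions.

```python
def preorder(tree):
    """Root first, then the left subtree, then the right subtree."""
    if tree is None:
        return []
    label, left, right = tree
    return [label] + preorder(left) + preorder(right)


# Shapes inferred from the prints quoted in the text (assumed).
m_tree = ("M", ("W", None, ("T", None, None)), None)
r_tree = ("R", ("X", ("L", None, None), None), ("Y", None, None))
p_tree = ("P", m_tree, r_tree)

print(preorder(m_tree))  # ['M', 'W', 'T']
print(preorder(p_tree))  # ['P', 'M', 'W', 'T', 'R', 'X', 'L', 'Y']
```

The print for the whole tree is the root followed by the prints of its two subtrees, exactly as the list concatenation in `preorder` expresses.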








Inorder traversal 


For the inorder traversal of a binary tree, we first visit each of the nodes of the left 
subtree, then we visit the root, and finally we visit each of the nodes of the right 
subtree. The subtree nodes are visited by doing an inorder traversal of the subtree.
Notice that this time the root of each subtree is visited in between its left and right
subtrees. For the trees of Example 8, the inorder prints are shown in Figure 7-112.








Practice Problem 6 Use the inorder prints for the trees of Example 8 to give the 
inorder print for the tree of Example 9. 


Postorder traversal 


The postorder traversal follows the same general scheme but visits the root last. We
visit each of the nodes of the left subtree by doing a postorder traversal of the subtree.
Then we visit each of the nodes of the right subtree by doing a postorder traversal
of the subtree. Finally, we visit the root. For the trees of Example 8, the postorder
prints are shown in Figure 7-113.

BAC    WTM    LXRY
Figure 7-112 Inorder prints.

BCA    TWM    LXYR
Figure 7-113 Postorder prints.


Practice Problem 7 Use the postorder prints for the trees of Example 8 to give 
the postorder print for the tree of Example 9. 
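
The three recursive definitions can be sketched in a few lines of Python. This is only an illustration, not part of the text's development: the `Node` class and the function names are hypothetical, and the shape of the tree from Example 9 is inferred from the prints given for Example 8 and Practice Problem 5 (W is the left child of M with T as its right child; X is the left child of R with L as its left child).

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Node:
    """A binary tree node; a missing child is represented by None."""
    label: str
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def preorder(t: Optional[Node]) -> List[str]:
    # Root first, then the left subtree, then the right subtree.
    if t is None:
        return []
    return [t.label] + preorder(t.left) + preorder(t.right)


def inorder(t: Optional[Node]) -> List[str]:
    # Left subtree, then the root, then the right subtree.
    if t is None:
        return []
    return inorder(t.left) + [t.label] + inorder(t.right)


def postorder(t: Optional[Node]) -> List[str]:
    # Left subtree, then the right subtree, then the root last.
    if t is None:
        return []
    return postorder(t.left) + postorder(t.right) + [t.label]


# Assumed shape of the tree in Example 9 (inferred from the printed orders).
left = Node("M", left=Node("W", right=Node("T")))
right = Node("R", left=Node("X", left=Node("L")), right=Node("Y"))
tree = Node("P", left, right)

print(preorder(tree))   # ['P', 'M', 'W', 'T', 'R', 'X', 'L', 'Y']
print(inorder(tree))
print(postorder(tree))
```

Each function mirrors its definition directly: the only difference among the three is where the root's label is placed relative to the two recursive calls.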


Traversals and Expression Trees 


Binary tree traversals are of particular interest for trees that represent arithmetic 
expressions with binary operations. We call these expression trees. In general, it is 
not possible to reconstruct a tree from any one traversal. However, for expression 
trees we can determine the tree from either the preorder traversal or the postorder 
traversal. Expression trees have additional structure not present in all binary trees. 
In particular, every node is either a leaf or has exactly two children, and in the latter
case the node must contain an operation, not a number.

In a way, it is unfortunate that the inorder traversal does not also have this property.
The inorder traversal, with the root in between its two subtrees, would display
the operation in between the two numbers. This, of course, is how we ordinarily write
an expression such as 3 x 4 by hand. However, as the following example illustrates,
the inorder print of an expression tree is ambiguous.





Example 10 In Figure 7-114, the binary tree on the left represents the arithmetic
expression 2 + (3 x 4), and the binary tree on the right represents the arithmetic
expression (2 + 3) x 4. Give the inorder traversals of these two trees.


SOLUTION For the first tree, we print the left subtree (2), then the root (+),
then the right subtree in inorder (3, x, 4). The complete result is 2, +, 3, x, 4.
For the second tree, we print the left subtree (2, +, 3), then the root (x), then
the right subtree (4). The complete result is 2, +, 3, x, 4. Observe that both
trees give the same result.

Figure 7-114 Expression trees for Example 10. (Left: the binary tree for 2 + (3 x 4). Right: the binary tree for (2 + 3) x 4.)





As the example illustrates, the list 2, +, 3, x, 4 does not uniquely identify which
binary tree it came from. The inorder traversal of the binary tree must be augmented
with parentheses and/or precedence rules to correctly identify the expression.
Because of its connection with inorder traversal, arithmetic notation (with parentheses)
like 2 + (3 x 4) or (2 + 3) x 4 is called infix notation.
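
The ambiguity in Example 10 can be checked mechanically. The following sketch is illustrative only; the `Expr` class and the helper names are hypothetical. It builds both expression trees, shows that their plain inorder traversals coincide, and then shows that an inorder print that wraps every operation in parentheses tells them apart.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Expr:
    """An expression-tree node: an operation with two children, or a leaf."""
    value: str
    left: Optional["Expr"] = None
    right: Optional["Expr"] = None


def inorder(t: Optional[Expr]) -> List[str]:
    # Plain inorder traversal: left subtree, root, right subtree.
    if t is None:
        return []
    return inorder(t.left) + [t.value] + inorder(t.right)


def infix(t: Expr) -> str:
    # Inorder traversal augmented with parentheses around every operation.
    if t.left is None and t.right is None:
        return t.value
    return "(" + infix(t.left) + " " + t.value + " " + infix(t.right) + ")"


tree1 = Expr("+", Expr("2"), Expr("x", Expr("3"), Expr("4")))   # 2 + (3 x 4)
tree2 = Expr("x", Expr("+", Expr("2"), Expr("3")), Expr("4"))   # (2 + 3) x 4

print(inorder(tree1) == inorder(tree2))  # True: the plain lists are identical
print(infix(tree1))                      # (2 + (3 x 4))
print(infix(tree2))                      # ((2 + 3) x 4)
```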


Prefix and Postfix Notation 


The previous problem might seem insurmountable since we are all familiar with how
lack of parentheses causes ambiguity in arithmetic expressions. However, it turns out
that the preorder and postorder traversals can be unambiguously associated back to
the original expression tree, as illustrated in the following examples and practice
problems.











Example 11 The preorder traversals of the binary trees in Example 10 are as
follows:

• The preorder traversal of the binary tree representing 2 + (3 x 4) lists the nodes in
order +, 2, x, 3, 4.

• The preorder traversal of the binary tree representing (2 + 3) x 4 lists the nodes in
order x, +, 2, 3, 4.

We use the term prefix notation for this way of representing arithmetic expressions.
Hence, the prefix notation for 2 + (3 x 4) is +, 2, x, 3, 4 and the prefix notation
for (2 + 3) x 4 is x, +, 2, 3, 4.








Practice Problem 8 An arithmetic expression is given in prefix notation 
4 Draw a binary expression tree representing this expression, rewrite 
it in infix notation (with parentheses), and evaluate it. 





Likewise, the postorder traversal uniquely determines the expression tree, as
the following example and practice problem illustrate.

Example 12 The postorder traversals of the binary trees in Example 10 are as
follows:

• The postorder traversal of the binary tree representing 2 + (3 x 4) lists the nodes in
order 2, 3, 4, x, +.

• The postorder traversal of the binary tree representing (2 + 3) x 4 lists the nodes in
order 2, 3, +, 4, x.

Just as with preorder traversal, we give the special name postfix notation to
the notation for an arithmetic expression written as a postorder traversal of the
corresponding expression tree.




Practice Problem 9 An arithmetic expression is given in postfix notation
3, 2, +, 5, 1, -, x. Draw a binary expression tree representing this expression, rewrite
it in infix notation (with parentheses), and evaluate it.
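
One way to see that prefix and postfix lists need no parentheses is to evaluate them directly. The sketch below is an illustration under stated assumptions, not a method from the text: the function names are hypothetical, tokens are given as strings, "x" stands for multiplication, and only the operations +, -, and x are handled. Postfix is evaluated with a stack, and prefix is evaluated by recursion that follows the preorder definition (operation, then left operand, then right operand).

```python
import operator
from typing import Iterator, List

OPS = {"+": operator.add, "-": operator.sub, "x": operator.mul}


def eval_postfix(tokens: List[str]) -> int:
    stack: List[int] = []
    for tok in tokens:
        if tok in OPS:
            # The two most recent values are the right and left operands.
            right = stack.pop()
            left = stack.pop()
            stack.append(OPS[tok](left, right))
        else:
            stack.append(int(tok))
    return stack.pop()


def eval_prefix(tokens: List[str]) -> int:
    it: Iterator[str] = iter(tokens)

    def parse() -> int:
        tok = next(it)
        if tok in OPS:
            left = parse()    # preorder: the left subtree comes first
            right = parse()   # then the right subtree
            return OPS[tok](left, right)
        return int(tok)

    return parse()


# The lists from Examples 11 and 12.
print(eval_prefix(["+", "2", "x", "3", "4"]))    # 14, i.e. 2 + (3 x 4)
print(eval_prefix(["x", "+", "2", "3", "4"]))    # 20, i.e. (2 + 3) x 4
print(eval_postfix(["2", "3", "4", "x", "+"]))   # 14
print(eval_postfix(["2", "3", "+", "4", "x"]))   # 20
```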


Solutions to Practice Problems 


1 The tree is given in Figure 7-115.

2 (a) The root is the node labeled +. The two subtrees are given in Figure 7-116.
The left tree has leaves 2, 3, 6, and the right tree has leaf 4. The entire tree has
leaves 2, 3, 6, and 4. The nonleaves are +, x, and +.
(b) The root is Brenda. The leaves are Tom, Sam, Jason, Anne, Bob, and Carl.
Mary's children are Jason, Anne, and Bob. Mary's sibling is Carl, and her
parent is Sue.

3 In the Washington tree, Taylor is level 4, and in the Madison tree, it is level 5.
This matches the recursive definition of level.

4 The maximum number of nodes in a binary tree of height n is 2^n - 1. This is
proved in Exercise 13.

5 The root (R) comes first, then the X subtree in preorder (root X, left subtree L).
Finally, we print R's right subtree Y. Putting it together, we get RXLY.

6 WTMPLXRY

7 TWMLXYRP

8 The tree is given in Figure 7-117. In infix notation the expression is
(3 - 1) + (8 ÷ 4), with a value of 2 + 2 = 4.

9 The tree is given in Figure 7-118. In infix notation the expression is
(3 + 2) x (5 - 1), with a value of 5 x 4 = 20.

Figure 7-115 Solution to Practice Problem 1.

Figure 7-116 Solution to Practice Problem 2. (Left subtree; right subtree.)

Figure 7-117 Solution to Practice Problem 8.

Figure 7-118 Solution to Practice Problem 9.


Exercises for Section 7.6


1. Draw a picture of your family tree with your father's father as a root and with
his descendants below. What is the height of this tree? Is this a binary tree?

2. Draw a picture of your family tree (with three or four levels) with yourself as
the root, your mother's family tree as the left subtree, and your father's family
tree as the right subtree. Is this a binary tree?

3. Place the numbers 1, 4, -3, 12, 6, 13, 9 into a binary search tree with the
smallest possible height.


4. Place the numbers 11, -4, 3, 2, 16, 8, 5, -1 into a binary
search tree with the smallest possible height.

5. Give the preorder, inorder, and postorder traversals for 
the tree shown in Figure 7-119. 

6. Give the binary tree that represents each of the follow- 
ing “inorder” arithmetic expressions: 
(3 (5x4) 
(b) ax (3-5) 
9 (G+5)-7) x4) 22 











Figure 7-119 Tree for Problem 5. 


D (21) 8-2) - (G2) +2) 

7. For each of the trees in Exercise 6, give the result of
preorder and postorder traversals.

8. For each of the following arithmetic expressions in prefix
notation, write the equivalent infix expression (with
parentheses) and evaluate it:

(2) 95,473 

(9) = 5.4.4, 

©) «8.55.62 

9. For each of the following arithmetic expressions in postfix
notation, write the equivalent infix expression (with
parentheses) and evaluate it:

(a) 6,2,3, x 

©) 1.5.15 

1.24.3, x,4,5 3,6 4,4 

10. List the people in your family tree in Exercise 2 resulting
from inorder, preorder, and postorder traversals.

11. Give two different binary trees whose preorder print is
A, B, C. Give two different trees whose postorder print
is A, B, C. (This illustrates the fact that being able to
reconstruct the tree from only one traversal is a special
property of expression trees, and does not apply to all
binary trees.)

12. Explain why if a set of data is stored as a binary search
tree, then an inorder print of that tree will print the data
in sorted order.

13. Prove by induction on n that the maximum number of
nodes in a binary tree of height n is 2^n - 1.

14. Prove by induction on h that in any binary tree T with
height h, there is a node at level h in T.































15. Explain how Proposition 1 and Exercise 14 establish
that the height of a binary tree T is the largest level of any
node in T.

16. A binary tree of height h is called a complete binary tree
if it is either empty, or it has two complete subtrees both
of height h - 1. (In the picture, all the levels from level
1 to level h have as many nodes as they can hold.) Prove
by induction on n > 0 that a complete binary tree with
height n has 2^n - 1 total nodes.

17. The following algorithm allows us to think of any list of
data as a binary tree. Given a list a_1, a_2, a_3, ..., a_n, build
a tree by this rule: Place a_1 at the root, a_2 as the left child
of a_1, a_3 as the right child of a_1, and in general:

• If k is even, place a_k as the left child of a_{k/2}.

• If k is odd, place a_k as the right child of a_{(k-1)/2}.

(a) Apply this algorithm to these lists of words, letters,
or numbers:
i. short, above, to, well, indicate, closed, section,
student, find
ii. a b c d e f g h i j k l m n o p
iii. 16, 23, 1020, 145, 29, 17, 772
(b) For each list in part (a), what was the height of the
resulting tree?
(c) For what size list would the resulting tree be a complete
tree of height 2? of height 3? of height 6? of
height 10?
(d) For what sizes of list would the resulting tree be
a (not necessarily complete) tree of height 2? of
height 3? of height 6? of height 10?
(e) Give a formula for calculating the height of the tree
as a function of the size of the list.

18. Refer to Exercise 17. Prove by induction on n that the
nth item in the list is at level 1 + ⌊log_2(n)⌋.
19. To complete the proof of Corollary 5, show that for 
all n > 3, nf > n^. (Hint: You may use the fact that 
Wf < rè (n — 1)" for all n > 3 without proving it.) 
20. Suppose a binary search tree is created with new data
inserted using the simple process described after the
proof of Proposition 3. Explain what happens if the data
are inserted in increasing order. (For example, describe
what happens if you insert the numbers 1 to 10 in order
into a new binary search tree.)

























7.7 Excursion: Hamiltonian Cycles and the TSP





Hamilton’s Puzzle 


Another problem that seems to be related to the bridges of Königsberg puzzle comes 
from a little puzzle that was marketed by the Irish mathematician W. R. Hamilton 
(1805-1865) in 1857. The puzzle consists of a wooden dodecahedron, with its 20 nodes 
labeled using the names of cities. The object was to find a path using the edges of the 
dodecahedron that visits every "city" once and then returns to the starting point. In
terms of graphs, this means that the goal is to find a cycle in the graph in Figure 7-120 
that uses every node exactly once. 








Definition Consider a simple graph G.

1. Recall from Section 7.1 that a path is a list of vertices v_1, v_2, v_3, ..., v_n in
which every consecutive pair of vertices is connected by an edge and no
vertex appears more than once. A cycle is a path with the exception that
there is exactly one pair of repeated nodes: the first and last nodes
in the list are identical.

2. A Hamiltonian cycle in G is a cycle that uses every node of G. The graph G
is called Hamiltonian if it contains a Hamiltonian cycle.




















It would be natural to hope for a characterization of Hamiltonian graphs in the same
way that we found one for Eulerian graphs and graphs with Eulerian trails. Surprisingly,
no such characterization is currently known. In fact, mathematicians do not even know
how to program a computer to efficiently determine whether a graph is Hamiltonian.
Here "efficient" means something rather technical,* but suffice it to say that listing
all cycles in a graph and checking whether they use all the nodes is not efficient. On
the other hand, testing whether a connected graph is Eulerian by checking whether
every node has even degree is very efficient. Research on this and related problems
in graph theory is of the foremost interest to many mathematicians and computer
scientists today. And it all started with Hamilton's wooden puzzle.

Figure 7-120 The graph of the dodecahedron puzzle.








Practice Problem 1 Spend a little time trying to find a solution to Hamilton's
puzzle that begins with the nodes 1, 5, 14, 15, ...


By completing Practice Problem 1, you should appreciate that Hamilton's
dodecahedron puzzle is much harder to solve than the envelope-drawing puzzle
or the bridges of Königsberg puzzle. It is a different sort of "traversal problem"
altogether. In this section, we will discuss the complexity of solving this type of problem,
and we will examine some efficient ways to find approximate solutions.








Traveling Salesperson Problems 


The Hamilton puzzle fits into a wider class of problems called "traveling salesperson
problems." We will begin our discussion with an example of this more general problem
before returning to Hamilton's dodecahedron. A simple example can be described
as follows. The graph in Figure 7-121 shows five cities A, B, C, D, and E, with the
edges between them marked with the cost of traveling between those two cities. A
salesperson lives in city A, and he needs to visit the other four cities and return home
as cheaply as possible. What route should he take?

Before we attack this problem, we observe that this particular graph has a
special property: For each pair of distinct nodes, there is one edge joining them. Such
a graph is called a complete graph. In a graph with this property there will always be a
Hamiltonian cycle. In fact, there will be a lot of them! To find one, just list the nodes
in any order, then repeat the first node listed. To get from one node to the next in
your list, simply use the edge between them. Here are four Hamiltonian cycles for
the graph of Figure 7-121:

• C, A, B, E, D, C
• A, D, E, B, C, A
• B, E, D, C, A, B
• B, A, C, D, E, B

Observe that the first and third are essentially the same cycle, just with different
starting points. Also, the fourth is the reverse of the third.

Figure 7-121 Traveling salesperson example.

* If you worked through Section 4.8, you will appreciate the fact that no known algorithm for checking
for Hamiltonian cycles is O(n^p) for any p whatsoever. We say that such a problem has nonpolynomial
complexity, and that is very bad news for a computer.





Definition The complete graph on n nodes, denoted by K_n, is the simple
graph with nodes {1, ..., n} and an edge between every pair of distinct nodes.

Using this terminology, we determine that the graph in Figure 7-121 is the
graph K_5. (Technically, it is only isomorphic to K_5; in K_5 the node labels would
be 1, 2, 3, 4, 5 rather than A, B, C, D, E.)

We are now ready to solve the problem described earlier. 


Example 1 The graph in Figure 7-121 shows five cities with the edges between
them marked with the cost of traveling between those two cities. A salesperson lives in
city A, and he needs to visit the other four cities and return home as cheaply as possible.
What route should he take?

SOLUTION In Table 7-15, we simply list all possible routes (Hamiltonian
cycles) that start at node A, compute the cost of each one, and take the smallest!
Notice that we only have to list half of all possible routes, since for each route,
the reverse route has the same cost. For example, the route A, E, D, C, B, A is
the reverse of the first route A, B, C, D, E, A, and consequently has the same
cost as that route. Hence, the cheapest route is A, C, B, D, E, A, and its cost
is $325. Notice that starting the cycles at any other city would not change the
costs or the final answer.

Number | Route            | Cost
1      | A, B, C, D, E, A |
2      | A, B, C, E, D, A |
3      | A, B, D, C, E, A |
4      | A, B, D, E, C, A |
5      | A, B, E, C, D, A |
6      | A, B, E, D, C, A |
7      | A, C, B, D, E, A | $325 (cheapest)
8      | A, C, B, E, D, A |
9      | A, C, D, B, E, A |
10     | A, C, E, B, D, A |
11     | A, D, B, C, E, A |
12     | A, D, C, B, E, A |

Table 7-15 Table for Example 1
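
The exhaustive search of Example 1 is short to express in code. The sketch below is illustrative, not the text's own program. The edge weights are an assumption: they are read from the edge list used in Example 4, with the weight of [B, D] taken to be $65 so that the route A, C, B, D, E, A costs the stated $325. The names `EDGES`, `route_cost`, and `brute_force_tsp` are hypothetical.

```python
from itertools import permutations

# Assumed weights for the graph of Figure 7-121 (see the note above).
EDGES = {
    ("A", "B"): 80, ("A", "C"): 70, ("A", "D"): 75, ("A", "E"): 50,
    ("B", "C"): 55, ("B", "D"): 65, ("B", "E"): 95,
    ("C", "D"): 60, ("C", "E"): 90, ("D", "E"): 85,
}

# Symmetric lookup table: W[u, v] == W[v, u].
W = {}
for (u, v), w in EDGES.items():
    W[u, v] = W[v, u] = w
NODES = sorted({v for edge in EDGES for v in edge})


def route_cost(route):
    """Total weight of the consecutive edges along a closed route."""
    return sum(W[a, b] for a, b in zip(route, route[1:]))


def brute_force_tsp(start="A"):
    """Check every Hamiltonian cycle from `start`, skipping reverse duplicates."""
    others = [v for v in NODES if v != start]
    best = None
    checked = 0
    for perm in permutations(others):
        if perm[0] > perm[-1]:
            continue  # the reversed route has the same cost
        route = (start,) + perm + (start,)
        checked += 1
        cost = route_cost(route)
        if best is None or cost < best[0]:
            best = (cost, route)
    return best, checked


best, checked = brute_force_tsp()
print(checked)  # 12 routes, as in Table 7-15
print(best)     # (325, ('A', 'C', 'B', 'D', 'E', 'A'))
```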




We can formalize this problem somewhat. The graph in Figure 7-121 is usually
called a weighted graph. It is a graph with costs (generically called weights) associated
with the edges.* Our goal then is to find a Hamiltonian cycle having the smallest total
cost possible.











Definition Given a graph G with weighted edges, the problem of find- 
ing the Hamiltonian cycle of smallest possible weight is called the traveling 
salesperson problem (TSP) on G. 











We can use our counting ideas to produce a formula for the total number of
routes to check.


Proposition 1 The number of Hamiltonian cycles that need to be checked in the
TSP on K_n is (n - 1)!/2.


PROOF As in the previous example, we can see that there is a one-to-one
correspondence between the set of Hamiltonian cycles that begin and end with 1 and
the set of permutations on {2, 3, 4, ..., n}. There are (n - 1)! such permutations,
and only half of them need to be checked since the others are the reverses, and
hence correspond to cycles of the same weight.
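
To get a feel for how quickly the count in Proposition 1 grows, the formula can be evaluated directly. This is a small illustrative sketch; the function name is hypothetical, and it assumes n is at least 3 so that every cycle has a distinct reverse.

```python
from math import factorial


def num_hamiltonian_cycles(n: int) -> int:
    """Number of cycles to check in the TSP on K_n, namely (n - 1)!/2, for n >= 3."""
    return factorial(n - 1) // 2


print(num_hamiltonian_cycles(5))   # 12, matching the 12 routes of Example 1
for n in (10, 20, 50, 100):
    count = num_hamiltonian_cycles(n)
    print(n, "nodes:", len(str(count)), "digits")
```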


In one sense, the TSP is totally solved for complete graphs. It would be fairly 
easy to write a computer program to generate the list of all the Hamiltonian cycles, 


calculate the total cost for each, and find the smallest total cost. This is all very
promising until we think harder about the function (n - 1)!/2. Table 7-16 shows us how
this function grows as the number of nodes grows. The size of these numbers is a 
little more than just cause for concern. The number of Hamiltonian cycles in Ki. 
for example, is far more than any current estimate of the number of elementary 
particles in the universe. This is unfortunate since problems involving paths and cycles
in weighted graphs are very common in real-world applications. For example, any 
business concerned with a transportation network (like a bus line, airline, or express 
delivery service) has hundreds of destinations (nodes) and costs for moving between 
destinations (weighted edges). Telecommunication companies might consider the 
cost of cables (weights of edges) between thousands of switching stations (nodes). 
Clearly, the method of solution in these problems cannot consist of first generating 
the list of all Hamiltonian cycles of a graph like Ksy or Kso, so we will discuss 
some other things we might try. Before we do that, however, we should establish 
how Hamilton’s dodecahedron puzzle is just an example of this class of problem, 





Table 7-16 The Number of Hamiltonian Cycles in K_n (rows: n and (n - 1)!/2)


* Formally, a weighted graph is a graph G along with a function w from the set of edges of G to the set of
real numbers.










Figure 7-122 TSP graph for 
Practice Problem 2. 


Example 2 Hamilton's puzzle is a TSP problem.


SOLUTION Form the complete graph on 20 nodes, and weight the edges
in the dodecahedron puzzle with a 0 and weight the edges not in the dodecahedron
puzzle with a 1. Asking the question "Does Hamilton's puzzle have a
solution?" is the same as asking, "Does the minimum cost Hamiltonian cycle
in the weighted K_20 have weight 0?"
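
The reduction in Example 2 works for any graph, not just the dodecahedron. The sketch below illustrates it on two small graphs chosen here for convenience (they are not from the text): edges of the graph get weight 0, missing edges get weight 1, and the graph is Hamiltonian exactly when the cheapest cycle in the weighted complete graph costs 0. The function name is hypothetical, nodes are assumed to be numbered 1 to n with n at least 3, and the search is the exhaustive one, so it is only practical for very small n.

```python
from itertools import permutations


def min_cycle_cost(n, edges):
    """Minimum cost of a Hamiltonian cycle in K_n with 0/1 weights from `edges`."""
    edge_set = {frozenset(e) for e in edges}
    best = None
    for perm in permutations(range(2, n + 1)):
        route = (1,) + perm + (1,)
        # Weight 0 for an edge of the original graph, 1 for a missing edge.
        cost = sum(0 if frozenset((a, b)) in edge_set else 1
                   for a, b in zip(route, route[1:]))
        best = cost if best is None else min(best, cost)
    return best


square = [(1, 2), (2, 3), (3, 4), (4, 1)]   # a 4-cycle: Hamiltonian
star = [(1, 2), (1, 3), (1, 4)]             # a star: not Hamiltonian

print(min_cycle_cost(4, square))  # 0, so the square has a Hamiltonian cycle
print(min_cycle_cost(4, star))    # 2, so the star does not
```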


Practice Problem 2 Find the minimum-weight Hamiltonian cycle in the graph
shown in Figure 7-122. 


Approximate Solutions 


It is a computational burden to list all possible routes, so we would like to have a 
condition like the one for Eulerian circuits that we can use to find an answer without 
listing every possibility. Unfortunately, no one knows of any such condition. In fact, 
the TSP is in a class of difficult problems that are computationally equivalent, called 
the class of NP-complete problems. 

When an important problem cannot feasibly be solved exactly, the next best thing 
is an approximate solution. Because of the industrial interest in problems related to 
the TSP, improving on the best-known approximation schemes is one of the few 
ways a mathematician can get rich and famous. We will not delve into a study of 
these schemes in this book, but we will look at a couple of obvious choices for
approximation schemes to see how well they do. These algorithms go by a variety 
of names. We use the adjective “greedy” in describing them because their overall 
strategy matches the overall strategy of a class of algorithms described using that 
term. In essence, a “greedy” algorithm tries to do as well as it can within small parts 
of the problem, hoping that will lead to a good solution for the overall problem. 





Definition Given a weighted complete graph K_n, the vertex-greedy algorithm
starts at a designated node v_1, and uses the following rule: For each i ≥ 2,
choose v_i to be the node not yet used for which [v_{i-1}, v_i] has the smallest weight.
Once all the nodes are used, the edge from v_n back to v_1 completes the cycle.





Example 3 For the graph in Example 1, find the result of the vertex-greedy
algorithm, starting with vertex A.


SOLUTION The steps are shown in Table 7-17. The final route produced by
this algorithm is A, E, D, C, B, A (or A, B, C, D, E, A), and its cost is $330.


While this algorithm does not necessarily give the cheapest route, it can be 
completed in much less time than the process of considering every possible route 
in Example 1. In the example, the total number of edges listed in the “Edges to 
examine" column is 4 + 3 + 2 + 1. In general, examining k edges to see which has
the smallest weight can be done using k - 1 comparisons. So the algorithm for this
example uses 3 + 2 + 1 + 0 = 6 comparisons of weights for the sample graph. This
example can be easily generalized to the following proposition. 


Proposition 2 The vertex-greedy algorithm on a weighted complete graph K_n
requires (n - 1)(n - 2)/2 comparisons.

PROOF We start with any vertex v_1. The algorithm must look at each of the
n - 1 edges leading to the other nodes of the graph. This process requires n - 2
comparisons. Similarly, after v_2 has been determined, each of the edges leading
to the remaining n - 2 nodes must be considered, a process that requires n - 3
comparisons. In general (we could rigorously prove by induction), after vertex
v_{i-1} has been determined, n - i comparisons are required to determine v_i. The
total number of comparisons required then is

(n - 2) + (n - 3) + ... + 2 + 1 + 0 = (n - 1)(n - 2)/2.





Although this algorithm does not necessarily return the cheapest Hamiltonian
cycle, its execution time is considerably smaller than the process of checking every
possible cycle that we saw in Proposition 1. We can write the result as (n^2 - 3n + 2)/2. As
n gets larger, the -3n + 2 portion becomes less and less significant, so the result is
approximately proportional to n^2. A simple variation of this algorithm would be to
repeat the algorithm for each of the n nodes of the graph, resulting in an algorithm
whose complexity is approximately proportional to n^3.





Step | Current route | Edges to examine               | Cheapest next edge | Current cost
1    | A             | [A, B], [A, C], [A, D], [A, E] | [A, E]             | $50
2    | A, E          | [E, B], [E, C], [E, D]         | [E, D]             | $135
3    | A, E, D       | [D, B], [D, C]                 | [D, C]             | $195
4    | A, E, D, C    | [C, B]                         | [C, B]             | $250
5    | A, E, D, C, B | Go back to start               | [B, A]             | $330

Table 7-17 Steps for Example 3
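
The vertex-greedy rule and the comparison count of Proposition 2 can both be traced in code. This sketch is illustrative only. As an assumption, the weights are those listed with the edges in Example 4, with [B, D] taken as $65; the names `EDGES` and `vertex_greedy` are hypothetical.

```python
# Assumed weights for the graph of Figure 7-121 (see the note above).
EDGES = {
    ("A", "B"): 80, ("A", "C"): 70, ("A", "D"): 75, ("A", "E"): 50,
    ("B", "C"): 55, ("B", "D"): 65, ("B", "E"): 95,
    ("C", "D"): 60, ("C", "E"): 90, ("D", "E"): 85,
}
W = {}
for (u, v), w in EDGES.items():
    W[u, v] = W[v, u] = w
NODES = sorted({v for edge in EDGES for v in edge})


def vertex_greedy(start):
    """Repeatedly move to the cheapest unused neighbor, then return to start."""
    route = [start]
    unused = set(NODES) - {start}
    total = 0
    comparisons = 0
    while unused:
        current = route[-1]
        # Finding the smallest of k weights takes k - 1 comparisons.
        comparisons += len(unused) - 1
        nxt = min(sorted(unused), key=lambda v: W[current, v])
        total += W[current, nxt]
        route.append(nxt)
        unused.remove(nxt)
    total += W[route[-1], start]   # close the cycle
    route.append(start)
    return route, total, comparisons


route, total, comparisons = vertex_greedy("A")
print(route)        # ['A', 'E', 'D', 'C', 'B', 'A']
print(total)        # 330
print(comparisons)  # 6, which equals (5 - 1)(5 - 2)/2
```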




Finally, we look at one other obvious method for finding an approximate solution 
to the TSP. The complexity of this algorithm is related to the complexity of sorting the
edge weights from cheapest to most expensive, a topic that we will not take up in this
book. It can be shown that this algorithm's complexity is approximately proportional
to n^2 log_2 n, which lies between n^2 and n^3.





Definition Given a weighted complete graph K_n, the edge-greedy algo-
rithm first sorts the edges into increasing order by weight. It then builds the 
Hamiltonian cycle by using the edges in this increasing order, except that it 
skips over any edge that would create a node of degree 3 or a cycle. It contin- 
ues until only one edge can be added to complete the Hamiltonian cycle. 








Example 4 For the graph in Example 1, find the result of the edge-greedy
algorithm.

SOLUTION We will show the steps once again in table form. We begin by
listing the edges in increasing order by weight:

[A, E], [B, C], [C, D], [B, D], [A, C], [A, D], [A, B], [D, E], [C, E], [B, E]

Then we go down the rows of Table 7-18 filling in the last column. You may find it
useful to follow along by highlighting the edges on the graph as they are added.
The final cycle is A, E, B, C, D, A with a total cost of 50 + 95 + 55 + 60 + 75 =
$335. Note that after step 6 we already had four edges, and it would have been
possible to stop and figure out what the fifth edge would have to be. Since in
practice most of the time taken doing this algorithm occurs in the sorting step,
this would not significantly reduce the complexity.

Step | Edge and weight | Action taken                        | Cost so far
1    | [A, E], $50     | Add to result                       | $50
2    | [B, C], $55     | Add to result                       | $105
3    | [C, D], $60     | Add to result                       | $165
4    | [B, D], $65     | Skip (would form cycle B, C, D, B)  |
5    | [A, C], $70     | Skip (would make C have degree 3)   |
6    | [A, D], $75     | Add to result                       | $240
7    | [A, B], $80     | Skip (degree 3 and cycle problems)  |
8    | [D, E], $85     | Skip (degree 3 and cycle problems)  |
9    | [C, E], $90     | Skip (degree 3 and cycle problems)  |
10   | [B, E], $95     | Add to result                       | $335

Table 7-18 Steps for Example 4
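
The edge-greedy steps can be sketched as follows. This is an illustration under assumptions rather than a definitive implementation: the weights are the ones listed in Table 7-18, with [B, D] taken as $65 (the value consistent with its position in the sorted list and with the $325 route of Example 1), and the cycle test uses a simple union-find structure, which is one common choice and not something the text prescribes. All names are hypothetical.

```python
# Assumed weights for the graph of Figure 7-121 (see the note above).
EDGES = {
    ("A", "B"): 80, ("A", "C"): 70, ("A", "D"): 75, ("A", "E"): 50,
    ("B", "C"): 55, ("B", "D"): 65, ("B", "E"): 95,
    ("C", "D"): 60, ("C", "E"): 90, ("D", "E"): 85,
}
NODES = sorted({v for edge in EDGES for v in edge})


def edge_greedy():
    degree = {v: 0 for v in NODES}
    parent = {v: v for v in NODES}   # union-find forest over path fragments

    def find(v):
        while parent[v] != v:
            v = parent[v]
        return v

    chosen, total = [], 0
    for (u, v), w in sorted(EDGES.items(), key=lambda item: item[1]):
        if degree[u] == 2 or degree[v] == 2:
            continue                 # would create a node of degree 3
        closes_cycle = find(u) == find(v)
        if closes_cycle and len(chosen) < len(NODES) - 1:
            continue                 # would create a cycle too early
        chosen.append((u, v))
        total += w
        degree[u] += 1
        degree[v] += 1
        parent[find(u)] = find(v)
        if len(chosen) == len(NODES):
            break                    # the Hamiltonian cycle is complete
    return chosen, total


chosen, total = edge_greedy()
print(chosen)  # [('A', 'E'), ('B', 'C'), ('C', 'D'), ('A', 'D'), ('B', 'E')]
print(total)   # 335
```

The only edge allowed to close a cycle is the last one, added when the other n - 1 edges already form a single path through all the nodes.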







Practice Problem 3 Perform both approximation algorithms on the graph in 
Practice Problem 2. 


These approximate solutions can be pretty bad, as we will see in the exercises
for this section, and they are presented here as simple, naive examples, not current 
research. Because of the importance of this type of problem in industry, a great deal 
of money is spent each year on research into good approximate solutions to the TSP. 
However, the problem is far from solved. 





Hamiltonian Graphs Again 


We end this section with two examples of classical theoretical results concerning 
Hamiltonian graphs. Recall that there was a very simple necessary and sufficient 
condition (namely, that the nodes all have even degree) to determine if a given 
connected graph contained an Eulerian circuit. There is no known useful necessary 
and sufficient condition for determining if a graph contains a Hamiltonian cycle. The 
theorems we will prove do, however, provide some information: 





• The first theorem gives a sufficient condition. If that condition is true, the graph
has a Hamiltonian cycle, but if that condition is false, we can make no conclusion.

• The second theorem gives a different, necessary condition. If that condition is false,
the graph does not have a Hamiltonian cycle, but if that condition is true, we can
make no conclusion.

To assist your understanding of the proof of the first theorem, we first look at a
few small examples.

Figure 7-123 Partial graph for Example 5.

Example 5 Suppose a graph G contains six vertices as shown in Figure
7-123. The graph contains at least the five edges shown, but additional edges in the
graph are possible.

1. For each part, show that, if the graph also contains the edge(s) listed, it must contain
a Hamiltonian cycle:

(a) [x, y]

(b) [x, a_2] and [y, a_1]

(c) [x, a_3] and [y, a_2]

(d) [x, a_4] and [y, a_3]

2. Now suppose you are told that the graph G does not have a Hamiltonian cycle.
Explain why deg(x) + deg(y) must be less than 6.





SOLUTION

1. Refer to the graphs in Figure 7-124 for the subsequent discussion.
(a) If we add the edge [x, y] to the graph in the figure, there is a Hamiltonian
cycle x, a_1, a_2, a_3, a_4, y, x.
(b) If we add both [x, a_2] and [y, a_1], there is a Hamiltonian cycle
x, a_1, y, a_4, a_3, a_2, x.
(c) If we add both [x, a_3] and [y, a_2], there is a Hamiltonian cycle
x, a_1, a_2, y, a_4, a_3, x.
(d) If we add both [x, a_4] and [y, a_3], there is a Hamiltonian cycle
x, a_1, a_2, a_3, y, a_4, x.

Figure 7-124 Graphs for the solution to Example 5.

2. The graph as originally shown has edges [x, a_1] and [y, a_4], so that deg(x) +
deg(y) is 2. If we list all the edges that could make this sum larger, we have

[x, y], [x, a_2], [x, a_3], [x, a_4], [y, a_1], [y, a_2], [y, a_3]

These are the same seven edges we looked at in the first part of this example.
We showed that our graph cannot have the edge [x, y] (part (a)), and that it
can have no more than one of the edges listed in each of parts (b), (c), and
(d). So it cannot have more than three of these seven edges. Even if it did
have this maximum, we would have deg(x) + deg(y) = 5. So we can be sure
that deg(x) + deg(y) < 6.
Practice Problem 4 Figure 7-125 shows a graph with eight vertices. Once again,
the figure shows some of the edges of the graph, but there could be more.
1. For each part, show that if the graph also contains both the edges listed, then there
is a Hamiltonian cycle:
Graph for Practice (a) [r.a] and [y, a3] 
Problem 4. (b) [x as] and [y, as] 


2. What other pairs of edges following this general pattern have this same property? 
3. What is the largest value for deg(x) + deg(y) if the graph is not Hamiltonian? 


We are now ready for the theorem that gives a sufficient condition for a graph 
to have a Hamiltonian cycle. Recall that we say nodes are adjacent if they are joined 
by an edge. 


Theorem 3 (Ore) Let G be a graph with n ≥ 3 nodes. If deg x + deg y ≥ n for
all nonadjacent nodes x and y in G, then G is Hamiltonian.


PROOF We will prove the contrapositive: If G is nor Hamiltonian, then there 
are nonadjacent nodes x and y in G with (deg x + deg y) <n. 

Let G be a graph that is not Hamiltonian. Keep adding edges, one at a time,
until the resulting graph is Hamiltonian. (We know that eventually this will happen
because the complete graph $K_n$ is Hamiltonian.) Call the last non-Hamiltonian
graph in this process H, and call the edge that was added next (the one that caused
a Hamiltonian graph) [x, y]. Notice that H did not contain the edge [x, y], so x
and y are nonadjacent in H (and also in the original graph G). Also notice that
the Hamiltonian cycle which results from adding the edge [x, y] must contain this
[x, y] edge (or else H would have already had a Hamiltonian cycle).

By labeling the vertices, we can write the Hamiltonian cycle as
$x, a_1, a_2, \ldots, a_{n-2}, y, x$. Since all the edges but [x, y] were already in H, we know
that H has a path $x, a_1, a_2, \ldots, a_{n-2}, y$, plus perhaps some other edges. (This is
exactly the situation illustrated by the examples and practice problems.)

Considering just this path, we have deg(x) + deg(y) = 2, and the only edges
H could have that can increase this sum are $[x, a_{i+1}]$ and $[y, a_i]$ (where i can range
from 1 to n - 3). However, for each value of i from 1 to n - 3, H cannot contain
both members of the pair $[x, a_{i+1}]$, $[y, a_i]$, since this would give a Hamiltonian
cycle $x, a_1, \ldots, a_i, y, a_{n-2}, a_{n-3}, \ldots, a_{i+1}, x$ in H. Thus, no more than n - 3 of these
edges can be in H, so in H we have deg(x) + deg(y) ≤ 2 + (n - 3) = n - 1 < n.
Since H was obtained from G by adding additional edges, this same inequality
must also hold in G.
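
The hypothesis of Ore's theorem is easy to test mechanically. The following is a minimal Python sketch, not part of the text: the function name `satisfies_ore` and the representation of a graph as a node count plus a list of edges between nodes 0, 1, ..., n - 1 are assumptions made for illustration. It only checks the degree condition; it does not search for a Hamiltonian cycle.

```python
from itertools import combinations


def satisfies_ore(n, edges):
    """Return True if deg(x) + deg(y) >= n for every nonadjacent pair x, y.

    Hypothetical helper: nodes are assumed to be 0, 1, ..., n - 1 and the
    graph is assumed to be simple (no loops, no parallel edges).
    """
    adjacent = {v: set() for v in range(n)}
    for u, v in edges:
        adjacent[u].add(v)
        adjacent[v].add(u)
    return all(
        len(adjacent[x]) + len(adjacent[y]) >= n
        for x, y in combinations(range(n), 2)
        if y not in adjacent[x]
    )


# Complete bipartite graph on 3 + 3 nodes: every degree is 3, and 3 + 3 >= 6.
k33 = [(u, v) for u in range(3) for v in range(3, 6)]
# A path on 4 nodes: its two end nodes are nonadjacent and have degree 1 each.
path4 = [(0, 1), (1, 2), (2, 3)]

print(satisfies_ore(6, k33))    # True
print(satisfies_ore(4, path4))  # False
```

A result of True guarantees a Hamiltonian cycle when n ≥ 3; a result of False says nothing either way, since the condition is only sufficient.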




















This gives us a sufficient condition for testing if a graph is Hamiltonian, but
this condition is certainly not necessary. (See Exercise 17.) On the other hand, the
following result gives us a necessary condition that alas is not sufficient. (See Exercise
18.)


Theorem 4 If G has a Hamiltonian cycle, then removing any edge from G will
result in a connected graph.

PROOF Let a simple graph G with a Hamiltonian cycle be given. Let's name
the cycle $v_1, v_2, v_3, \ldots, v_n, v_1$, where n is the number of nodes in G and no node
is repeated in the cycle. Now let the graph G' be formed by removing any single
edge from G. To show that G' is connected, we must show that for any given pair
of nodes in G', there exists a path between these nodes. Let two nodes in G' (and
hence in G) be given. Since the cycle given above completely lists all nodes in G,
we know that the two chosen nodes come from this list, so let's say they are $v_i$ and
$v_j$, where i < j. Since only one edge of G is missing in G', one of the following
two paths must be intact in G': (i) $v_i, v_{i+1}, \ldots, v_j$ or (ii) $v_j, v_{j+1}, \ldots, v_n, v_1, \ldots, v_i$.
Hence, there is a path connecting $v_i$ and $v_j$ in G'.
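
Theorem 4 can be used as a quick screen: if deleting some edge disconnects the graph, the graph cannot be Hamiltonian. The sketch below is one possible way to run that screen in Python; the helper names `is_connected` and `survives_any_edge_removal` are hypothetical, and the graph is assumed to be given as a list of nodes and a list of edges.

```python
def is_connected(nodes, edges):
    """Depth-first search from the first node; True if every node is reached."""
    nodes = list(nodes)
    if not nodes:
        return True
    adjacent = {v: set() for v in nodes}
    for u, v in edges:
        adjacent[u].add(v)
        adjacent[v].add(u)
    seen = {nodes[0]}
    stack = [nodes[0]]
    while stack:
        u = stack.pop()
        for w in adjacent[u] - seen:
            seen.add(w)
            stack.append(w)
    return len(seen) == len(nodes)


def survives_any_edge_removal(nodes, edges):
    """True if the graph stays connected no matter which single edge is removed."""
    edges = list(edges)
    return all(
        is_connected(nodes, edges[:i] + edges[i + 1:])
        for i in range(len(edges))
    )


cycle4 = [("a", "b"), ("b", "c"), ("c", "d"), ("d", "a")]
path3 = [("a", "b"), ("b", "c")]

print(survives_any_edge_removal("abcd", cycle4))  # True
print(survives_any_edge_removal("abc", path3))    # False, so not Hamiltonian
```

Because the condition is only necessary, a result of True does not show that a Hamiltonian cycle exists.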











Solutions to Practice Problems 


1. Here is one solution to Hamilton's puzzle:
1, 5, 14, 15, 16, 17, 18, 9, 10, 11, 19, 20, 13, 12, 4, 3, 2, 8, ...

2. Table 7-19 shows every possibility and highlights the
cheapest option.

3. Table 7-20 shows the vertex-greedy solution, and Table
7-21 shows the edge-greedy solution.

4. 1. There are these Hamiltonian cycles:
(a) $x, a_1, a_2, a_3, y, a_6, a_5, a_4, x$
(b) $x, a_1, a_2, a_3, a_4, a_5, y, a_6, x$
2. $[x, a_2], [y, a_1]$ and $[x, a_3], [y, a_2]$ and $[x, a_5], [y, a_4]$
3. The two there now along with no more than five additional
means there will be no more than seven total.
























Route | Cost
A, B, C, D, E, A | $500
A, B, C, E, D, A | $520
A, B, D, C, E, A | $450
A, B, D, E, C, A | $420 (cheapest)
A, B, E, C, D, A |
A, B, E, D, C, A |
A, C, B, D, E, A |
A, C, B, E, D, A |
A, C, D, B, E, A |
A, C, E, B, D, A |
A, D, B, C, E, A |
A, D, C, B, E, A |

Table 7-19 Exhaustive Solution to Practice Problem 2
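
The exhaustive method can be sketched in a few lines of Python. Everything named here (`WEIGHTS`, `weight`, `tour_cost`, `all_tours`) is hypothetical, and the edge weights are an assumption: they are inferred from the costs that appear in Tables 7-19 through 7-21 rather than copied from the original weighted graph. The weight of [B, E] in particular is a guess that fits between the neighboring entries of the edge-greedy table.

```python
from itertools import permutations

# Assumed edge weights in dollars (inferred, not taken from the original figure).
WEIGHTS = {
    ("A", "B"): 50, ("A", "C"): 60, ("A", "D"): 90, ("A", "E"): 80,
    ("B", "C"): 140, ("B", "D"): 70, ("B", "E"): 100,  # [B, E] is a guess
    ("C", "D"): 120, ("C", "E"): 130, ("D", "E"): 110,
}


def weight(u, v):
    """Look up an undirected edge in either orientation."""
    return WEIGHTS[(u, v)] if (u, v) in WEIGHTS else WEIGHTS[(v, u)]


def tour_cost(tour):
    """Add up the weights of consecutive edges along a closed route."""
    return sum(weight(u, v) for u, v in zip(tour, tour[1:]))


def all_tours(start, others):
    """Yield each Hamiltonian cycle once, skipping its reversed duplicate."""
    for perm in permutations(others):
        if perm[0] < perm[-1]:
            yield (start, *perm, start)


tours = sorted(all_tours("A", "BCDE"), key=tour_cost)
best = tours[0]
print(len(tours))             # 12
print(best, tour_cost(best))  # ('A', 'B', 'D', 'E', 'C', 'A') 420
```

Fixing the start at A and discarding reversed routes leaves (5 - 1)!/2 = 12 cycles, the same twelve routes listed in the table.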











Step | Current route | Edges to examine | Cheapest next edge | Current cost
1 | A | [A, B], [A, C], [A, D], [A, E] | [A, B] | $50
2 | A, B | [B, C], [B, D], [B, E] | [B, D] | $120
3 | A, B, D | [D, C], [D, E] | [D, E] | $230
4 | A, B, D, E | [E, C] | [E, C] | $360
5 | A, B, D, E, C | Go back to start | [C, A] | $420

Table 7-20 Vertex-Greedy Solution to Practice Problem 3
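
The steps in Table 7-20 can be traced with a short Python sketch. The function `vertex_greedy` and the `WEIGHTS` dictionary are hypothetical names, and the weights are again an assumption inferred from the tables in this section (the value for [B, E] is a guess that this particular run never uses).

```python
# Assumed edge weights in dollars (inferred, not taken from the original figure).
WEIGHTS = {
    ("A", "B"): 50, ("A", "C"): 60, ("A", "D"): 90, ("A", "E"): 80,
    ("B", "C"): 140, ("B", "D"): 70, ("B", "E"): 100,
    ("C", "D"): 120, ("C", "E"): 130, ("D", "E"): 110,
}


def weight(u, v):
    """Look up an undirected edge in either orientation."""
    return WEIGHTS[(u, v)] if (u, v) in WEIGHTS else WEIGHTS[(v, u)]


def vertex_greedy(start, vertices):
    """From the current vertex, always move to the cheapest unvisited vertex."""
    route = [start]
    unvisited = set(vertices) - {start}
    cost = 0
    while unvisited:
        here = route[-1]
        nxt = min(unvisited, key=lambda v: weight(here, v))
        cost += weight(here, nxt)
        route.append(nxt)
        unvisited.remove(nxt)
    # Close the cycle by returning to the starting vertex.
    cost += weight(route[-1], start)
    route.append(start)
    return route, cost


print(vertex_greedy("A", "ABCDE"))  # (['A', 'B', 'D', 'E', 'C', 'A'], 420)
```

Ties between equally cheap edges are broken arbitrarily in this sketch; the assumed weights contain no ties.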









































Step | Edges and weights | Action taken | Cost so far
1 | [A, B], $50 | Add to result | $50
2 | [A, C], $60 | Add to result | $110
3 | [B, D], $70 | Add to result | $180
4 | [A, E], $80 | Skip (would make A have degree 3) |
5 | [A, D], $90 | Skip (would make A have degree 3) |
6 | [B, E], $100 | Skip (would make B have degree 3) |
7 | [D, E], $110 | Add to result | $290
8 | [C, D], $120 | Skip (degree 3 and cycle problems) |
9 | [C, E], $130 | Add to result | $420
10 | [B, C], $140 | Skip (degree 3 and cycle problems) |

Table 7-21 Edge-Greedy Solution to Practice Problem 3
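
The edge-greedy rules used in Table 7-21 (skip an edge that would give a vertex degree 3 or close a cycle too early) can be sketched as follows. This is an illustration under assumptions: `edge_greedy` is a hypothetical name, the weights are inferred from the tables in this section, and a small union-find structure is one possible way to detect premature cycles, not necessarily the method the text has in mind.

```python
# Assumed edge weights in dollars (inferred, not taken from the original figure).
WEIGHTS = {
    ("A", "B"): 50, ("A", "C"): 60, ("A", "D"): 90, ("A", "E"): 80,
    ("B", "C"): 140, ("B", "D"): 70, ("B", "E"): 100,
    ("C", "D"): 120, ("C", "E"): 130, ("D", "E"): 110,
}


def edge_greedy(vertices, weights):
    """Scan edges from cheapest to dearest, keeping those that stay legal."""
    parent = {v: v for v in vertices}

    def find(v):
        # Union-find root lookup with path halving.
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    degree = {v: 0 for v in vertices}
    chosen = []
    for (u, v), _w in sorted(weights.items(), key=lambda item: item[1]):
        if degree[u] == 2 or degree[v] == 2:
            continue  # would give a vertex degree 3
        closes_cycle = find(u) == find(v)
        if closes_cycle and len(chosen) < len(vertices) - 1:
            continue  # would close a cycle before every vertex is included
        parent[find(u)] = find(v)
        degree[u] += 1
        degree[v] += 1
        chosen.append((u, v))
        if len(chosen) == len(vertices):
            break  # the Hamiltonian cycle is complete
    return chosen, sum(weights[e] for e in chosen)


edges, cost = edge_greedy("ABCDE", WEIGHTS)
print(edges)  # [('A', 'B'), ('A', 'C'), ('B', 'D'), ('D', 'E'), ('C', 'E')]
print(cost)   # 420
```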








Exercises for Section 7.7


1. Find another solution to Hamilton's dodecahedron puzzle.

2. In Example 2, we stated that Hamilton's puzzle can
be thought of as a TSP problem in an appropriately
weighted $K_{20}$ graph. How many different Hamiltonian
cycles are there in $K_{20}$?

3. Suppose that a computer can calculate the weight of a
single Hamiltonian path in $K_n$ in $10^{-15}$ sec. (This means
that it can compute the weights of a thousand trillion
different cycles per second.) If you use this computer to
check all possible cycles, how long would it take to find
the shortest (in total driving miles) cycle that visits all
state capitals in the lower 48 United States?

4. Suppose the entire population of the earth (which we
can round to 10 billion people) has a computer like
the one described in the previous exercise, and they all
set about checking different possible cycles. How long
would it take to check all possible cycles in $K_{48}$?

5. For what values of m and n is the complete bipartite
graph $K_{m,n}$ a Hamiltonian graph?

6. Solve the TSP on each of the complete graphs given in
Figure 7-126.





Figure 7-126 Graphs for Problem 6.


7. Provide a simple example to show that the edge-greedy
algorithm does not always give the exact solution to the
TSP.

8. For each of the graphs in Exercise 6, use the vertex-greedy
algorithm starting with vertex a to find a cheap
Hamiltonian cycle.

9. For each of the graphs in Exercise 6, use the edge-greedy
algorithm to find a cheap Hamiltonian cycle.

10. Give an example of a weighted complete graph for
which the cheapest Hamiltonian cycle costs $100, but
the vertex-greedy algorithm finds a Hamiltonian cycle
that costs $1,000.

11. Using the previous problem as a starting point, explain
why the vertex-greedy algorithm can be wrong by any
given amount, depending on the graph.

12. Notice that the result of the vertex-greedy algorithm
depends on the starting vertex. One way to counteract
this is to run the vertex-greedy algorithm starting
at each vertex, and taking as our approximate solution
the cheapest Hamiltonian cycle out of all those runs.
(This multiplies our efforts by the number of vertices in
the graph, but this increase is nothing compared to the
increase in the brute force method of attack.) Try this
new version on the graph shown in Figure 7-127.

13. For the graph in Exercise 12, use the edge-greedy algorithm
to find a cheap Hamiltonian cycle, and compare
this result to your answer to that exercise.

14. Let $G_n$ denote the graph whose nodes are labeled with
all $2^n$ binary sequences of length n, and which has an
edge from a to b whenever sequences a and b differ
in only one position. For example, $G_2$ has nodes
{00, 01, 10, 11} and edges {[00, 01], [00, 10], [01, 11],
[10, 11]}. Show that $G_2$, $G_3$, and $G_4$ are Hamiltonian.























Figure 7-127 Graphs for Problem 12. 







15. Ironically, Hamilton's dodecahedron puzzle (ca. 1857)
is not the earliest puzzle problem involving a "Hamiltonian
cycle" and solved using graph theoretical ideas. A
much older puzzle is the "knight's tour problem," which
was treated mathematically by Euler (in 1759) and
A. T. Vandermonde (in 1771):

A knight moves in an L-shape on a standard 8 × 8
chessboard, "jumping" two spaces horizontally or vertically
and one space in the perpendicular direction. Can
a knight start at a square of the chessboard, make a sequence
of legal moves, and return to its starting square
having visited every other square on the chessboard exactly
once?

(a) Explain why the classical knight's tour problem is
solved by finding a Hamiltonian cycle in a graph.
That is, explain what the meaning of the nodes and
edges is in a graph so that a Hamiltonian cycle in
the graph gives a solution to the puzzle.

(b) Using a smaller 4 × 4 chessboard, solve the knight's
tour puzzle using the type of graph you described
above.

(c) Explain why it is impossible to solve the knight's
tour puzzle on a 5 × 5 chessboard. (Hint: Compare
the numbers of white and black squares.)

16. How many (base ten) digits would it take to write out
the number 99!? (Hint: The number of base ten digits
of a positive integer x is given by $\lfloor \log_{10} x \rfloor + 1$ (i.e., 1 more
than the greatest integer less than or equal to $\log_{10} x$),
and $\log_{10}(ab) = \log_{10} a + \log_{10} b$.)

17. Find a counterexample to the converse of Theorem 3.

18. Find a graph on five nodes that is a counterexample to
the converse of Theorem 4.


19. Explain how the following derives from Ore's theorem:
If G is a simple, connected graph on n ≥ 3 nodes for
which every node has a degree of at least n/2, then G is
Hamiltonian.

20. Explain how you can tell that the graph in Figure 7-128
is not Hamiltonian. (This graph is called the Petersen
graph, and it pops up frequently as an important example
in graph theory.)


Figure 7-128 Graph for Problem 20. 


21. On a recent trip to New England, we traveled round trip
from Harrisburg to Burlington, VT, passing along the
way through the capital cities of six other states. Given
the mileage chart shown in Table 7-22, use the vertex-greedy
algorithm starting with Harrisburg, PA to find
an approximation of a round trip using as few miles as
possible. (The capital cities are designated with the state
abbreviation, while Burlington is designated with a B.)
p M^ | s | NW] PA] Cr | vr | m 
NY mw | iss [aus | 23 [on | m | i 
MA + [zm | e | 2 wo | so 
B ae | + | ass | 46 | ms | » | 266 
NH |e |s| |m] is | m| 
PA | m | w [we [+ | 26 | as |s 
cr | m | as [is | 26 | + | am | a 
vr wo | 39 | n | s | w| 20 
RI [ s | 266 | m [ss i |æ- 











Table 7-22 Mileage Chart for Exercise 21 








22. Repeat the previous exercise using the vertex-greedy
algorithm starting with Boston, MA.

23. Refer to the mileage chart in Exercise 21. Use the edge-greedy
algorithm to find an approximation of the round
trip using the fewest possible total miles.

24. Refer to the mileage chart in Exercise 21. Find the lowest
mileage round trip involving just the state capitals
of CT, MA, NH, RI, and VT.


25. Imagine a graph with 20 vertices, containing the path
$x, a_1, a_2, a_3, a_4, a_5, a_6, a_7, a_8, a_9, a_{10}, a_{11}, a_{12}, a_{13}, a_{14}, a_{15}, a_{16}, a_{17}, a_{18}, y$.
Show that if it also contains the two edges $[x, a_{13}]$ and
$[y, a_{12}]$, then it contains a Hamiltonian cycle that begins
$x, a_1, \ldots$. Generalize. What is the largest possible value
for deg(x) + deg(y) if G is not Hamiltonian?


Chapter 7 Summary


7.1 Graph Theory


Terms and concepts 


• You should recognize that a graph consists of vertices
(nodes) and edges, and that each edge has one or two
endpoints. The edge is said to be incident with its endpoints.
Each node has a degree that measures how many
edges are incident with that node.

• You should recognize special types of edges and nodes:
loops, multiple or parallel edges, adjacent nodes.

• You should know what we mean by a walk, and be able
to calculate the length of the walk.

• You should be able to distinguish special types of walks:
trivial walk, trail, path, circuit, trivial circuit, cycle.

• You should understand that there are variations on the
fundamental graph idea (for example, simple graphs and
directed graphs) and that for certain applications one or
the other variation may better model the application.

• The notation {a, b} (or in some situations [a, b]) is used
to indicate the edge with endpoints a and b. In a directed
graph the ordered pair notation (a, b) emphasizes that
the order of the endpoints is significant.

• A graph may be connected, and in any case it will contain
subgraphs which are connected components of the graph.

• You should understand the meaning of Eulerian circuit,
Eulerian trail, and Eulerian graph.








Applying graph terminology, and Eulerian graphs 

• You should be able to draw a graph with specified nodes
and edges, using points for the nodes and lines for the
edges.

• You should be able to calculate the degree of a node.

• Within a given graph, you should be able to identify
walks, cycles, subgraphs, connected components, etc. In
short, you should be able to demonstrate your comprehension
of all the terms listed above by identifying them
in a graph.

• You should recognize and be able to apply the formula
$\sum_{i=1}^{n} \deg(v_i) = 2m$, where the $v_i$ are the vertices and m
is the number of edges in the graph.

• You should be able to determine if a given graph contains
an Eulerian circuit or Eulerian trail. If it does, you
should be able to construct that circuit or trail.


7.2 Proofs About Graphs and Trees 
Terms and concepts 


• You should know that a tree is a connected, simple graph
with no cycles, and you should be able to identify the
leaves of a tree.

• You should know what we mean by a spanning tree for a
graph.

• You should be familiar with weighted graphs, and with the
notion of a minimal spanning tree for a weighted graph.


Proofs


• You should be able to provide direct proofs of graph
properties (by starting with the hypotheses and establishing
that the conclusions must follow).

• You should be able to state the contrapositive of a given
theorem, and recognize that you can prove the theorem
by establishing that the contrapositive is true.

• You should be able to use induction to prove graph properties.
You should recognize that, in contrast to many
earlier induction proofs, it is not always P(m - 1) that
proves crucial in the proof.

• You should be able to illustrate steps in your proofs with
concrete examples.







Spanning trees 

• Given a graph, you should be able to construct a spanning
tree for that graph.

• Given a weighted graph, you should be able to use Prim's
algorithm to construct a minimal spanning tree for that
graph.


7.3 Isomorphism and Planarity
Terms, concepts, and formulas


• You should understand the terms isomorphism and
isomorphic graphs.

• You should know that a planar graph can be drawn (on
a plane) so that no edges cross, and that the drawing is
called an embedding of the graph in the plane.

• You should know the meaning of the term bipartite graph.

• You should recognize the notation $K_n$ for the complete
graph on n nodes, and the notation $K_{n,m}$ for the complete
bipartite graph on n, m nodes.

• You should be familiar with the five regular polyhedra
(the five Platonic solids), and their connection to the
question of planarity.

• You should know what is meant by a face of a planar
graph embedded in the plane.

• You should recognize Euler's formula for planar
graphs: V + F = E + 2.


Isomorphism 

• For small isomorphic graphs, you should be able to produce
the function that establishes the isomorphism.

• For small graphs, you should be able to use the idea of
"tacks and elastic string" to help decide if they are isomorphic.
(You may use the assistance of technology for
this.)

• You should be familiar with properties that isomorphic
graphs share, and be able to use the absence of those
properties to demonstrate that graphs are not isomorphic.


Planarity


• You should know that $K_5$ and $K_{3,3}$ are not planar, and
that any graph that is not planar contains a "copy" of
one of these graphs.

• You should be able to use the idea of "tacks and elastic
string" to help decide if a graph is planar. (You may use
the assistance of technology for this.)

• You should be able to use Euler's formula and Theorem
7 to demonstrate that a graph is not planar.





7.4 Connections to Matrices and Relations
Terms and concepts

© You should understand the use of an adjacency matrix 
M to represent a graph, and know the meaning of the 
notation My 

© You should know the meaning of a directed graph, where 
we represent the edges as ordered pairs, and the meaning 
‘of a walk in a directed graph. 

'® You should recognize the standard use of algebraic op- 
erations applied to matrices: for example, A + B, A - B, 
^. 

© You should understand the use of the Boolean product 
of matrices whose entries are 1s and Os, and in particular 
the notation M'!! used in this context. 


Matrix operations 


© For given matrices A and B, you should know how to 
calculate A + B and A -B (and you should know when 
these operations are/are not possible). Appendix B con- 
tains further information on matrix operations 

‘© You may have learned how to use computer software to 
calculate expressions involving matrices. 

© You should be able to compute Boolean products of ma- 
trices; in particular, you should be able to compute M% 
for the adjacency matrix M of a binary relation, 

9. You should be able to compute Boolean sums of matri- 
ces denoted A v B. 

© You should understand the connection between the usual 
product M* and the Boolean product M°, and the con- 
nection between the usual sum A + B and the Boolean 
sum A v B. 


Modeling graphs with matrices 


• You should know how to build the adjacency matrix M
for a graph or a directed graph.

• You should be able to compute the number of walks from
node to node of length 2, length 3, and so on, using matrix
multiplication.

• You should be able to compute the number of walks of
length k or less, using matrix multiplication and addition.

• You should be able to use Boolean matrix operations
to explore the existence of walks, and connectivity of a
graph.








Matrices, graphs, and binary relations 

• You should understand, and be able to exploit, the connection
between a binary relation, its associated graph,
and its associated adjacency matrix.

• In particular, you should be able to use Boolean matrix
operations to model relation composition.

• You should be able to use matrices to explore transitivity
and other properties of relations.


7.5 Graphs in Puzzles and Games
Terms and concepts


• In the context of the Instant Insanity puzzle, you should
know what we mean by a good subgraph of G.

• You should understand what is meant by a finitely progressive
game.

• For the directed graph of a game, you should understand
the significance of the kernel of the graph (also referred
to as the set of good nodes).

• In the context of the Nim games, you should know what
is meant by the direct sum of rows and the binary number
of the game position.


Simple puzzles and games 

• You should be able to use directed graphs to represent
puzzles where the nodes represent the legal states of the
puzzle, and the edges represent legal moves. (An example
is the wolves, goats, and cabbages game.)

• Similarly, you should be able to use directed graphs to
represent games where the nodes represent the possible
states of the game, and the edges represent legal moves.
(The simple stone-removal games are examples.)

• For small puzzles and games, you should be able to use
the graph to analyze the puzzle or game, answering questions
such as:
– Starting at a given state, is it possible to reach some
other given state?
– Which player has a winning strategy?

• You should know the properties that characterize the
kernel of a graph, and be able to construct the kernel for
games with small graphs.


Instant Insanity and Nim 


• You should understand the particular application of
graphs to Instant Insanity puzzles:
– The nodes represent colors and the edges represent
opposite faces of a cube.
– You should know how to use a good subgraph of G
to construct a partial solution, working with just the
front and back columns while ignoring the other pair
of columns.
– You should know how to use a second good subgraph
of G to move from the partial solution to a complete
solution.

• You should know how to use binary numbers to analyze
the classic game of Nim and variations of the game.





7.6 Excursion: Binary Trees 

Terms and concepts 

• You should be familiar with the terms tree and binary
tree, and with a large variety of terms used to describe
trees: node, root, leaf, internal node, child, parent, sibling,
subtree.

• You should know the related concepts of levels within a
tree, and the height of the tree.

• You should understand the particular type of binary tree
known as a binary search tree.

• You should know about traversals of a binary tree (preorder,
inorder, and postorder). For expression trees, you
should be familiar with the related terms infix notation,
prefix notation, and postfix notation.


Applications and proofs involving tree concepts 


• You should be able to illustrate the various concepts (subtree,
leaf, etc.) for specific trees.

• You should be able to illustrate the method of using binary
trees to represent algebraic expressions.

• For a binary search tree, you should be able to describe
and trace the algorithms for searching for a particular
value, and for inserting a new value.

• You should be familiar with the connection between binary
trees and bounds on the number of comparisons
needed to sort a list.

• You should be able to give the preorder, inorder, and
postorder prints for a given binary tree.

• You should be able to use the recursive definitions of binary
tree, height, etc., to develop proofs by induction for
various properties of binary trees.











7.7 Excursion: Hamiltonian Cycles and the TSP


Terms and concepts


• You should recall the terms path and cycle, and understand
their use in describing Hamiltonian circuits and
Hamiltonian graphs.

• You should know what is meant by a traveling salesperson
problem (TSP) and its connection to the concept of
weighted graphs.

• You should recognize the notation $K_n$ for the complete
graph on n nodes.

• You should realize that the TSP problem is an example
from the class of NP-complete problems.

• You should understand the terminology used in the section
for two algorithms for obtaining approximate solutions
to the TSP: the vertex-greedy algorithm, and the
edge-greedy algorithm.


Hamiltonian cycles


• For small graphs, you should be able to solve the TSP
by listing all possible Hamiltonian cycles and calculating
the weights for each.

• You should be able to apply the vertex-greedy algorithm
and the edge-greedy algorithm to particular weighted
graphs.

• You should realize that there are no known conditions
which are both necessary and sufficient for a graph to be
Hamiltonian.




















Rules of the Game 


Throughout this book we refer to games played by people all over the world. 
However, since not every game is familiar to everyone, we devote this short 
appendix to an overview of the terms and rules associated with some of these 
games. Of course, more details on any of them are simply a click away on 
any web search engine. 


Cards 





A standard deck of cards consists of 52 cards, each of which consists of a value and a
suit. The value of a card is one of ace, 2, 3, 4, 5, 6, 7, 8, 9, 10, jack, queen, king, and
the suit of a card is one of club, diamond, heart, spade. Examples of cards are 3 of
diamonds, queen of hearts, and ace of spades.

There are four cards with each possible value and there are thirteen cards of
each possible suit. Cards whose values are jack, queen or king are called face cards.
Aces are considered the highest value or the lowest value in the deck, depending on
the game or the situation. When cards are distributed in a game, we say that cards
are dealt, and the particular set of cards that a player has is called a hand of cards.

A few specific card games are mentioned in problems and examples throughout 
the book. Although knowledge of the rules and strategies for playing these games 
is not necessary for completing the problems, it might be useful to know a little bit 
about them. 




Blackjack 


Blackjack is a game in which the initial hand consists of a face down card and a face
up card. The value of the hand is simply the sum of the values of the two cards in the
hand, with some simple interpretations: Face cards all have a value of ten. The player
decides on an individual basis whether an ace is worth one or eleven. The object
of the game is to draw cards until the sum of the values is as close as possible
to 21 without exceeding it. For this reason the game of blackjack is sometimes called
Twenty-one.





Poker 


In poker, a hand consists of five cards. There are many variations of poker that involve
the different ways in which these five cards are obtained, but these are irrelevant to
the basic way hands are compared.

1. A hand is said to contain a pair if exactly two cards have the same value and the
other three cards have three different values.
2. A hand is said to contain two pairs if there are two values such that exactly two
cards have one value and exactly two cards have the other value.
3. A hand is said to contain three of a kind if exactly three cards have the same value
and the other two cards have two different values.
4. A hand is said to contain a straight if all five card values are consecutive.
5. A hand is said to contain a flush if all five cards are of the same suit.
6. A hand is said to contain a full house if two cards have one value and three cards
have another value.
7. A hand is said to contain a four of a kind if four of the cards have the same value.
8. A hand is said to contain a straight flush if it contains a straight which is also a
flush.

When comparing hands in poker, a hand lower in the list above beats a hand
that is higher in the list.
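
The eight definitions above translate directly into a small classifier. The following Python sketch is an illustration only: the name `classify`, the representation of a card as a (value, suit) pair, and the label "none of the above" for a hand that fits no category are all assumptions. It also assumes that an ace may complete a straight at either end (ace, 2, 3, 4, 5 or 10, jack, queen, king, ace).

```python
from collections import Counter

VALUES = ["ace", "2", "3", "4", "5", "6", "7", "8", "9", "10",
          "jack", "queen", "king"]


def classify(hand):
    """Name the strongest listed category that a five-card hand fits."""
    ranks = sorted(VALUES.index(value) + 1 for value, _suit in hand)
    counts = sorted(Counter(ranks).values(), reverse=True)
    is_flush = len({suit for _value, suit in hand}) == 1
    # Five distinct values that are consecutive, with the ace low or high.
    is_straight = counts == [1, 1, 1, 1, 1] and (
        ranks[4] - ranks[0] == 4 or ranks == [1, 10, 11, 12, 13]
    )
    if is_straight and is_flush:
        return "straight flush"
    if counts == [4, 1]:
        return "four of a kind"
    if counts == [3, 2]:
        return "full house"
    if is_flush:
        return "flush"
    if is_straight:
        return "straight"
    if counts == [3, 1, 1]:
        return "three of a kind"
    if counts == [2, 2, 1]:
        return "two pairs"
    if counts == [2, 1, 1, 1]:
        return "pair"
    return "none of the above"


hand = [("4", "clubs"), ("4", "hearts"), ("9", "spades"),
        ("9", "clubs"), ("king", "hearts")]
print(classify(hand))  # two pairs
```

The checks run from the bottom of the list upward so that, for example, a straight flush is not reported as a mere flush.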


Sports 





In our study of games, we use several examples of sports played between two players 
or two teams. We summarize here some of the terminology used for those examples. 


Series 


In many situations, teams play a series of games to determine a winner. In tennis, the
first player to win two sets wins the match in many tournaments. In the first round
of professional playoffs, two teams play games until one team wins three games. In
the final round of playoffs for baseball, basketball and hockey, two teams play until
one team wins four games. These formats are called, respectively, best of three, best
of five, and best of seven series.

In less formal settings, people might decide something based on a coin flip and
decide to use a best two out of three rule. Or perhaps two siblings play a travel game
and decide to use a best five out of nine rule to make it more interesting. In all of
these cases, the underlying structure of repeating a basic component until one player
has first won a certain fixed number of times is the same.








Baseball and Softball 


Few sports are as inundated with numerical data as baseball. Players are identified
by their statistics in traditional categories, media frenzy follows lengthy streaks, and
legends are affirmed when sacred records are broken. So common is the language
among fans, it is easy to forget that it is not a common language for everyone. Hence,
a short lesson in terminology is in order here.

Central to baseball and softball is the competition between pitcher and batter
that occurs dozens of times in a game. We refer to one such occurrence as a plate
appearance for the batter. To simplify the terminology, we refer to an outcome of this
confrontation as taking the form of a hit, a walk, or an out. All less common outcomes
are equivalent to one of these for the purposes of the problems we will consider. For
example, a player's batting average is reported as the ratio of the number of hits to
the total number of hits and outs. (The sum of the number of hits and outs is called
the number of at-bats, considered to be the number of fair chances for the player to
hit.) For example, a player who has 3 hits, 2 walks, and 5 outs in 10 plate appearances
has a batting average of 0.375.
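
As a small illustration of the definition, the calculation in the example can be written as a Python sketch. The function name `batting_average` is hypothetical; walks are deliberately left out because they do not count as at-bats.

```python
from fractions import Fraction


def batting_average(hits, outs):
    """Ratio of hits to at-bats, where at-bats = hits + outs (walks excluded)."""
    at_bats = hits + outs
    return Fraction(hits, at_bats)


# 3 hits, 2 walks, and 5 outs: the 2 walks play no role in the average.
average = batting_average(hits=3, outs=5)
print(average)                  # 3/8
print(f"{float(average):.3f}")  # 0.375
```

Using an exact fraction avoids rounding until the final three-decimal display; the sketch does not handle a player with no at-bats.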

















Tennis 


In tennis, two players play points. A sequence of points comprises a game, a sequence
of games makes a set, and a sequence of sets makes a match. In each case, there is a
target number necessary for winning, and there are various rules for breaking ties.

1. To win a game, a player must score four points before his or her opponent does,
with the extra rule that a game must be won by at least two points. This extra
rule keeps games going for many points on occasion. (At the 1995 Wimbledon
Finals, Steffi Graf and Arantxa Sanchez Vicario played a game that lasted
32 points before Graf won the game.)

2. To win a set, a player must win six games before her opponent does, also with the
extra rule that a set must be won by at least two games. However, if a set is tied
six games apiece, most tournaments play one game (with special rules) called a
tiebreaker which determines the winner of the set.

3. To win a match, a player must be the first to win either two or three sets depending
on the rules of the tournament.
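
The rule for winning a game (reach four points and lead by at least two) can be expressed as a short Python sketch. The function name `game_winner` and the encoding of a game as a string of the letters A and B, one letter per point won, are assumptions made for illustration.

```python
def game_winner(points, target=4, margin=2):
    """Return (winner, number of points played) once the game is decided.

    Returns None if the listed points never decide the game.
    """
    score = {"A": 0, "B": 0}
    for count, player in enumerate(points, start=1):
        score[player] += 1
        lead = abs(score["A"] - score["B"])
        if max(score.values()) >= target and lead >= margin:
            return player, count
    return None


print(game_winner("AAAA"))      # ('A', 4)
print(game_winner("ABABABAA"))  # ('A', 8)
print(game_winner("ABAB"))      # None
```

A list of points describes a real game only if the game is decided exactly at its last point, that is, if the second value returned equals the length of the list. Other "first to a target, win by a margin" formats, such as a table tennis game, can be modeled by changing the two parameters.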











Miscellaneous Games 








A few other games mentioned in the book appear only once or twice, so the reader
needs to know less about them. Here are brief surveys of these games:


• Table tennis is different from tennis. In table tennis, a game is a sequence of points
with a target number (like 21) necessary for a win. In table tennis, it is usually
required that the winner of a game must win by two. That is, if the score in the
game is 20-20, then the game will not end until one player has scored two more
points than his opponent.

• Bridge is a card game with many subtle rules, but its basis is the distribution
(dealing) of the entire deck of cards into four 13-card hands. The order in which
the player receives his cards is irrelevant to the subsequent play of the game.

• Scrabble is a word game played with tiles with one letter on each. The game
board is a large grid of rectangles that match the dimensions of the tiles. The goal
of the game is for the player to interweave her tiles with those already played on
the board to form new words. Each letter has a value, and the player's score on a
turn is based on the sum of the values of the letters he has used.

SCRABBLE ® & © 2005 Hasbro, Inc. Used with permission.


Exercises for Appendix A


1. If you received each of the following hands in a game
of poker, what would you call it?
(a) {5 of hearts, jack of spades, jack of clubs, 5 of spades,
jack of hearts}
(b) {2 of diamonds, 7 of diamonds, ace of diamonds, 9
of diamonds, king of diamonds}
(c) {9 of diamonds, 10 of spades, 6 of spades, 8 of clubs,
7 of spades}
(d) {6 of hearts, queen of clubs, 3 of hearts, 5 of diamonds,
3 of clubs}
(e) {ace of spades, ace of diamonds, ace of clubs, 2 of
clubs, ace of hearts}
(f) {3 of hearts, 7 of clubs, 3 of diamonds, 3 of clubs,
king of clubs}
(g) {queen of spades, 4 of diamonds, 4 of hearts, 8 of
hearts, queen of diamonds}

2. Choose a card that would complete the hand type
given.
(a) Flush: {____, king of hearts, jack of hearts,
2 of hearts, 3 of hearts}
(b) Pair: {____, 8 of diamonds, ace of hearts, 4
of clubs, queen of clubs}
(c) Four of a kind: {____, 2 of clubs, 2 of diamonds,
5 of hearts, 2 of spades}
(d) Full house: {____, 9 of clubs, 10 of spades,
10 of hearts, 9 of diamonds}
(e) Two pair: {____, queen of diamonds, queen
of spades, 7 of clubs, of spades}
(f) Straight: {____, king of diamonds, 10 of diamonds,
jack of hearts, queen of spades}
(g) Three of a kind: {____, 3 of spades, 6 of
clubs, 3 of clubs, ace of hearts}

3. For each of the hands in Exercise 2, how many different
cards can be put in the blanks?

4. During the first week of the baseball season, John got 5
hits, 3 walks, and 12 outs. What is his batting average?

5. How many consecutive hits would John need to raise
his batting average to at least 0.400?

6. A batter's on-base percentage can be calculated using
the formula (walks + hits)/(walks + at-bats). Using the
information from Exercise 4, what is John's on-base percentage?

7. Kelly exactly doubled her season batting average in one
game by getting 3 hits in 4 at-bats. How many total hits
does she have this season? (Hint: There are two solutions.)

8. Kelly improved her season batting average (which was
greater than 0) by exactly 1/5 (i.e., 0.200) by getting 4
hits in 4 at-bats in today's game. How many hits does
she now have this season? (Hint: There is a unique
answer.)

9. We can represent a best-of-five series between teams A 
and Basa list that denotes the order in which the games 
were won. For example, in the series AABA, team A 
won games 1, 2 and 4, and team H won game 3. Which 
of the following lists of letters does not correspond to a 
real best-of-five series 





7 





(a)AABBB (b) BABAB 
(d)AAABB (c) ABABB 
10. We can represent a tennis game between players A and 
B as a list that denotes the order in which the points 
were won. For example, in the game AABAA, player 


(©) AABAA 


Answers to the Exercises 


1. (a) Full house 
(b) Flush 
(s 
(d) Pair 


(e) Four ofa kind 
(D Three of a kind 
ight (g) Two Pair 





2 There is more than one solution for each question. 
Listed here is one possible answer for each. 
(a) 9 of hearts e) 7 of diamonds 
(b) Ace of spades (D Ace of clubs 
(©) 2of hearts p) 3 of diamonds 
(d) 10 of clubs 








Miscellaneous Games. 617 


B won only the third point. Which of the following lists 
of letters does not correspond to a real tennis game? 





(Remember the tie-breaking rules in tennis games.) 
(a)AABBBABB (b)BABBB (c) AABAABBB 
(d)AAABBA (c) ABAABBB 

3 a9 6 

(b) 12 os 

i w2 

ws 


A. 5/17, which would normally be written as 0.294. 

5. 3 consecutive hits 

6. 2/5, which would normally be written as 0.400, 

T. 4or5 

& 10 

9. Series (c) should have ended after four games. 

10, Game (c) should have ended after the fifth point. 
(c) is not yet over. 
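The batting-average answers to Exercises 4 through 6 can be checked with exact fractions. The following is only a sketch: the function names are made up for illustration, and it assumes (as the on-base formula in Exercise 6 suggests) that walks do not count as at-bats, so at-bats = hits + outs.

```python
from fractions import Fraction


def batting_average(hits, at_bats):
    """Hits divided by at-bats, kept as an exact fraction."""
    return Fraction(hits, at_bats)


def on_base_percentage(hits, walks, at_bats):
    """(walks + hits) / (walks + at-bats), the formula from Exercise 6."""
    return Fraction(walks + hits, walks + at_bats)


# John's first week: 5 hits, 3 walks, 12 outs.
hits, walks, outs = 5, 3, 12
at_bats = hits + outs  # assumption: walks are not at-bats

avg = batting_average(hits, at_bats)
obp = on_base_percentage(hits, walks, at_bats)

# Smallest number of consecutive hits that lifts the average to 0.400.
extra = 0
while batting_average(hits + extra, at_bats + extra) < Fraction(2, 5):
    extra += 1

print(avg, round(float(avg), 3))  # 5/17 0.294
print(obp)                        # 2/5
print(extra)                      # 3
```

Using `Fraction` avoids rounding questions when testing whether an average is exactly 0.400.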








Matrices and Their 
Operations 


In Sections 6.6 and 7.4 of this book, we present matrices and matrix operations
as they pertain to specific concepts in discrete mathematics. This brief
appendix serves as a refresher or a source of more practice for those students
who are less familiar with these ideas.





Matrix Operations 





An $m \times n$ matrix is a rectangular array with $m$ rows and $n$ columns. Objects in a matrix
$M$ are referenced by writing $M_{i,j}$ for the entry in Row $i$, Column $j$. Traditionally
Row 1 is topmost and Column 1 is leftmost. When the objects in the matrix have a
well-defined system of arithmetic (as do numbers, for example, since we know how
numbers can be added, subtracted, multiplied and divided), then we can define a
system of arithmetic on the matrices themselves.


Example 1  To compute the sum

\[
\begin{pmatrix} 1 & 1 & 3 & -1 \\ -1 & 2 & 3 & -1 \\ 2 & 0 & 0 & -1 \end{pmatrix}
+
\begin{pmatrix} 1 & -1 & 3 & 2 \\ -2 & 0 & 2 & -1 \\ -2 & 0 & 0 & 1 \end{pmatrix}
\]

we simply add each entry in A with the entry in B in the same position:

\[
\begin{pmatrix} 1 & 1 & 3 & -1 \\ -1 & 2 & 3 & -1 \\ 2 & 0 & 0 & -1 \end{pmatrix}
+
\begin{pmatrix} 1 & -1 & 3 & 2 \\ -2 & 0 & 2 & -1 \\ -2 & 0 & 0 & 1 \end{pmatrix}
=
\begin{pmatrix} 2 & 0 & 6 & 1 \\ -3 & 2 & 5 & -2 \\ 0 & 0 & 0 & 0 \end{pmatrix}
\]




It is easy to turn this into a formal definition using the notation for the individual
entries of a matrix.





Definition  If $A$ and $B$ are both $m \times n$ matrices (so $A$ and $B$ have exactly
the same size), then the sum $A + B$ is the $m \times n$ matrix $C$ with

\[ C_{i,j} = A_{i,j} + B_{i,j} \]











Example 2  To compute the product





ES 
1-13 4 1 
-4 0} A 
0 00 1 : 

0 


we take each row of the first matrix times each column of the second matrix using the
following "row-column multiplication":


(Row 2 of A) - (Column 1 of B) = [ 





)o+o(3) *G) )«e»o 


It is a little more complicated (but elegant with the use of summation notation)
to turn this into a formal definition.





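Definition  If $A$ is an $l \times m$ matrix and $B$ is an $m \times n$ matrix (so the number
of columns in $A$ is the same as the number of rows in $B$), then the product $AB$
is the $l \times n$ matrix $C$ with

\[
C_{i,j} = (A_{i,1})(B_{1,j}) + (A_{i,2})(B_{2,j}) + (A_{i,3})(B_{3,j}) + \cdots + (A_{i,m})(B_{m,j})
        = \sum_{k=1}^{m} (A_{i,k})(B_{k,j})
\]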










There are special kinds of matrices that are impervious to the effects of addition 
and multiplication, in much the same way that the numbers 0 and 1 are impervious 
to the effects of addition and multiplication (respectively) of numbers. 










Definition  For any positive integer $n$, the $n \times n$ identity matrix, denoted
$I_n$, is the matrix consisting of 0's everywhere except along the main (top-left to
bottom-right) diagonal where there are 1's, and the $n \times n$ zero matrix, denoted
$0_n$, is the matrix consisting of 0's everywhere. When the size of the matrix is
clear, we simply use $I$ and $0$ for these matrices.








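Example 3  Show that $I_4$ and $0_4$ perform as expected under multiplication and addition.

SOLUTION  Let

\[ B = \begin{pmatrix} 1 & 0 & 0 & 3 \\ 5 & -2 & 1 & 0 \\ 7 & 1 & 0 & 5 \\ 0 & 4 & 1 & 0 \end{pmatrix} \]

Then

\[
B \cdot I =
\begin{pmatrix} 1 & 0 & 0 & 3 \\ 5 & -2 & 1 & 0 \\ 7 & 1 & 0 & 5 \\ 0 & 4 & 1 & 0 \end{pmatrix}
\begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix}
= B
\]

and

\[
B + 0 =
\begin{pmatrix} 1 & 0 & 0 & 3 \\ 5 & -2 & 1 & 0 \\ 7 & 1 & 0 & 5 \\ 0 & 4 & 1 & 0 \end{pmatrix}
+
\begin{pmatrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix}
= B
\]

as expected.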
Matrix Arithmetic with Technology 





We will use the TI-83 calculator for the specific instructions on how to perform matrix
arithmetic with a handheld calculator. Other calculators function similarly. See your
user manual for how to do this on your calculator.


Enter or Edit Matrices 


• Press the [MATRIX] button. You will see a list of matrix names, [A], [B], etc., in
a vertical list and the menu options [NAMES], [MATH] and [EDIT] across the
top of the screen.

• Use the cursor arrows to highlight the [EDIT] menu option and then the matrix
you wish to create or edit. For this example, choose the matrix [A].










• On the EDIT screen, you will now see MATRIX[A] followed by the dimensions
(rows × columns) of the matrix. For this example, let's designate that A is a 3 × 4
matrix, and then use the cursor arrows to set the entry values to


“| 


After editing the matrix, press [QUIT] to return to the main screen.
• Similarly you can EDIT matrix [B] to create

\[ B = \begin{pmatrix} 1 & 0 & 1 \\ -1 & 2 & -3 \\ 2 & 0 & 0 \end{pmatrix} \]

After editing the matrix, press [QUIT] to return to the main screen.





Arithmetic Operations 

• To perform arithmetic on matrices, you will enter the arithmetic expression normally,
but you will get the names of the matrices involved from the matrix menu.
For example, to find the sum B + B² for the matrix above, do the following steps
from the main screen:

-Press [MATRIX] and choose the NAMES menu option with matrix [B] selected.
Pressing [ENTER] returns you to the main menu with the expression [B] displayed.

-Press +.

-Press [MATRIX], choose the NAMES menu option with matrix [B] selected,
and press [ENTER]. You will see the main menu with the expression [B] + [B]
displayed.

-Press ‘x²’ to complete the expression [B] + [B]², and press [ENTER] to evaluate it.

• The identity matrix can be quickly referenced on the calculator. For example, to
compute the sum B + I, do the following steps:

-Press [MATRIX] and choose the NAMES menu option with matrix [B] selected.
Pressing [ENTER] returns you to the main menu with the expression [B] displayed.

-Press +.

-Press [MATRIX], choose the MATH menu option and select option 5 (identity).
You will see the main menu with the expression “[B]+identity(” displayed. The
identity function must be told the dimensions of I that you want.

-Press ‘3)’ to complete the expression [B] + identity(3), and press [ENTER] to
evaluate it.

• The inverse of an $n \times n$ matrix $A$ is the $n \times n$ matrix denoted $A^{-1}$ with the
property that $A \cdot A^{-1} = I$. Finding inverses by hand can be time consuming, so the
calculator is a good tool for doing this. For example, to find the inverse of the
matrix B above and check that it is correct, we can do the following steps:

-Press [MATRIX] and choose the NAMES menu option with matrix [B] selected.
Pressing [ENTER] returns you to the main menu with the expression [B] displayed.










-Press the ‘x⁻¹’ button. You will see the main menu with the expression [B]⁻¹
displayed.

-Press [ENTER] to evaluate the expression. The matrix shown is supposed to be
the inverse of B.

-To check this, press the [ANS] button, the multiplication sign ‘×’, and then
[MATRIX] → [NAMES] → [B] → [ENTER]. You will see the main menu
with the expression Ans * [B] displayed. This means to multiply the previous
answer (which was B⁻¹) by the matrix B.

-Press [ENTER] to evaluate the expression. Since the answer is the identity
matrix, we know your answer for the inverse of B was correct.

Explore more on the Web.





There are many computer programs, both free and commercial, that will perform
matrix operations. On the web page for this book, there is a tool for computing the
matrix powers relevant to the Markov chains in Section 6.6 and the composition of
relations in Section 7.4.
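For readers without a calculator at hand, the same find-the-inverse-then-check routine can be sketched in a few lines of code. This is an illustration only, not the tool from the book's web page: it uses Gauss-Jordan elimination (a technique this appendix does not cover) with exact fractions, and the matrix `S` is a hypothetical example.

```python
from fractions import Fraction


def identity(n):
    """The n x n identity matrix: 1's on the main diagonal, 0's elsewhere."""
    return [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]


def mat_mul(A, B):
    """Row-column product of two compatible matrices."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]


def inverse(A):
    """Invert a square matrix by Gauss-Jordan elimination on [A | I]."""
    n = len(A)
    eye = identity(n)
    M = [[Fraction(x) for x in row] + eye[i] for i, row in enumerate(A)]
    for col in range(n):
        # Find a row at or below the diagonal with a nonzero pivot.
        pivot = next((r for r in range(col, n) if M[r][col] != 0), None)
        if pivot is None:
            raise ValueError("matrix has no inverse")
        M[col], M[pivot] = M[pivot], M[col]
        p = M[col][col]
        M[col] = [x / p for x in M[col]]          # scale pivot row to 1
        for r in range(n):
            if r != col and M[r][col] != 0:       # clear the rest of the column
                f = M[r][col]
                M[r] = [x - f * y for x, y in zip(M[r], M[col])]
    return [row[n:] for row in M]


# Hypothetical matrix, chosen only for illustration.
S = [[2, 1], [5, 3]]
S_inv = inverse(S)
print(S_inv == [[3, -1], [-5, 2]])        # True
print(mat_mul(S, S_inv) == identity(2))   # True, so S_inv really is the inverse
```

As on the calculator, the final multiplication is the check: the product with the original matrix should be the identity.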





Exercises for Appendix B 


1. Given matrices A and B, compute each of the following,
or explain why the expression makes no sense:


13-1 0 d 
23 -1|adB-|-12 -3 
00-1 20 0 


e 
2. For each of the following equations, choose your own
2 × 2 matrices A and B and check if the equation is true.
(a) A + B = B + A
(b) A · B = B · A
(c) A · (B + I) = A · B + A
6» At=L-4 

3. Each of the following statements is false. Find a counterexample to each.

(a) For all 2 × 2 matrices A and B, A · B = 0 only if A = 0 or B = 0.

(b) For every 2 × 2 matrix A, A² = I only if A = I.

(c) For every 2 × 2 matrix B ≠ 0, there exists a matrix
A such that A · B = I. (We would say that A is the
inverse of B in this case.)

4. Match each matrix on the left with a matrix on the right
so that the pair can be multiplied together to equal I.
(Matrices in such a pair are said to be inverses of each
other.)











wp: 


2 3 
sp: 


af. 


5. Finding inverse matrices is a topic we will leave for another
course, but there is a case where it is pretty easy to
figure out what to do. This exercise will lead you through
this discovery process.

(a) Through trial-and-error find the inverse of each of
the following matrices. To be more formal, you can
multiply each by a generic 2 × 2 matrix and
determine what its four variables must be to have
the product equal to I.

24 
e«[; i] 


14 24 
[io si] 
(b) In terms of a and c, what is the inverse of the ma- 
sia - [s 


nan 
L 
204 





$c |? What conditions on a and cae 


necessary for the inverse of A to exist? 





(©) In terms of a, b and c, what is the inverse of the ma- 


ina - a 5 raconte bande we 


necessary for the inverse of A to exist? 

6. The transpose of a matrix is the matrix formed by interchanging
the rows and columns of the original. We
denote the transpose of A by writing $A^T$. For example,


] 


Given the matrix A above along with matrices B and C 
below, compute each of the following, if possible. 


3 0 





itA- 


m [ 


vuna 


2 
1 
2 


22:1: 4010 
B=|0 1 1|auc=| 919 

012 
3102 


Answers to the Exercises 
1. (a) A and B do not have the same number of columns,
so they cannot be added.


(b) The length of a row in A is different from the length 
of a column in B, so A - B makes no sense. 





1-1-3 0 
()HA-|-9 3 3 2 
Z 2 6-2 
30-2 
Eg-|- 4-7 
202 
(e) Since A is not a square matrix, A · A is not defined.
2 Solutions will vary The answers below use A= 


[5 s] i] 


si 
mascasnsa-[ 1] 





tacta as [ 7 mana 
7 0 










(2) Acar 
() ATA 
(0 A-A «C 
(AT ARB 

7. For the matrices A, B, and C from Exercise 6, use a calculator
to complete the following computations (if possible).
On the TI-83, the transpose operator can be found
in the MATH menu of the MATRIX screen.
(a) $(A \cdot A^T) \cdot C$
(b) $(A^T \cdot A) \cdot B$
(c) $(A^T \cdot A)^{-1}$
(d) $(A \cdot A^T)^{-1}$

8. For the matrix B from Exercise 6, use a calculator to
complete the following computations (if possible).
a) BB BE 
œ 0— 5) 
(© (8 B)- (8-1 


-6 
7 





= A s+a-[ 
[ 1 


wasara=| J-a 
23 8 


3 
| so they are equal, 
i y are eq 








02 i 
3 (a) A= and B= | then the product 
03 0 o 
o 0 
A-B- . 
00 
10 
oaf 


02 
(b) iA | then 4; 
12 0. 
] essem 








Urey generc) then A Bo |^ 5 | and this prod- 


uct can never be I since it has all 0's on the bottom 


4. (a) and (i): (b) and (i); (c) and (iv); and (d) and (iii) 


dope sal te 
P3] 











& (a) A^ 







Va 


ILI 
[i] 


a#0andb#0, 





(b) In gener 


a0. 





(©) In general, 


6 (a) AAT 


E! 
s 


waas | 9 


10 


a 
59 


WAAT 


WAA 


3 peeve 














16 n2 79 147 
2 324 
7. (a) (A-A)-C 
us 126 88 172 
161 175 123 237 
18 sS 74 
© (A7-A)-B=|18 6 B 
OAA = 
(4) (A - AT)" does not exist. 
6» 257 331 
Ri) B+ +R 4 B+ R=] 0 ss os 
0 8 143 
i i 
&)B-D-|o -1 1 
0 1 0 
62 1 
()(*-B.(B-)'-| 0 ss s 
0 88 143 





Selected Answers and 
Hints for Discrete 
Mathematics 


Section 1.1 Exercises 


1. (a) hsdC (c) hsdC 
3. (a) person 5 is last and 14 is next-to-last; (c) person 15 is last and 7 is next-to-last

4. The game tree is shown below; 12 of 20 = 60% are five-set matches.

[Figure: game tree for the match, branching from START]

7. (a & b) In the game tree, the first branch represents the
nickel, the second branch the dime, and the third branch
the quarter. For example, HTT means the nickel is heads
and the dime and quarter are tails. The systematic list
appears on the right.

[Figure: game tree and systematic list for the coin tosses]





(c) Getting exactly two heads is more likely than getting
three heads.

9. Relabeling H's in Problem 7 with F's makes the game 
tree identical to the one in Problem 8. 
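The count in answer 4 (12 of the 20 possible matches go to five sets) can be reproduced by listing every win sequence systematically, much as the game tree does. The sketch below is only one way to do this; the function names are made up, and it assumes a match stops as soon as one player has three set wins.

```python
from itertools import product


def is_valid_series(s, wins_needed=3):
    """True if the string of 'A'/'B' wins ends exactly when someone clinches."""
    a = b = 0
    for i, winner in enumerate(s):
        if winner == "A":
            a += 1
        else:
            b += 1
        if a == wins_needed or b == wins_needed:
            # The series is over here; it is valid only if nothing follows.
            return i == len(s) - 1
    return False  # nobody reached the required number of wins


def all_series(wins_needed=3):
    """Every valid win sequence for a best-of-(2 * wins_needed - 1) contest."""
    longest = 2 * wins_needed - 1
    found = []
    for length in range(wins_needed, longest + 1):
        for outcome in product("AB", repeat=length):
            s = "".join(outcome)
            if is_valid_series(s, wins_needed):
                found.append(s)
    return found


series = all_series()
five = [s for s in series if len(s) == 5]
print(len(series), len(five))        # 20 12
print(is_valid_series("AABA"))       # True
print(is_valid_series("AABAA"))      # False: A had already won after four
```

The same validity test applies to the best-of-five series lists in Appendix A.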


Section 1.2 Exercises 


L (a) ay = 44-1 +2 with a 
dn Qn —1) with a 
2-an- with ay = lora, 
m = 2, Of ag =3n—1; 6)a =2- 

In = 5.25: (K) dn = a, 

(m) a, = a, +3 with a; 





























5 (a) ay 

6. (a) true; (c) not true.

7. (a) ag = ap) +3and a; = 4 (e)an = Gy; +2and a = 
95 (€) dy = dy +7 and ay =1 

8 (a) ay 


Section 1.3 Exercises 


1. (a) B is telling the truth; (b) only A is telling the truth 
for sure, and we also know that exactly one of B or C is 
truthful, but we cannot tell which. 

3. (a) ¬p ∨ q; (b) p ∧ q; (c) p ∧ ¬q

5. (a) f ∧ (¬m); (c) (¬f) ∧ (¬m), or ¬(f ∨ m)

6, (a) This person is a male math major. (e) This person is 
a female, and she is either over age 30 or a math major. 

7. (3) End AN: (e) (EVAYAT(EAN), or (r^ (CR) v 
(h^ C0) 

8. (a) Bill is tall or dark or handsome, but not all three. 
©) Bill is dark, but not tall and light- 








11. (See Figure 1-2) All ways of drawing the envelope must
start at corner 1 and end at 2, or start at 2 and end at 1.


12. For game (a) the move shown in bold guarantees a win.
In (b), you will lose if your opponent knows what she is
doing.























2.5 
21. (a), 
23. (a) 1S and 30 
24 (0) Ei, 4k X145. 
27. (a) i Anne 
i. The person with name tag 5. 
(b) i Player 
fi, Players 5,6, 1,2, and 3 are left, in that order. 
iii, Player S will win. 





3. (@) (x > 0) A(y>0): (e) (( » 0) ^ (y 0) Ves 
a>) 





a “eva |P^Cpv a) 





1L (a) 


ER — —— 2; S 2 &  — EJ 








w 


amddanda || a 





"4050494 
madd 





12. (a) Let h and t represent the same statements as in the
example, and p represent “the snack bar makes a profit.”
Then the statement can be written ¬h ∨ (t ∧ p), and it
has the following truth table:

h  t  p | t ∧ p | ¬h ∨ (t ∧ p)
T  T  T |   T   |      T
T  T  F |   F   |      F
T  F  T |   F   |      F
T  F  F |   F   |      F
F  T  T |   T   |      T
F  T  F |   F   |      T
F  F  T |   F   |      T
F  F  F |   F   |      T











(c) Let f represent “the staff is friendly” and p represent
“the staff is very well paid.” Then the statement
can be written ¬p ∧ f, and it has the following truth
table:

f  p | ¬p ∧ f
T  T |   F
T  F |   T
F  T |   F
F  F |   F


Section 1.4 Exercises 
L (9) (> 0) A(y> 0): © (1 0) v(y 0) A (s - 
0) ^ Qr» 0) 
3 (a) 24,68 
(5) 6,7.8,9 
68 
«a 
(©) 3,6,9 
25 
5 (a) 24,6810 
(b) None of the elements of D (The predicate R(n) is 
true for all the elements of D.) 










13. (a) h ∧ (¬t ∨ ¬p). Everyone is hungry at mealtime, and
either everyone is not tired or the snack bar does not
make a profit. (c) p ∨ ¬f. The staff is well paid, or they
are not friendly.

14. (b > 600) ∨ (m > 25). The negation is (b ≤ 600) ∧ (m ≤ 25).

16. The equivalence follows from the fact that the indicated 
columns of the truth table are identical. 




















p  q | p ∧ q | ¬(p ∧ q) | ¬p | ¬q | ¬p ∨ ¬q
T  T |   T   |    F     | F  | F  |    F
T  F |   F   |    T     | F  | T  |    T
F  T |   F   |    T     | T  | F  |    T
F  F |   F   |    T     | T  | T  |    T
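Comparing two columns of a truth table is mechanical, so it is easy to automate. The sketch below (the helper names are made up for illustration) builds the column for each statement in the same row order as the tables in this section, T before F, and declares two statements equivalent when their columns are identical.

```python
from itertools import product


def truth_table(statement, n_vars):
    """Column of truth values, rows ordered TT, TF, FT, FF (T before F)."""
    return [statement(*row) for row in product([True, False], repeat=n_vars)]


def equivalent(f, g, n_vars):
    """Two statements are equivalent when their columns are identical."""
    return truth_table(f, n_vars) == truth_table(g, n_vars)


def not_both(p, q):          # ¬(p ∧ q)
    return not (p and q)


def either_not(p, q):        # ¬p ∨ ¬q
    return (not p) or (not q)


def implies(p, q):           # p → q
    return (not p) or q


def converse(p, q):          # q → p
    return implies(q, p)


print(truth_table(not_both, 2))          # [False, True, True, True]
print(equivalent(not_both, either_not, 2))  # True
print(equivalent(implies, converse, 2))     # False
```

The second line confirms the equivalence shown in the table above; the third shows that a conditional and its converse do not share a column.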


18. The equivalence follows from the fact that the indicated
columns of the truth table are identical.

p  q  r | q ∨ r | p ∧ (q ∨ r) | p ∧ q | p ∧ r | (p ∧ q) ∨ (p ∧ r)
T  T  T |   T   |      T      |   T   |   T   |         T
T  T  F |   T   |      T      |   T   |   F   |         T
T  F  T |   T   |      T      |   F   |   T   |         T
T  F  F |   F   |      F      |   F   |   F   |         F
F  T  T |   T   |      F      |   F   |   F   |         F
F  T  F |   T   |      F      |   F   |   F   |         F
F  F  T |   T   |      F      |   F   |   F   |         F
F  F  F |   F   |      F      |   F   |   F   |         F

21. (a) Not equivalent; (c) Not equivalent

2% (a) p^ qi) —p 

24. (a) (p ∧ ¬q) ∨ p ≡ p ∨ (p ∧ ¬q) by the Commutative
Property, and p ∨ (p ∧ ¬q) ≡ p by the Absorption Property.





(c) 2, 4, 6, 8, 10, 12 (that is, all the elements of D)

7. (a) false (x = 1): (b) true; (e) false (x = 1): (d) false 
(x28) 

8. (a) ∀s ∈ B, G(s), where B is the set of biology majors
and G(s) is the predicate, "s is required to take geometry."

(b) ∃s ∈ C, ¬M(s), where C is the set of computer science
majors and M(s) is the predicate, "s minors in
mathematics."

(c) ∀s ∈ M, ¬B(s), where M is the set of math majors
and B(s) is the predicate, "s is required to take a
business course."

(@) 3x € P, S), where Pris the set 
is the predicate, “x has a solutio 

10, (a) and (d) have the same meaning 

(a) vs € B.—G(5) 
(b) 3s € B.G(5) 
(e) Ys € B. G(s) 

D vs € B,-G(5) 

11. (a) Let D be the set of all friends of Alaina, and 
P(x) the predicate, “x gets tired of playing at the 
beach.” The two forms are “3x € D, P(x)" and 
“Yx € D P(x)” 

b) Let D be the setofall friends of Alaina, andC (x) the 
predicate, “x dislikes doing cartwheels” The two 
forms are “ax € D, C(x)” and “Yx € D, C(x)", 
(or we could let L(x) stand for "x likes doing 
cartwheels” and write "3x € D, -L(x)" and*Vx € 
D. Lix)"). 


puzzles and (x) 





Section 1.5 Exercises 


1 (a) oc — fito LA ms te) lAs) 

2. The truth tables follow below. (a) False only if you do 
not attend the concert and you do not get an F; (c) True 
only if I ate lunch but not breakfast; (e) False only if this 
triangle has both a 30 and a 60 degree angle. 


f 


wf 

















(a) 
I bjj ia 
DES F 

e T F T 
ay F 
Fog AR F 
t s | eas 
DT F 

e T F T 
A £ 
F F m 





(©) Let M be the set of all math courses, and T(x) the 
predicate, "x is too hard for Jennica.” The two forms 
are “x € M, T(x)" and “Yx € M,=T(x)". 

(d) Let M be the set of all the meals at the camp, and 
B(x) the predicate, “x is too bad.” The two forms 
are 53x € M, B(x)” and “Vx € M, B(x)". 

13. (a) (i) ~12 Gi) 0, 2 (ii) any even number is a counterex- 
ample 

(b) (i) —3, 23, 3, —31 (i) use 3 — 2y where yis the num- 
ber they have chosen 


14, (a) 3a €R, 3b € Z,a? +b g Z; (3r eZ ye Rx 
2y 








17. (a) There is an integer x that is at least as big as every
integer.
(b) There is a set of integers that does not have a smallest
number.
(c) There is a positive integer x such that no matter how
the positive integer y is chosen, either y is at least
as big as x or y is not a factor of x.
18. (a) original statement; (b) negation; (c) negation

















P__ 4 | pag | Aga 
(ESBS (Lele 

DNUS T T 
4@ T F F T 
F T F T 
FOF F T 

PO 04 | pva | evo—a 
eat: [D T 
uy T UR ix F 
Re T T 
FF F T 

















p^(q-n 





w 





mamanda 
mnadani 
mamamana 
PEETELI] 


5. In each part we highlight the columns that must be
checked to see if the statements are equivalent.


















































Pa P—ayqsp 
tar |r 
e. T rJ Bo] 2(Notegdvalent 
FT T || £ 
eri T| T 
t t 
p^ 
P? d|P—2|(o—2| ^a 
TE T T, T 
Or ri F F F (Equivalent) 
rr TIE x 
Frit| or fe 
T 


7. Let b stand for “Alaina likes basketball,” s for “Alaina
likes swimming,” and g for “Alaina likes gymnastics.”
From the truth tables we see that (a), (c), and (d) are
equivalent.

(a) b → (s ∧ g) has truth table:

b  s  g | s ∧ g | b → (s ∧ g)
T  T  T |   T   |      T
T  T  F |   F   |      F
T  F  T |   F   |      F
T  F  F |   F   |      F
F  T  T |   T   |      T
F  T  F |   F   |      T
F  F  T |   F   |      T
F  F  F |   F   |      T

(b) g → (s ∧ b) has truth table:

b  s  g | s ∧ b | g → (s ∧ b)
T  T  T |   T   |      T
T  T  F |   T   |      T
T  F  T |   F   |      F
T  F  F |   F   |      T
F  T  T |   F   |      F
F  T  F |   F   |      T
F  F  T |   F   |      F
F  F  F |   F   |      T

(c) (¬g ∨ ¬s) → ¬b has truth table:

b  s  g | ¬b | ¬s | ¬g | ¬g ∨ ¬s | (¬g ∨ ¬s) → ¬b
T  T  T | F  | F  | F  |    F    |       T
T  T  F | F  | F  | T  |    T    |       F
T  F  T | F  | T  | F  |    T    |       F
T  F  F | F  | T  | T  |    T    |       F
F  T  T | T  | F  | F  |    F    |       T
F  T  F | T  | F  | T  |    T    |       T
F  F  T | T  | T  | F  |    T    |       T
F  F  F | T  | T  | T  |    T    |       T

(d) ¬b ∨ (s ∧ g) has truth table:

b  s  g | ¬b | s ∧ g | ¬b ∨ (s ∧ g)
T  T  T | F  |   T   |      T
T  T  F | F  |   F   |      F
T  F  T | F  |   F   |      F
T  F  F | F  |   F   |      F
F  T  T | T  |   T   |      T
F  T  F | T  |   F   |      T
F  F  T | T  |   F   |      T
F  F  F | T  |   F   |      T




* (2&0) (y > O): (b) (x > 0) + (y <0): (0) (x < 
O= (y> 0; ( 50) (y 0) 

10. (b) is the only statement that is not true of all elements. 
in D. A counterexample is x 





12 (a) P(n) is “nis even,” and O(n) is "n? +n is even,” (b) 





P(n) is “n is a multipk 
digit of 57^ (c) P(n) 
is prime.” 

14. Use Z t0 represent the set of all integers, and X for the 
Set of positive real numbers, Let Æ; represent, “ends in 

the digit 3” Es represent, “ends in the digit 5," Ds rep. 

resent, “is evenly divisible by 3," and Ds represent, "is 

Evenly divisible by 5.” For part (d), let L(x) represent 

“x < v2!" and G(x) represent, “2/x > V2" 

(a) Yn € Z, Es(n) + Da(n) 

b) Ym € Z, Es(m) + Ds(m) 

(©) Yn € Z, D«(n) + Dy 

(d) vx € X, L(x) > G(x) 

Parts (a) and (d) are true. For (b), 13 is a counterexam- 

Ple. For (e), 15 is a counterexample. Notice that part (d) 

e false if we had quantified over "non-zero real 

ith counterexample —1. 

16. Using the same sets and predicates as in Exercise 14, 
the negations are as follows. 

(a) 3n € Z, Es(n) ^ —D«(n) 
(b) 3m € Z, Em) A ~Ds(m) 


€ of 5." and O(n) is "n has ones’ 
n is prime,” and Qn) is “2° — 1 























1) 


15, 













(e) 3n € Z, Ds(n) ^ Dan? — 1) 
(@) 3x € X, L(x) ^ G(x) 
20, (a) n = -2 (© x = 37/2 
22. There are many equivalent answers. We present versions
consistent with the way we have written statements
in the previous sections.


(a) For every positive integer n, ifn is even, then } <1, 
(b) For all positive integers a and b, ifa — bis odd, then 








DET 
(©) For all integers a and b, i a and b are positive, then 
a/b #1 + bja. 


25. (a) If you do not get an F for the course, then you will
attend the concert. (c) If you will not be hungry, then
you eat your breakfast.


Section 1.6 Exercises 


1. No. The poor child has fallen for the inverse fallacy.

2. (a) inverse fallacy; (b) converse fallacy; (c) inverse fallacy;
(d) modus ponens; (e) converse fallacy.

4. (a) If you read the book, then you will pass the course.
(b) If you will pass the course, then you must read the
book. (e) If you will pass the course, then you must read
the book.

6. (a) neither; (b) tautology; (c) tautology; (d) neither.
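The difference between a valid argument form such as modus ponens and the converse and inverse fallacies can be checked by brute force: an argument form is valid exactly when no row of the truth table makes every premise true and the conclusion false. The sketch below uses made-up helper names and is meant only as an illustration of that test.

```python
from itertools import product


def implies(p, q):
    """Truth value of p → q."""
    return (not p) or q


def is_valid(premises, conclusion, n_vars):
    """Valid means: no row has all premises true and the conclusion false."""
    for row in product([True, False], repeat=n_vars):
        if all(prem(*row) for prem in premises) and not conclusion(*row):
            return False
    return True


# p → q, p, therefore q
modus_ponens = is_valid([implies, lambda p, q: p], lambda p, q: q, 2)

# p → q, q, therefore p
converse_fallacy = is_valid([implies, lambda p, q: q], lambda p, q: p, 2)

# p → q, ¬p, therefore ¬q
inverse_fallacy = is_valid([implies, lambda p, q: not p],
                           lambda p, q: not q, 2)

# p → q, q → r, therefore p → r
chain = is_valid([lambda p, q, r: implies(p, q),
                  lambda p, q, r: implies(q, r)],
                 lambda p, q, r: implies(p, r), 3)

print(modus_ponens, converse_fallacy, inverse_fallacy, chain)
# True False False True
```

Both fallacies fail on the same row, p false and q true, where the premises hold but the conclusion does not.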

8. (Your examples will vary.) (a) John ordered pizza and 
Jill ordered pizza. Conclusion: John ordered pizza. 





(c) Bill got a haircut, Sue went shopping, Conclusion: 
Bill got a haircut and Sue went shopping, 


(^q — pag) 





(e) If Keith attends the party, Meg will attend. If Meg
attends the party, Nancy will attend. Conclusion: If
Keith attends the party, Nancy will attend. The truth

26. (a) If you will get an F for the course, then you do not
attend the concert. (c) If you will be hungry, then you
do not eat your breakfast.

27. (a) If you attend the concert, then you will not get an
F for the course. (c) If you eat your breakfast, you will
not be hungry.

28. (a) Both are telling the truth. 


Asays 
Mp 


Bsays 
Mp 





* 


(a) s € S.C (1) > Ds). (e) Vr € S C(S) > ^D(s). (e) 
Ys €S,C(s) > Ds) 


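table shows the following statements: (i) p → q; (ii) q → r;
(iii) (p → q) ∧ (q → r); (iv) p → r; (v) ((p → q) ∧ (q → r)) → (p → r)

p  q  r | (i) | (ii) | (iii) | (iv) | (v)
T  T  T |  T  |  T   |   T   |  T   |  T
T  T  F |  T  |  F   |   F   |  F   |  T
T  F  T |  F  |  T   |   F   |  T   |  T
T  F  F |  F  |  T   |   F   |  F   |  T
F  T  T |  T  |  T   |   T   |  T   |  T
F  T  F |  T  |  F   |   F   |  T   |  T
F  F  T |  T  |  T   |   T   |  T   |  T
F  F  F |  T  |  T   |   T   |  T   |  T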











(4) When p and q are both false, the premises are
all true while the conclusion of the argument is false;
(c) This is exactly like 8(e).

10. (a) N is "Newton is considered a great mathematician,"
L is "Leibniz's work is ignored," and C is "Calculus
is the centerpiece." This argument has the structure:
(¬N ∧ ¬L) → ¬C, N → L ∴ C ∧ ¬L. The truth
table shows the following statements: (i) ¬N ∧ ¬L;
(ii) (¬N ∧ ¬L) → ¬C; (iii) N → L; (iv) C ∧ ¬L; and
(v) (((¬N ∧ ¬L) → ¬C) ∧ (N → L)) → (C ∧ ¬L)

N  L  C | (i) | (ii) | (iii) | (iv) | (v)
T  T  T |  F  |  T   |   T   |  F   |  F
T  T  F |  F  |  T   |   T   |  F   |  F
T  F  T |  F  |  T   |   F   |  T   |  T
T  F  F |  F  |  T   |   F   |  F   |  T
F  T  T |  F  |  T   |   T   |  F   |  F
F  T  F |  F  |  T   |   T   |  F   |  F
F  F  T |  T  |  F   |   T   |  T   |  T
F  F  F |  T  |  T   |   T   |  F   |  F

The first row of the table shows that if N, L and C are
all true, then the hypotheses of the argument are true
while the conclusion is false.

ris *l ood round of golf,” c is “the wind 
is calm," and d is "the weather is dry.” This argument 
'as the structure, r = (c v d), € ^ d $, r. The truth ta 
ble shows the following statements: (i) c v d; (i) r => 
(c v d) (8) c dz and (iv) ((r — (c V d)) ^ (c^ d) > 


rgument are true 

















roe d O (b (b) | d) 
T TANE wq Ds 
Desa dolens x. em 
Demi Ae T Oulu 
T a ae T 
Et F T | F 
Fd jio: 1o MI NS: it 
Ret ee A 
Buia A TNE cmo T 





Section 2.1 Exercises 


1. (a) 12 (0 -8 (e) n? 22-3) -12 
2 (a) Counterexample m=3,n = 2 
(b) Seems to be true 
(©) Counterexample n 
(d) Counterexample 
(©) Seems to be true 
(0 Seems to be true 
(8) Counterexample n 
(hy Counterexample n 
(a) Dear Reader,
If you choose any even number and call it n, then I can
show you that 3n is also even. Here is how. Since n is
even, we agree it can be written as n = 2k for some integer
k. Then if I calculate 3n, I get 3n = 3(2k) = 6k =
2(3k). Now 3k is also an integer, so this takes the form
of 2 times some integer, which is exactly what we mean
by saying it is even.


























The fifth row of the table shows that if r is false while
c and d are both true, then the hypotheses of the argument
are true while the conclusion is false.

11. (a) g is "we take our gas heater," e is "we take extra
blankets," and m is “we take our air mattress." This argument
has the structure: g → e, ¬e → ¬m ∴ (m ∨ g) → e.
The truth table shows the following statements: (i)
g → e; (ii) ¬e → ¬m; (iii) m ∨ g; (iv) (m ∨ g) → e; and
(v) ((g → e) ∧ (¬e → ¬m)) → ((m ∨ g) → e).

g  e  m | (i) | (ii) | (iii) | (iv) | (v)
T  T  T |  T  |  T   |   T   |  T   |  T
T  T  F |  T  |  T   |   T   |  T   |  T
T  F  T |  F  |  F   |   T   |  F   |  T
T  F  F |  F  |  T   |   T   |  F   |  T
F  T  T |  T  |  T   |   T   |  T   |  T
F  T  F |  T  |  T   |   F   |  T   |  T
F  F  T |  T  |  F   |   T   |  F   |  T
F  F  F |  T  |  T   |   F   |  T   |  T










Your friend, Author

(b) Dear Reader,
Let's call your even number n, and use our agreement
on what even means to write n = 2k (k being some integer).
Now calculate n + 1 = 2k + 1. By our agreement
on the meaning of odd, this shows that n + 1 is odd.
Your friend, Author

(c) Dear Reader,
This one is a little tougher. Let's start by naming your
odd number n, and writing n = 2k + 1 (k some integer).
(This is what n being odd means.) If we calculate n³ − n,
we get (2k + 1)³ − (2k + 1) = 8k³ + 12k² + 6k + 1 −
2k − 1 = 8k³ + 12k² + 4k = 4(2k³ + 3k² + k). Since we
have written n³ − n as 4 times the integer 2k³ + 3k² + k,
this shows that n³ − n is divisible by 4.

Your friend, Author
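A quick numerical check can catch algebra slips in a letter like the one about n³ − n, although it is no substitute for the proof. The sketch below (the function name is made up) compares n³ − n with the factored form 4(2k³ + 3k² + k) for a range of odd numbers n = 2k + 1, including negative ones.

```python
def odd_cube_identity_holds(k):
    """Check n^3 - n == 4(2k^3 + 3k^2 + k) for the odd number n = 2k + 1."""
    n = 2 * k + 1
    left = n**3 - n
    right = 4 * (2 * k**3 + 3 * k**2 + k)
    return left == right and left % 4 == 0


# Test k from -50 to 50, which covers odd n from -99 to 101.
print(all(odd_cube_identity_holds(k) for k in range(-50, 51)))  # True
```

Passing for a hundred values is only evidence; the algebra in the letter is what covers every odd integer.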



















5. (a) Proposition. The sum of two odd numbers is even.
Proof. Let m and n be odd numbers. This means
there is an integer K such that m = 2K + 1 and there
is an integer L such that n = 2L + 1, and so

m + n = (2K + 1) + (2L + 1)
      = 2(K + L + 1)

Since K + L + 1 is an integer, this means that m + n is
even. ∎
(b) Proposition. If n is even, then n² is even.
Proof. Let n be an even number. This means that
we can write n = 2m for some integer m. This in
turn means that n² = 4m² = 2 · (2m²), so n² is even,
because 2m² is an integer. ∎
(c) Proposition. Every odd perfect square can be
written in the form 4k + 1 where k is an integer.
Proof. Let s be an odd perfect square. So s = n²
for some integer n, and n² is odd. By the contrapositive
of the previous exercise, n is odd. Since n
is odd, there is an integer L such that n = 2L + 1.
This means

s = (2L + 1)²
  = 4L² + 4L + 1

So s = 4(L² + L) + 1, where L² + L is an integer,
as desired. ∎
x y K L KL 2-(K+L) ety 
&|18 20 9 10 19 38 38 
(iis 18 9 9 18 36 36 


9. (a) Dear Reader,
Take an example of two odd integers, and let's agree to
call your integers x and y. Since x is odd, we know there
must be an integer K that makes x = 2K + 1 true, and
likewise we know we can write y = 2L + 1 for some integer
L. Now let's see what happens when we multiply
your two integers:

x · y = (2K + 1) · (2L + 1)
      = 4KL + 2L + 2K + 1
      = 2 · (2KL + L + K) + 1

Since 2KL + L + K is an integer, we can see that we
have written x · y as 2 · (integer) + 1, which shows us
that x · y is odd.

Your friend, Author





(c) Dear Reader,

Take an example of an even integer and an integer that
is divisible by 3, and let's agree to call your integers x
(for the even one) and y (for the one that is divisible by
3). Since x is even, we know there must be an integer
K that makes x = 2K true. Similarly, since y is divisible
by 3, we know we can write y = 3L for some integer L.
Now let's see what happens when we multiply your two
integers:

x · y = (2K) · (3L)
      = 6 · (KL)

Since KL is an integer, we can see that we have written
x · y as 6 · (integer), which is exactly what it means to
say that x · y is divisible by 6.
Your friend, Author

11. (a) Proof. Let x and y be odd integers. Then there is an
integer K such that x = 2K + 1 and there is an integer
L such that y = 2L + 1. It follows then that

x · y = (2K + 1) · (2L + 1)
      = 4KL + 2L + 2K + 1
      = 2 · (2KL + L + K) + 1

Since 2KL + L + K is an integer, we can conclude that
x · y is odd. ∎

12. (a) A counterexample to (i) requires that my brother
and I both root for the Braves while a counterexample
to (ii) requires that I do not root for the Braves while
my brother does.
(c) A counterexample to each would be a student who
does not do math problems every night but is good at
math.








13. (a) Proposition. If m = 0 and n = 0, then m² + n² = 0.
Proof. If m = 0 and n = 0, then m² + n² = 0² + 0² =
0. ∎

(c) Proposition. If m and n are odd integers, then
m + n is an even integer.

Proof. Let odd integers m and n be given. This means
m = 2K + 1 for some integer K, and n = 2L + 1 for
some integer L. So

m + n = (2K + 1) + (2L + 1)
      = 2K + 2L + 2
      = 2(K + L + 1)

which means m + n can be written as twice an integer,
hence m + n is even. ∎








Section 2.2 Exercises 

1. (a) 73 = 6 · 12 + 1; (c) −1234 = 15 · (−83) + 11; (e)
1000 = 7 · 142 + 6

2. (a) 1; (c) 11; (e) 6

3. (a) 1; (€) 6; (0) 5 1 
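Quotient-and-remainder answers in the form n = d · q + r with 0 ≤ r < d can be checked with Python's built-in `divmod`, which (for a positive divisor) rounds the quotient down and so always returns a remainder in that range, even when n is negative. The wrapper name `divide` below is made up for illustration.

```python
def divide(n, d):
    """Return (q, r) with n == d*q + r and 0 <= r < d, for a positive divisor d."""
    if d <= 0:
        raise ValueError("this sketch assumes a positive divisor")
    q, r = divmod(n, d)  # floor division keeps r non-negative when d > 0
    return q, r


print(divide(73, 6))      # (12, 1)
print(divide(-1234, 15))  # (-83, 11)
print(divide(1000, 7))    # (142, 6)
```

Note the negative case: the quotient is −83 rather than −82, because the remainder must not be negative.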














6. (a) ...for some integer m. From this we can conclude
that c = m · b = m · (3 · k) = 3 · (mk). Since mk is an integer,

(b) ...so that n = 2m + 1. This means n³ = (2m + 1)³ =
8m³ + 12m² + 6m + 1 = 2(4m³ + 6m² + 3m) + 1. Since
4m³ + 6m² + 3m is an integer, we can see that n³ is 1
more than twice an integer. Hence n³ is odd, completing
the proof.

(©) Fill in the blanks as follows: 3k +1, 4k + 1, 4k +1, 
and 4k+1 

7. (a) Proof. Let integers a, b and c be given such that
a divides b and a divides c. This means that b = k · a
for some integer k and c = l · a for some integer l, so
b + c = k · a + l · a = (k + l) · a. Since k + l is an integer,
this means that a divides b + c. ∎
(c) Proof. Let integers a, b, c and d be given such that
a divides b and c divides d. This means that b = k · a
for some integer k and d = l · c for some integer l, so
b · d = (k · a) · (l · c) = (kl) · (ac). Since kl is an integer,
this means that ac divides bd. ∎
(e) Proof. Let an integer n be given such that 9 divides
10ⁿ⁻¹ − 1. This means that 10ⁿ⁻¹ − 1 = 9k for some integer
k. We can show algebraically that

10ⁿ − 1 = 10 · (10ⁿ⁻¹ − 1) + 9
        = 10 · (9k) + 9
        = 9 · (10k + 1)

Since 10k + 1 is an integer, this means that 9 divides
10ⁿ − 1. ∎
8. Fill in the blanks with: a is rational; b is rational; 
xw + yz; xw yz and w # O and y 0. 
9. Hint. This begins exactly as in Exercise 8, but instead
of calculating $a + b$, you calculate $a - b$.
Proof. Let $x$ and $y$ be rational numbers. Then there
are integers $a$ and $b$ with $b \neq 0$ such that $x = \frac{a}{b}$, and
there are integers $c$ and $d$ with $d \neq 0$ such that $y = \frac{c}{d}$.
So $\frac{x + y}{2} = \frac{ad + bc}{2bd}$. Since $ad + bc$ and $2bd$ are integers
and $2bd \neq 0$, this shows that $\frac{x + y}{2}$ is a rational
number. ∎
(a) (i) Proof. Let $n$ be given such that $a_{n-1}$ is
even. This means that $a_{n-1} = 2k$ for some integer $k$.
Therefore, $a_n = a_{n-1} + 2n = 2k + 2n = 2(k + n)$, which
is even. ∎ (ii) 10, 14, 20, 28, 38. Every term is even. (iii)




















An integer $x$ is a perfect cube if $x = y^3$ for some integer $y$.


7, 11, 17, 25, 35. It is vacuously true that every even term
is followed by an even term.

(a) Fill in the blanks as follows: “...the given $n$ is not
divisible by 3”; $9q^2 + 6q + 1$; $3q^2 + 2q$; 1;
$3q + 2$; $9q^2 + 12q + 4$; $3q^2 + 4q + 1$; 1.

16. Proof. Let an integer $n$ which is not divisible by 3 be
given. By the Division Theorem, when any integer is
divided by 3 it leaves a remainder of 0, 1 or 2. That is,
one of the following cases must be true:

• Case 1: It might be that $n = 3q$ for some integer $q$.
However, for this particular integer $n$, we know this
case does not happen, because the given $n$ is not
divisible by 3.

• Case 2: It might be that $n = 3q + 1$ for some integer
$q$. In this case,
$$\begin{aligned} n^2 + 2 &= (3q + 1)^2 + 2 \\ &= (9q^2 + 6q + 1) + 2 \\ &= 3(3q^2 + 2q + 1) \end{aligned}$$

• Case 3: It might be that $n = 3q + 2$ for some integer
$q$. In this case,
$$\begin{aligned} n^2 + 2 &= (3q + 2)^2 + 2 \\ &= (9q^2 + 12q + 4) + 2 \\ &= 3(3q^2 + 4q + 2) \end{aligned}$$

Thus, in every case that satisfies the hypothesis, we see
that $n^2 + 2$ is divisible by 3, completing the proof. ∎
18. Outline of proof. First, prove this Proposition: For
any integer $n$, $n^3 + 2n$ is divisible by 3. (Hint: Write $n^3 +
2n = n(n^2 + 2)$ and consider two cases: either $n$ is divisible
by 3, or it is not.) Then, use this proposition together
with previous exercises to make the desired conclusion.
The sum of any three consecutive perfect cubes is divisible by 9.
Proof. Call the three consecutive perfect cubes
$(n - 1)^3$, $n^3$ and $(n + 1)^3$. Then
$$\begin{aligned} (n - 1)^3 + n^3 + (n + 1)^3 &= 3n^3 + 6n \\ &= 3n(n^2 + 2) \end{aligned}$$
Now $n$ is either divisible by 3 or it is not.
• Case 1: If $n$ is divisible by 3, then $3n$ is divisible by 9,
and hence $3n(n^2 + 2)$ is divisible by 9.

• Case 2: If $n$ is not divisible by 3, by Exercise 16
we know that $n^2 + 2$ is divisible by 3, and hence
$3(n^2 + 2)$ is divisible by 9. It follows then that
$3n(n^2 + 2)$ is divisible by 9.
In either case, $3n(n^2 + 2)$ is divisible by 9. ∎
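As an informal sanity check (it does not replace the proof), the factorization and both divisibility facts used above can be tested over a range of integers. The function name and the test range are our own choices.

```python
def sum_of_three_consecutive_cubes(n):
    """Return (n-1)^3 + n^3 + (n+1)^3."""
    return (n - 1) ** 3 + n ** 3 + (n + 1) ** 3


for n in range(-50, 51):
    total = sum_of_three_consecutive_cubes(n)
    # The factorization used in the proof.
    assert total == 3 * n * (n * n + 2)
    # The claim itself.
    assert total % 9 == 0
    # The fact quoted from Exercise 16 for n not divisible by 3.
    if n % 3 != 0:
        assert (n * n + 2) % 3 == 0
```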
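22. For all integers $n$, $n^5 - n$ is divisible by 5.
Proof. Let an integer $n$ be given. By the Division Theorem,
when $n$ is divided by 5, it leaves a remainder of 0,
1, 2, 3, or 4.
• Case 1: It might be that $n = 5q$ for some integer
$q$. In this case,
$$\begin{aligned} n^5 - n &= (5q)^5 - (5q) \\ &= 3125q^5 - 5q \\ &= 5 \cdot (625q^5 - q) \end{aligned}$$
• Case 2: It might be that $n = 5q + 1$ for some integer
$q$. In this case,
$$\begin{aligned} n^5 - n &= (5q + 1)^5 - (5q + 1) \\ &= 3125q^5 + 3125q^4 + 1250q^3 + 250q^2 + 20q \\ &= 5 \cdot (625q^5 + 625q^4 + 250q^3 + 50q^2 + 4q) \end{aligned}$$
• Case 3: It might be that $n = 5q + 2$ for some integer
$q$. In this case,
$$\begin{aligned} n^5 - n &= (5q + 2)^5 - (5q + 2) \\ &= 3125q^5 + 6250q^4 + 5000q^3 + 2000q^2 + 395q + 30 \\ &= 5 \cdot (625q^5 + 1250q^4 + 1000q^3 + 400q^2 + 79q + 6) \end{aligned}$$
• Case 4: It might be that $n = 5q + 3$ for some integer
$q$. In this case,
$$\begin{aligned} n^5 - n &= (5q + 3)^5 - (5q + 3) \\ &= 3125q^5 + 9375q^4 + 11250q^3 + 6750q^2 + 2020q + 240 \\ &= 5 \cdot (625q^5 + 1875q^4 + 2250q^3 + 1350q^2 + 404q + 48) \end{aligned}$$
• Case 5: It might be that $n = 5q + 4$ for some integer
$q$. In this case,
$$\begin{aligned} n^5 - n &= (5q + 4)^5 - (5q + 4) \\ &= 3125q^5 + 12500q^4 + 20000q^3 + 16000q^2 + 6395q + 1020 \\ &= 5 \cdot (625q^5 + 2500q^4 + 4000q^3 + 3200q^2 + 1279q + 204) \end{aligned}$$
Hence, in every possible case, $n^5 - n$ is divisible by
5. ∎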
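The five expansions are easy to get wrong by hand, so a numerical cross-check can be reassuring. The sketch below encodes, for each remainder $r$, the coefficients of the polynomial in $q$ that the proof factors out, and confirms $(5q + r)^5 - (5q + r) = 5 \cdot (\text{that polynomial})$ for many values of $q$. The table name `QUOTIENT_COEFFS` is ours.

```python
# Coefficients of q^5, q^4, q^3, q^2, q^1, q^0 in the bracketed factor for each case.
QUOTIENT_COEFFS = {
    0: [625, 0, 0, 0, -1, 0],
    1: [625, 625, 250, 50, 4, 0],
    2: [625, 1250, 1000, 400, 79, 6],
    3: [625, 1875, 2250, 1350, 404, 48],
    4: [625, 2500, 4000, 3200, 1279, 204],
}


def bracket_value(r, q):
    """Evaluate the bracketed polynomial for remainder r at the integer q."""
    return sum(c * q ** (5 - i) for i, c in enumerate(QUOTIENT_COEFFS[r]))


for r in range(5):
    for q in range(-20, 21):
        n = 5 * q + r
        assert n ** 5 - n == 5 * bracket_value(r, q)
```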
24. Fill in the blanks with: “this is the same as $5(A - C) =
D - B$”; $-4$; 4; 0; and “$5(A - C) = D - B$.”
26. (a) The contrapositive of the statement, “If $n^2$ is even,
then so is $n$” is the statement, “If $n$ is odd, then $n^2$
is odd.”

Proof. Let $n$ be a given odd integer. Since $n$ is
odd, we can write $n = 2k + 1$ where $k$ is an integer.
Then $n^2 = (2k + 1)^2 = 4k^2 + 4k + 1 = 2(2k^2 + 2k) + 1$.
This shows that $n^2$ is odd. ∎

(b) This was proven in Proposition 7.
(c) Claim. In Pythagorean triples $a^2 + b^2 = c^2$, if $c$ is
even, then so are both $a$ and $b$.
Proof. Let integers $a$, $b$, and $c$, with $a^2 + b^2 = c^2$
and $c$ being even, be given. Since $c$ is even, we can
write $c = 2k$ for some integer $k$. There are two possibilities
for $a$: either it is even or it is odd. We
consider both cases:
• Case 1. $a$ is even. Then $a = 2m$ for some integer
$m$. In this case, the equation $a^2 + b^2 = c^2$
can be rewritten $b^2 = c^2 - a^2 = 4k^2 - 4m^2 =
2(2k^2 - 2m^2)$, from which it follows that $b^2$ is
even, and thus $b$ is even by part (a).
• Case 2. $a$ is odd. Then $a = 2m + 1$ for some integer
$m$. In this case, the equation $a^2 + b^2 = c^2$
can be rewritten $b^2 = c^2 - a^2 = 4k^2 - (4m^2 +
4m + 1) = 4k^2 - 4m^2 - 4m - 1 = 4(k^2 - m^2 -
m - 1) + 3$. By part (b), it is impossible for a
perfect square to be written in the form $4M + 3$
where $M$ is an integer.
Since Case 2 is impossible, it must be the case
that $a$ is even, from which it follows that $b$ is also
even. ∎
28. Outline of proof. Let $n$ be a perfect square integer.
Write $n = m^2$, and write $m = 3q + r$. Consider the three
possible cases ($r = 0$, $r = 1$, and $r = 2$).
30. Outline of proof. Let an integer $c$ be given. By the
Division Theorem, dividing $c$ by 4 leaves a remainder
of 0, 1, 2, or 3.
• Case 1: If $c = 4q$, then show that $c + 2$ is not a perfect
square, using Proposition 7.

• Case 2: If $c = 4q + 1$, then show that $c + 2$ is not a
perfect square.

• Case 3: If $c = 4q + 2$, then show that $2c + 2$ is not a
perfect square.

• Case 4: If $c = 4q + 3$, then show that $7c + 2$ is not a
perfect square.

In every case, at least one of the values $c + 2$, $2c + 2$, or
$7c + 2$ is not a perfect square.
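The case split above can be sketched as a small lookup: pick the candidate named in the matching case and confirm it is not a perfect square. This is only an illustration over a finite range; the helper names are ours, and the check that squares leave remainder 0 or 1 on division by 4 is the fact we assume the outline takes from Proposition 7.

```python
from math import isqrt


def is_perfect_square(k):
    """True when k is the square of an integer."""
    return k >= 0 and isqrt(k) ** 2 == k


# Squares leave remainder 0 or 1 when divided by 4.
assert {(m * m) % 4 for m in range(100)} == {0, 1}


def non_square_witness(c):
    """Return the value among c+2, 2c+2, 7c+2 selected by c mod 4; it is never a square."""
    candidate = {0: c + 2, 1: c + 2, 2: 2 * c + 2, 3: 7 * c + 2}[c % 4]
    assert not is_perfect_square(candidate)
    return candidate


for c in range(-20, 1000):
    non_square_witness(c)
```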

32. Outline of proof. We should first establish that (*) every
perfect square is of the form $8k$, $8k + 1$, or $8k + 4$.
This can be done by extending the reasoning in Proposition 7.
We now consider three perfect squares $a^2$, $b^2$
and $c^2$ and list all the possibilities for their sum, in the
form $8M + {?}$. The only form not possible is $8M + 7$.
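The enumeration described in this outline is small enough to run directly. The following sketch lists the remainders of squares modulo 8 and then every remainder reachable as a sum of three of them; variable names are ours.

```python
from itertools import product

# Remainders of perfect squares on division by 8.
square_residues = sorted({(k * k) % 8 for k in range(8)})

# Remainders of a^2 + b^2 + c^2 on division by 8.
reachable = {(a + b + c) % 8 for a, b, c in product(square_residues, repeat=3)}
missing = sorted(set(range(8)) - reachable)

print(square_residues)  # [0, 1, 4]
print(missing)          # [7]
```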



































Section 2.3 Exercises 


1. (a) $P(1)$ is “2 is prime,” $P(2)$ is “5 is prime,” and $P(12)$
is “145 is prime.” Only $P(1)$ and $P(2)$ are true. $P(m - 1)$
is “$(m - 1)^2 + 1$ is prime.”

(b) $L(1)$ is “$1 < 2$,” $L(2)$ is “$4 < 4$,” $L(3)$ is “$9 < 8$,”
$L(4)$ is “$16 < 16$,” $L(5)$ is “$25 < 32$,” and $L(6)$ is
“$36 < 64$.” Of these, $L(1)$, $L(5)$ and $L(6)$ are true.
$L(m - 1)$ is “$(m - 1)^2 < 2^{m-1}$.”

(© S() is “P= sQ) is “1242? = 
SG) is “+2 +3? = BO” pais qua. 
F+P= MOH Lig PEPE L ES 

and L(6)is"F +24 F +445 +6 

Each of these is true. S(m—1) is 
i-um 

3. We give either the outline of the proof or the formal
proof (both for part (a)).

(a) Outline of proof. The table shows the verification
for $n = 1$, $n = 2$, $n = 3$, and $n = 4$. It also shows the
last row checked ($n = m - 1$) and the next row to
be checked ($n = m$).












































Now we simplify the recursive formula for $a_m$:
$a_m = a_{m-1} + 4 = (4m - 7) + 4 = 4m - 3$, and observe
that this is equal to the desired closed formula.
Formal proof. Let $P(n)$ be the statement, “$a_n =
4n - 3$.” Since $P(1)$ is the statement “$a_1 = 4 - 3$,”
we know that it is true from the given definition
of $a_1 = 1$. Now let the integer $m \geq 2$ be given such
that $P(1), P(2), \ldots, P(m - 1)$ have already been
checked to be true. In particular, the last statement
we checked was “$a_{m-1} = 4(m - 1) - 3$.” It now follows
that

$$\begin{aligned} a_m &= a_{m-1} + 4 && \text{(by the recurrence relation for } a\text{)} \\ &= (4(m - 1) - 3) + 4 && \text{(by } P(m - 1)\text{)} \\ &= 4m - 4 - 3 + 4 \\ &= 4m - 3 \end{aligned}$$

Thus we have shown that $a_m = 4m - 3$, which is
precisely statement $P(m)$. ∎
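The "table" style of verification used in the outline can be mimicked in a few lines: compute each term from the recurrence and compare it with the closed formula. The function names are ours, and the recurrence $a_1 = 1$, $a_n = a_{n-1} + 4$ is taken from the proof above.

```python
def a_recursive(n):
    """a_1 = 1 and a_k = a_{k-1} + 4 for k >= 2."""
    a = 1
    for _ in range(2, n + 1):
        a += 4
    return a


def a_closed(n):
    """Closed formula claimed in the proof."""
    return 4 * n - 3


for n in range(1, 6):
    print(n, a_recursive(n), a_closed(n), a_recursive(n) == a_closed(n))
```

A finite table like this only checks the rows it lists; the induction step is what covers every $n$.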

(c) Outline of proof. The table shows the verification
for $n = 1$, $n = 2$, $n = 3$, and $n = 4$. It also shows the
last row checked ($n = m - 1$) and the next row to
be checked ($n = m$).


















































a, (recursive 

formula) dosed formula | equat? 
1 1 i yes 

{ 

2 yes 
3 [ses Maig ys 
4 [448230 | L530 ys 
m-l| aes (m- 1] itin | veg 

amm 
m jam | mesem 








Now we simplify the recursive formula 
for ami am = am, + mM y 











ax 2 = eA -ipa 
mpte) matami) myat, ang 










Observe that this is equal to the desired closed
formula.

(e) Outline of proof. The table shows the verification
for $n = 1$, $n = 2$, $n = 3$, and $n = 4$. It also shows the
last row checked ($n = m - 1$) and the next row to
be checked ($n = m$).



































Mmm? ani 12:1 - 1) 1e 2-27 
2+ 1 = 2" — and observe that this is equal to the. 
desired closed formula. 

6. S(1) is “AP = 1900 so) jg “IG 4 AY = BMY» s(3) 
o m AA ang SOA) is He REY 
40 4 40) m 4809 » Bach of theseis true. Without sigma 
notation, S(m) can be written, 








mon Dn 2) 
pan 


and S(m — 1) can be written, 


19 


g 29,39, 


2.09 
_ n= Denm) 
6 





8. (a) Outline of proof. The table shows the verification
for $n = 1$, $n = 2$, $n = 3$, and $n = 4$. It also shows the
last row checked ($n = m - 1$) and the next row to be
checked ($n = m$).





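Now we simplify the summation for $n = m$: $\sum_{i=1}^{m} (2i -
1) = \left(\sum_{i=1}^{m-1} (2i - 1)\right) + (2m - 1) = (m - 1)^2 + (2m -
1) = m^2 - 2m + 1 + (2m - 1) = m^2$, and observe that
this is equal to the desired closed formula.

Formal proof. Let $P(n)$ be the statement, “$\sum_{i=1}^{n} (2i -
1) = n^2$.” Since $P(1)$ is the statement “$2 \cdot 1 - 1 = 1^2$,” it is
clearly true. Now let the integer $m \geq 2$ be given such that
$P(1), P(2), \ldots, P(m - 1)$ have already been checked to
be true. In particular, the last statement we checked
was $P(m - 1)$, which said, “$\sum_{i=1}^{m-1} (2i - 1) = (m - 1)^2$.”
It now follows that

$$\begin{aligned} \sum_{i=1}^{m} (2i - 1) &= \left(\sum_{i=1}^{m-1} (2i - 1)\right) + (2m - 1) \\ &= (m - 1)^2 + (2m - 1) && \text{(by } P(m - 1)\text{)} \\ &= m^2 - 2m + 1 + (2m - 1) \\ &= m^2 \end{aligned}$$
Thus we have shown that $\sum_{i=1}^{m} (2i - 1) = m^2$, which is
precisely statement $P(m)$. ∎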
(c) Outline of proof. The table shows the verification
for $n = 1$, $n = 2$, $n = 3$, and $n = 4$. It also shows the
last row checked ($n = m - 1$) and the next row to be
checked ($n = m$).





























n | summation | formula | equal?
1 | 1 | 2^2 - 1 - 2 = 1 | yes
2 | 1 + 3 = 4 | 2^3 - 2 - 2 = 4 | yes





















































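Now we simplify the summation for $n = m$: $\sum_{i=1}^{m} (2^i -
1) = \left(\sum_{i=1}^{m-1} (2^i - 1)\right) + (2^m - 1) = (2^m - (m - 1) - 2) +
(2^m - 1) = 2 \cdot 2^m - m + 1 - 2 - 1 = 2^{m+1} - m - 2$, and observe
that this is equal to the desired closed formula.
Formal proof. Let $P(n)$ be the statement, “$\sum_{i=1}^{n} (2^i -
1) = 2^{n+1} - n - 2$.” Since $P(1)$ is the statement “$2 - 1 =
4 - 1 - 2$,” it is clearly true. Now let the integer $m \geq 2$ be
given such that $P(1), P(2), \ldots, P(m - 1)$ have already
been checked to be true. In particular, the last statement
we checked was $P(m - 1)$, which said, “$\sum_{i=1}^{m-1} (2^i - 1) =
2^m - (m - 1) - 2$.” It now follows that

$$\begin{aligned} \sum_{i=1}^{m} (2^i - 1) &= \left(\sum_{i=1}^{m-1} (2^i - 1)\right) + (2^m - 1) \\ &= 2^m - (m - 1) - 2 + (2^m - 1) && \text{(by } P(m - 1)\text{)} \\ &= 2 \cdot 2^m - m + 1 - 2 - 1 \\ &= 2^{m+1} - m - 2 \end{aligned}$$

Thus we have shown that $\sum_{i=1}^{m} (2^i - 1) = 2^{m+1} - m - 2$,
which is precisely statement $P(m)$. ∎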
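The rows of the verification table can be regenerated by comparing the running summation with the closed formula. This is a sketch for illustration only; the function names are ours.

```python
def summation(n):
    """Left side: the sum of (2^i - 1) for i = 1..n."""
    return sum(2 ** i - 1 for i in range(1, n + 1))


def closed_formula(n):
    """Right side: 2^(n+1) - n - 2."""
    return 2 ** (n + 1) - n - 2


for n in range(1, 5):
    print(n, summation(n), closed_formula(n), summation(n) == closed_formula(n))
```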
(e) Outline of proof. The table shows the verification
for $n = 1$, $n = 2$, $n = 3$, and $n = 4$. It also shows the
last row checked ($n = m - 1$) and the next row to be
checked ($n = m$).





Now we simplify the summation for n = m: E", $ 
(Enp) th=- arthi 
js and observe that this is equal to the desired closed 
formula 

9. (a) Proof Let JG) be the statement, ue +904 









be given such that we have already checked 


P). PQ), «++. Plm= 1) to be true, Since 
(0) | 23) (m— Din) | (mm 1) 
CRAT TEES NOME: 

2) n- im 
= (22.2, Vin uon] , einen 
(m= Dini) | (nmn 1) 


6 2 
(since P(m — 1) is true) 
mm Am 2) 
um 
we sce that Pim) is also true. L 
30. (a) Y7., Met = meters 
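31. Claim. For all $n \geq 1$, $\sum_{i=1}^{n} i \cdot 2^i = (n - 1)2^{n+1} + 2$.
Proof. Let $S(n)$ represent the statement,
“$\sum_{i=1}^{n} i \cdot 2^i = (n - 1)2^{n+1} + 2$.” Then $S(1)$ is
“$(1)(2^1) = (1 - 1)2^2 + 2$,” which is true. Now let $m \geq 2$
be given such that $S(1), S(2), \ldots, S(m - 1)$ have all
been checked to be true. In particular, $S(m - 1)$ is
“$\sum_{i=1}^{m-1} i \cdot 2^i = (m - 2)2^m + 2$.” So

$$\begin{aligned} \sum_{i=1}^{m} i \cdot 2^i &= \left(\sum_{i=1}^{m-1} i \cdot 2^i\right) + (m)(2^m) \\ &= ((m - 2)2^m + 2) + (m)(2^m) && \text{by } S(m - 1) \\ &= m \cdot 2^m - 2 \cdot 2^m + 2 + m \cdot 2^m \end{aligned}$$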





Section 2.4 Exercises 


L (9) a = aya #263"! with ay 82; (0) mi 
sy with 0 

2. (a) Proof. It is easy to see that $a_1 = 3^1 - 1$ since the
definition above gives us that $a_1 = 2$. Let $m \geq 2$ be given
such that the closed formula has been checked to work
for $a_1, a_2, \ldots, a_{m-1}$. In particular, it has been checked
that $a_{m-1} = 3^{m-1} - 1$. Now

$$\begin{aligned} a_m &= a_{m-1} + 2 \cdot 3^{m-1} && \text{by the recurrence for } a \\ &= (3^{m-1} - 1) + 2 \cdot 3^{m-1} \\ &= 3 \cdot 3^{m-1} - 1 \end{aligned}$$

So $a_m = 3^m - 1$, completing the induction. ∎
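A quick numerical comparison of the recurrence with the closed formula, offered as a sketch rather than as part of the proof. It assumes the recurrence as read from the proof above, $a_1 = 2$ and $a_n = a_{n-1} + 2 \cdot 3^{n-1}$; the function name is ours.

```python
def a_from_recurrence(n):
    """a_1 = 2 and a_k = a_{k-1} + 2 * 3^(k-1) for k >= 2."""
    a = 2
    for k in range(2, n + 1):
        a += 2 * 3 ** (k - 1)
    return a


for n in range(1, 30):
    assert a_from_recurrence(n) == 3 ** n - 1
```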
(c) Proof It is easy to see that c = ry since the def- 
inition above gives us that ¢ «|. Let m = 2 be given 
such that the given closed formula has been checked 
to work for ci, cs... Cj. In particular, it has been 
checked that cm- = Zl, Now 











Cn = Cot + amy By the recurrence for c 
Me 
m |t nn 
(mnt) — 1 
mmi) Cmm» 
nè m 
mol) mer 
So c, = zs, completing the induction. . 
3. (a) Let $g_n = n^3 + 2n$. Note that $g_{n-1} = (n - 1)^3 + 2(n -
1) = n^3 - 3n^2 + 5n - 3 = (n^3 + 2n) - 3n^2 + 3n - 3$, so
$g_{n-1} = g_n - 3(n^2 - n + 1)$, or equivalently, $g_n = g_{n-1} +
3(n^2 - n + 1)$. (b & c) We prove by induction that for
all $n \geq 0$, $g_n$ is divisible by 3.
Proof. Let $P(n)$ be the statement, “$g_n$ is divisible by
3.” Since $g_0 = 0$, which is divisible by 3, statement $P(0)$
is true. Now let $m \geq 1$ be given such that statements
$P(0), P(1), \ldots, P(m - 1)$ have all been checked to be
true. In particular, it has been checked that $g_{m-1}$ is divisible
by 3, so there is an integer $K$ such that $g_{m-1} = 3K$.








$= (2m - 2)2^m + 2$

$= (m - 1)2^{m+1} + 2$

which verifies that statement $S(m)$ is true. ∎


From the (a) part,

$$\begin{aligned} g_m &= g_{m-1} + 3(m^2 - m + 1) \\ &= 3K + 3(m^2 - m + 1) && \text{by } P(m - 1) \\ &= 3(K + m^2 - m + 1) \end{aligned}$$

This means that $g_m$ is divisible by 3, completing the
induction. ∎
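Both ingredients of this argument, the recurrence from part (a) and the divisibility claim, can be checked numerically. A minimal sketch, with `g` as our own helper name:

```python
def g(n):
    """g_n = n^3 + 2n."""
    return n ** 3 + 2 * n


for n in range(1, 200):
    # Recurrence derived in part (a).
    assert g(n) == g(n - 1) + 3 * (n * n - n + 1)
    # Claim proved in parts (b) and (c).
    assert g(n) % 3 == 0
```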
(a) Proof. Let $P(n)$ be the statement, “$n^2 - n$ is
divisible by 2.” $P(1)$ says, “$1^2 - 1$ is divisible by
2,” which is true. Let $m \geq 2$ be given such that
$P(1), P(2), \ldots, P(m - 1)$ have all been checked to be
true. In particular, $P(m - 1)$ states, “$(m - 1)^2 - (m - 1)$
is divisible by 2,” so there is an integer $K$ such that
$(m - 1)^2 - (m - 1) = 2K$. Now consider the next statement
$P(m)$:

$$\begin{aligned} m^2 - m &= ((m - 1)^2 - (m - 1)) + 2m - 2 \\ &= 2K + 2m - 2 \\ &= 2(K + m - 1) \end{aligned}$$

Hence, $m^2 - m$ is divisible by 2, verifying that statement
$P(m)$ is true. ∎
Proof. Let $P(n)$ be the statement, “$10^n - 1$ is divisible
by 9.” Then $P(0)$ is the statement, “$10^0 - 1$ is
divisible by 9,” that is, “0 is divisible by 9.” This is
true since $0 = 0 \cdot 9$. Now let $m \geq 1$ be given such that
$P(0), P(1), \ldots, P(m - 1)$ have all been verified. In particular,
we know that $10^{m-1} - 1$ is divisible by 9, say
$10^{m-1} - 1 = 9k$ for some integer $k$. Using algebra, we find

$$\begin{aligned} 10^{m-1} - 1 &= 9k \\ 10^{m-1} &= 9k + 1 \\ 10 \cdot 10^{m-1} &= 10(9k + 1) \\ 10^m &= 90k + 10 \\ 10^m - 1 &= 90k + 10 - 1 \\ &= 90k + 9 \\ &= 9(10k + 1) \end{aligned}$$

Since $10k + 1$ is an integer, this shows that $P(m)$ is true,
and the result follows by induction. ∎







8. (a) Proof. It is easy to check that $F_1 < 2^1$ and $F_2 < 2^2$
since these values ($F_1 = F_2 = 1$) are given in the
definition of the Fibonacci numbers. Let $m \geq 3$ be
given such that the inequality “$F_n < 2^n$” has been
checked for the terms $F_1, F_2, \ldots, F_{m-1}$. In particular,
we know that $F_{m-1} < 2^{m-1}$ and $F_{m-2} < 2^{m-2}$.
Now
$$F_m = F_{m-1} + F_{m-2} < 2^{m-1} + 2^{m-2} < 2^{m-1} + 2^{m-1} = 2^m.$$
Hence, $F_m < 2^m$, completing the induction. ∎
(b) Proof. Let $P(n)$ be the statement, “$F_1 + F_2 +
\cdots + F_n = F_{n+2} - 1$.” It is easy to check that $P(1)$
(which says “$F_1 = F_3 - 1$”) is true since $F_1 = 1$
and $F_3 = F_2 + F_1 = 2$. Let $m \geq 2$ be given such
that $P(1), P(2), \ldots, P(m - 1)$ have already been
checked to be true. In particular, $P(m - 1)$ says,
“$F_1 + F_2 + \cdots + F_{m-1} = F_{m+1} - 1$.” Now

$$\begin{aligned} F_1 + F_2 + \cdots + F_m &= (F_1 + F_2 + \cdots + F_{m-1}) + F_m \\ &= (F_{m+1} - 1) + F_m \\ &= F_{m+2} - 1 \end{aligned}$$
This verifies that $P(m)$ is true, completing the induction. ∎


9. (a) Claim: For all $n \geq 1$, $F_{4n}$ is divisible by 3. Proof
by induction on $n$. We can compute $F_4 = F_3 + F_2 =
2 + 1 = 3$ to see that the first statement (which is “$F_4$
is divisible by 3”) is true. Let $m \geq 2$ be given such that
$F_4, F_8, F_{12}, \ldots, F_{4(m-1)}$ have all been checked to be divisible
by 3. In particular, since we know that $F_{4m-4}$ is
divisible by 3, we know there is an integer $K$ such that
$F_{4m-4} = 3K$. Now using the recurrence for the Fibonacci
sequence, we see that

$$\begin{aligned} F_{4m} &= F_{4m-1} + F_{4m-2} \\ &= (F_{4m-2} + F_{4m-3}) + (F_{4m-3} + F_{4m-4}) \\ &= (F_{4m-3} + F_{4m-4}) + F_{4m-3} + F_{4m-3} + F_{4m-4} \\ &= 3F_{4m-3} + 2F_{4m-4} \\ &= 3F_{4m-3} + 2(3K) \\ &= 3(F_{4m-3} + 2K) \end{aligned}$$

From this, it follows that $F_{4m}$ is divisible by 3. ∎
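The key identity in this induction step, $F_{4m} = 3F_{4m-3} + 2F_{4m-4}$, and the resulting divisibility can be checked for small $m$. The sketch assumes the usual indexing $F_1 = F_2 = 1$ used in Exercise 8; the function name `fib` is ours.

```python
def fib(n):
    """Fibonacci numbers with F_1 = F_2 = 1."""
    a, b = 1, 1
    for _ in range(n - 1):
        a, b = b, a + b
    return a


for m in range(2, 40):
    # Identity obtained by unwinding the recurrence three times.
    assert fib(4 * m) == 3 * fib(4 * m - 3) + 2 * fib(4 * m - 4)

for n in range(1, 40):
    assert fib(4 * n) % 3 == 0
```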
10. (a) Proof. Let $P(n)$ be the statement, “In the Josephus
game with $2^n$ people, Joe's friend should stand
in position $2^{n-1} + 1$.” Since $P(1)$ is the statement “In
the Josephus game with 2 people, Joe's friend should
stand in position 2,” we know that it is true. Now let
$m \geq 2$ be given, and assume that we already know
$P(1), P(2), \ldots, P(m - 1)$ to all be true. In the play
of the game with $2^m$ people, the order of elimination
is $2, 4, 6, \ldots, 2^m$, leaving the $2^{m-1}$ people labeled
$1, 3, 5, 7, \ldots, 2^m - 1$ for the next round of elimination.
Since $P(m - 1)$ is true, this game will eliminate
the person in position $2^{m-2} + 1$ next to last. Because
of their labels, we determine that this
person was originally labeled the $(2^{m-2} + 1)^{\text{th}}$ odd
number, which is $2(2^{m-2} + 1) - 1 = 2^{m-1} + 1$. This
establishes the truth of $P(m)$. ∎
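The claim can be explored by simulating the game. The sketch below assumes the rules implied by the proof: people stand in a circle labeled $1, 2, \ldots$, every second person is eliminated starting with person 2, and "Joe's friend" is the last person eliminated (so Joe, in position 1, is the survivor). The function name is ours.

```python
from collections import deque


def elimination_order(n_people):
    """Return (order of elimination, survivor) when every second person is removed."""
    circle = deque(range(1, n_people + 1))
    order = []
    while len(circle) > 1:
        circle.rotate(-1)                # skip one person
        order.append(circle.popleft())   # eliminate the next one
    return order, circle[0]


order, survivor = elimination_order(8)
print(order, survivor)  # [2, 4, 6, 8, 3, 7, 5] 1

for n in range(1, 11):
    order, survivor = elimination_order(2 ** n)
    assert survivor == 1
    assert order[-1] == 2 ** (n - 1) + 1
```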
12. Proof. Let $P(n)$ be the statement, “the product of $n$
odd integers is an odd integer.” The first statement $P(1)$
states, “the product of 1 odd integer is an odd integer,”
which is strange to say, but certainly true. We have
proved $P(2)$ before in Exercise 9(a) of Section 2.1.
Now let $m \geq 3$ be given such that $P(1), P(2), \ldots, P(m -
1)$ have all been verified, and let $a_1, a_2, \ldots, a_m$ be $m$ odd
integers. We must show that $a_1 a_2 \cdots a_m$ is an odd integer.
Let $p$ indicate the product $a_1 a_2 \cdots a_{m-1}$, so that
$a_1 a_2 \cdots a_m = p \cdot a_m$. By $P(m - 1)$, we know that $p$ is odd.
Hence by $P(2)$ we conclude that $p \cdot a_m$ is odd, and this
establishes $P(m)$. The result follows by induction. ∎
14. (a) Proof. Let $P(n)$ be the statement, “there exist
integers $q$ and $r$ such that $n = 3 \cdot q + r$ and
$0 \leq r \leq 2$.” If $n$ is less than 3, this statement is
clearly true (just use $q = 0$ and $r = n$). This establishes
$P(0)$, $P(1)$, and $P(2)$. Now let $m \geq 3$ be
given such that $P(0), P(1), P(2), \ldots, P(m - 1)$ have
all been verified. By $P(m - 3)$ we know we can write
$m - 3 = 3 \cdot q + r$ with $0 \leq r \leq 2$. Adding 3 to both
sides we obtain $m = 3 \cdot q + r + 3 = 3 \cdot (q + 1) + r$.
Since $q + 1$ and $r$ are integers and $r$ still satisfies
$0 \leq r \leq 2$, this establishes $P(m)$. The result follows
by induction. ∎
18. Proof by induction. Let $P(n)$ be the statement, “One
can make $n$ cents in postage using a combination of 3-cent
and 8-cent stamps.” We can check the first three
statements as follows:
• $P(14)$ is true since $3 + 3 + 8 = 14$.
• $P(15)$ is true since $3 + 3 + 3 + 3 + 3 = 15$.
• $P(16)$ is true since $8 + 8 = 16$.
Now let $m \geq 17$ be given such that $P(m)$ is the first
statement not yet checked. In particular, $P(m - 3)$ has
been checked, so we know that it is possible to make
$m - 3$ cents in postage using just these types of stamps.
Adding a 3-cent stamp to this postage constitutes $m$ cents
in postage, so $P(m)$ is true. ∎
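The induction shows that a combination always exists for $n \geq 14$; a short search can actually produce one. This is a sketch with a hypothetical helper `stamps_for`, not a procedure given in the text.

```python
def stamps_for(n):
    """Return (threes, eights) with 3*threes + 8*eights == n, or None if impossible."""
    for eights in range(n // 8 + 1):
        remainder = n - 8 * eights
        if remainder % 3 == 0:
            return remainder // 3, eights
    return None


print(stamps_for(13))  # None
print(stamps_for(14))  # (2, 1)

for n in range(14, 500):
    threes, eights = stamps_for(n)
    assert 3 * threes + 8 * eights == n
```

The value 13 is included to show why the base cases start at 14.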
Proof. Let P(n) be the statement, 









































17, 





ws “$ = 4" which is true. Now let 
mz 2be given such that P(1), PQ),..., Pim — 1) have 




already been checked to be true. In particular, P(m — 1) 
says, 





Section 2.5 Exercises 


1. Fill in the blanks with: $3K + 1$, $9L + 5$, $3K + 1$, $9L + 5$,
$K - 3L$, and “no counterexample exists”

3. (a) Proof. Suppose a counterexample $n$ to this statement
exists. Since $n$ makes the hypothesis true,
$n^2 = 2K$ for some integer $K$. Since $n$ makes the conclusion
false, $n = 2L + 1$ for some integer $L$. Combining
these equations gives us $2K = (2L + 1)^2$.
Using algebra, we find that $2K - 4L^2 - 4L = 1$,
from which it follows that $K - 2L^2 - 2L = \frac{1}{2}$. Since
$K - 2L^2 - 2L$ is an integer and $\frac{1}{2}$ is not, this contradiction
tells us no counterexample exists. ∎


(b) Proof. Suppose a counterexample $n$ to this statement
exists. Since $n$ makes the hypothesis true, $n^2 =
2K + 1$ for some integer $K$. Since $n$ makes the conclusion
false, $n = 2L$ for some integer $L$. Combining
these equations gives us $2K + 1 = (2L)^2$. Using algebra,
we find that $2K - 4L^2 = -1$, from which it
follows that $K - 2L^2 = -\frac{1}{2}$. Since $K - 2L^2$ is an integer
and $-\frac{1}{2}$ is not, this contradiction tells us no
counterexample exists. ∎

5. Proof by contradiction. Suppose as a counterexample
there are odd perfect squares $a$ and $b$ whose sum
is the perfect square $c$. From Exercise 3(b), we know
that $a$ and $b$ are the squares of odd integers. That is,
$a = (2K + 1)^2$ and $b = (2L + 1)^2$ for some integers $K$
and $L$. In this case,

$$\begin{aligned} c = a + b &= 4K^2 + 4K + 1 + 4L^2 + 4L + 1 \\ &= 4(K^2 + K + L^2 + L) + 2 \end{aligned}$$

Hence, $c$ is an even perfect square of the form $4M + 2$,
where $M$ is the integer $K^2 + K + L^2 + L$. But Exercise
4 tells us this is impossible, so no such counterexample
exists. ∎





19. 


jo 


9. 


10. 


n, 




The key step here is the fact that

This establishes that $P(n)$ is true, completing the induction. ∎
(a) When $m = 1$, if the Reader picks the set = Ø, then
it is impossible to follow the instruction, “choose an element
a ∈








Proof by contradiction. Suppose there is an integer $n$
which is of the form $5K + 3$ and of the form $5L + 1$ for
some integers $K$ and $L$. This means that

$$5K + 3 = 5L + 1, \text{ or } L - K = \tfrac{2}{5}.$$

Since $L - K$ is an integer, this contradiction shows that
there is no such integer $n$. ∎

Proof outline. Suppose to the contrary that there are
positive integers $a$ and $b$, with no common divisors, satisfying
$a^2 = 2b^2$. Since $a$ and $b$ have no common divisor
greater than 1, they cannot both be even. Thus either:
(1) $a$ is even and $b$ is odd; or (2) $b$ is even and $a$ is odd;
or (3) both are odd. Proceed by cases.

• Case 1. If $a$ is even and $b$ is odd, write $a = 2m$ and
$b = 2n + 1$, substitute into $a^2 = 2b^2$ and simplify
to get a contradiction.

• Case 2. If $a$ is odd and $b$ is even, write $a = 2m + 1$
and $b = 2n$ and again substitute into $a^2 = 2b^2$ to find
a contradiction.

• Case 3. If $a$ and $b$ are both odd, proceed similarly to
obtain a contradiction.

(a) Suppose a positive number $x$ divided by a positive
number $y$ results in a negative number $z$. Since
$x \div y = z$ implies that $x = y \cdot z$, this means that the
positive number $x$ is the product of a positive number
and a negative number, contradicting “rule” (i).
Hence, $z$ must be positive.

Suppose, to the contrary, that there is a rational number
$a$ such that $a + \sqrt{2}$ is rational. Then $(a + \sqrt{2}) - a$ is
rational. But this means $\sqrt{2}$ is rational, a contradiction
of Theorem 4. Hence, there is no such rational number.

13. The contrapositive is, “If $a + b$ is rational, then $a$ is irrational
or $b$ is rational.”

Proof. Let $a$ and $b$ be given such that $a + b$ is rational.
Either $a$ is irrational or $a$ is rational.

Case 1. If $a$ is irrational, then the conclusion, “$a$ is irrational
or $b$ is rational” is certainly true.

Case 2. If $a$ is rational, then by Exercise 9 in Section 2.2,
$(a + b) - a = b$ is rational, and hence the conclusion,
“$a$ is irrational or $b$ is rational” is also true.

Since in either case the same conclusion holds, it must
be true that $a$ is irrational or $b$ is rational. ∎

14. (a) Proof. Since $6\left(\frac{5}{3}\right)^2 + 11\left(\frac{5}{3}\right) = \frac{50}{3} + \frac{55}{3} = 35$, it
follows that there exists a positive rational number
$r$ such that $6r^2 + 11r = 35$. ∎
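An existence proof like this one only needs a single witness, and exact rational arithmetic makes the check unambiguous. The sketch below assumes the witness $r = \frac{5}{3}$ (the positive root of $6r^2 + 11r - 35 = 0$).

```python
from fractions import Fraction

r = Fraction(5, 3)             # candidate witness
value = 6 * r ** 2 + 11 * r    # exact arithmetic, no rounding

print(value)  # 35
assert r > 0 and value == 35
```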

16. The contrapositive of Proposition 3 is, “If $r$ is rational,
then $r^2 \neq 2$.”

Proof. Let the rational number $r$ be given. By Proposition 1,
$r = \frac{a}{b}$ for integers $a$ and $b$ having no common
divisor greater than 1. This means that $a$ and $b$ do not
have 2 as a common divisor. By the contrapositive of
Proposition 2, this means that $\left(\frac{a}{b}\right)^2 \neq 2$. That is, $r^2 \neq 2$,
as desired. ∎

18. (a) For every integer $n$, $n$ is not the largest integer.

(b) Suppose we are given an integer $n$. To show that $n$
is not the largest integer, all we have to do is find a
larger integer, and $n + 1$ certainly fits that description.

19. Hint: The contrapositive statement is, “If $n$ is even,
then $5n + 4$ is even.” The proof is very straightforward.
(Write $n = 2k$ and use this to write $5n + 4$ as $2 \cdot$ (some
integer).)

22. The contrapositive statement is, “If each of the four children
is less than ten years old, then the average age of
the children is not ten years old.”
Proof. Let $a$, $b$, $c$, $d$ represent the ages of the four children,
each of whom is less than ten years old. Since
$a < 10$, $b < 10$, $c < 10$, and $d < 10$, we know that $a +
b + c + d < 40$, from which it follows that $\frac{a + b + c + d}{4} < 10$.
Hence, the average of $a$, $b$, $c$, and $d$ is not ten. ∎

24. Proof by contradiction. Suppose to the contrary that
there exist 10 numbers that make the hypothesis true
but the conclusion false. Then we know the average
of the numbers is 89.63, and we also know that one
of the numbers is less than 89.63; call that number $s$.
Label the remaining numbers $x_1, x_2, \ldots, x_9$. Since the
conclusion is false, each $x_i$ satisfies $x_i \leq 89.63$. Thus,
$x_1 + x_2 + \cdots + x_9 \leq 9(89.63)$. Now since $s < 89.63$, we
know $s + x_1 + x_2 + \cdots + x_9 < 89.63 + 9(89.63)$, that is,
$s + x_1 + x_2 + \cdots + x_9 < 10(89.63)$. But this contradicts
the statement that the average is 89.63. ∎

























26. (a) Pigeonhole Principle (Basic Version): If $n + 1$ objects
are distributed among $n$ boxes, then some box must
contain more than one object.
Proof. Let $n + 1$ objects be given, and label the $n$
boxes $1, 2, 3, \ldots, n$. After the objects have been distributed
among the boxes, define $x_1, x_2, \ldots, x_n$ by the
following rule:

$x_i$ = the number of objects in box $i$

Since each object can go into only one box, we know
that

$$x_1 + x_2 + \cdots + x_n = n + 1$$
which means the average value of the $x_i$'s is

$$\frac{x_1 + x_2 + \cdots + x_n}{n} = \frac{n + 1}{n} = 1 + \frac{1}{n}$$

According to the Average Version, there is an $x_i$ that
is at least $1 + \frac{1}{n}$. Since $x_i$ is an integer value, this means
that $x_i > 1$, which can be interpreted as meaning, “more
than one object is in the box labeled $i$.” ∎
27. Statement of contrapositive. If integers $x$, $y$, and $z$ satisfy
$x < 4$, $y < 4$, and $z < 5$, then $x + y + z < 11$.
Proof. Since $x < 4$, $y < 4$, and $z < 5$, and since $x$, $y$,
and $z$ are integers, we know that $x \leq 3$, $y \leq 3$, and $z \leq 4$.
So $x + y + z \leq 3 + 3 + 4$, that is, $x + y + z \leq 10$. Since
$10 < 11$, we conclude that $x + y + z < 11$. ∎
We can write the basic version as: Objects are distributed
among $n$ boxes. If $n + 1$ objects are distributed,
then some box must contain more than one object.
Contrapositive: If no box contains more than one object,
then the number of objects distributed is not $n + 1$.
Proof. After the objects have been distributed among
the boxes, define $x_1, x_2, \ldots, x_n$ by the following rule:

$x_i$ = the number of objects in box $i$

Since no box contains more than one object, we know
that each $x_i$ satisfies $x_i \leq 1$. Thus the total number of
objects distributed is given by

$$x_1 + x_2 + \cdots + x_n \leq 1 + 1 + \cdots + 1 = n$$

This shows that the total cannot be $n + 1$. ∎

31. This statement is similar to Exercise 24: “For any list of
numbers, if one of the numbers is greater than the average,
then at least one of the numbers must be less than
the average.” The proof of the statement is related to
the Average Version of the Pigeonhole Principle. However,
its truth is intuitively obvious to almost everyone,
and that is why Garrison Keillor's sign-off is funny to almost
everyone. (Some of today's bureaucrats don't get
the joke.)










32. (a) Proof. Define Boxes 0, 1, 2, 3, 4, and 5 by the
following rule: “Place integer $x$ into Box $r$ if $x$ is
of the form $6q + r$.” The Remainder Theorem tells
us that every integer can be placed in this way. Let
seven integers be given. By the Pigeonhole Principle,
some box must contain (at least) two integers.
Let's call the two integers $x$ and $y$, and say they are
in Box $d$. The rule defining the boxes tells us that
$x = 6 \cdot K + d$ and $y = 6 \cdot L + d$ for some integers $K$
and $L$. In this case,

$$\begin{aligned} x - y &= (6K + d) - (6L + d) \\ &= 6(K - L) \end{aligned}$$

Since $K - L$ is an integer, this means that $x - y$, the
difference between $x$ and $y$, is divisible by 6. ∎
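The box-filling argument translates directly into a search: drop each integer into the box for its remainder on division by 6, and stop as soon as a box is used twice. The sketch below uses our own function name and randomly chosen test data.

```python
import random


def pair_with_difference_divisible_by_6(numbers):
    """Return two entries sharing a remainder mod 6, or None if no box is reused."""
    boxes = {}  # remainder -> first integer placed in that box
    for x in numbers:
        r = x % 6
        if r in boxes:
            return boxes[r], x
        boxes[r] = x
    return None


print(pair_with_difference_divisible_by_6([3, 10, 25, 8, 41, 0, 14]))  # (8, 14)

# With seven integers and six boxes, a pair is always found.
for _ in range(1000):
    sample = [random.randint(-10**6, 10**6) for _ in range(7)]
    x, y = pair_with_difference_divisible_by_6(sample)
    assert (x - y) % 6 == 0
```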
34. Proof. Let five integers be given. Think of two boxes,
one labeled “divisible by 3” and one labeled “not divisible
by 3.” By the distribution version of the Pigeonhole
Principle (with $n = 3$ and $m = 2$), we conclude that
there are at least three of the numbers in one box. Let's
refer to these three numbers as $a$, $b$ and $c$, and consider
two cases based on which box they are in.
• Case 1. If $a$, $b$, $c$ are in the box labeled “divisible by
3,” then $a^2 + b^2 + c^2$ is divisible by 3 because $a = 3K$,
$b = 3L$ and $c = 3J$ (where $K$, $L$, and $J$ are integers)
imply that

$$\begin{aligned} a^2 + b^2 + c^2 &= (3K)^2 + (3L)^2 + (3J)^2 \\ &= 3(3K^2 + 3L^2 + 3J^2) \end{aligned}$$

which is certainly divisible by 3.
• Case 2. If $a$, $b$, $c$ are in the box labeled “not divisible
by 3,” then by Practice Problem 4 from Section
2.2, $a^2$ can be written in the form $3K + 1$, $b^2$ can be
written in the form $3L + 1$, and $c^2$ can be written in
the form $3J + 1$. Hence, $a^2 + b^2 + c^2$ is divisible by













Section 2.6 Exercises 


1. (a) 100011; (c) 1111011

2. (a) 120; (c) 443

3. (a) 43; (c) 173

4. (a) 29; (c) 10

5. For convenience in reading them, we show the binary 
numbers in groups of 4 bits. (a) (1101 1010 1101);; (b) 
(1.1111 0000 1013: (D (19); (e) (BODE) 5 

6. For convenience in reading them, we show the binary
numbers in groups of 3 bits. (a) $(1\ 111)_2$; (b) $(10\ 000\ 000\
101)_2$; (d) $(31)_8$; (e) $(130336)_8$
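Conversions between binary, octal, and hexadecimal can be cross-checked with Python's built-in base handling. The sketch below assumes the readings $(19)_{16}$ and $(\mathrm{B0DE})_{16}$ for Exercise 5(d) and 5(e) and $(31)_8$ and $(130336)_8$ for Exercise 6(d) and 6(e); the helper `grouped_binary` is ours and groups bits from the right, as the answers do.

```python
def grouped_binary(n, size):
    """Binary digits of n split into groups of `size` bits, counted from the right."""
    bits = format(n, "b")
    groups = []
    while bits:
        groups.append(bits[-size:])
        bits = bits[:-size]
    return " ".join(reversed(groups))


print(grouped_binary(0xDAD, 4))  # 1101 1010 1101
print(grouped_binary(0o17, 3))   # 1 111

# The same number written in hexadecimal and in octal.
assert int("19", 16) == int("31", 8)
assert int("B0DE", 16) == int("130336", 8)
print(grouped_binary(int("B0DE", 16), 4))  # 1011 0000 1101 1110
```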

7. (a) (Fu) Q5): (A) (6655): (e) (4210421), 

8. Since this is an “if and only if” proposition, there are two
separate proofs to be completed.














3 because in this case,
$$\begin{aligned} a^2 + b^2 + c^2 &= (3K + 1) + (3L + 1) + (3J + 1) \\ &= 3(K + L + J + 1) \end{aligned}$$

In either case, $a^2 + b^2 + c^2$ is divisible by 3, completing
the proof. ∎





38. (a) Considering the four triangular regions shown to be
“boxes,” any five points will require two to share
one box by the Pigeonhole Principle.

The greatest distance between two points in one
of these small triangular regions is the distance between
vertices, which is $\frac{1}{2}$. Hence, there must be two
points within $\frac{1}{2}$ of each other.

(a) If $K\pi$ is in Box 1, then $K\pi$ is within $\frac{1}{n}$ of an integer.

(b) If no multiple of $\pi$ is in Box 1, then all $n$ given
multiples are distributed among $n - 1$ boxes, so the
Pigeonhole Principle guarantees that some box will
contain (at least) two multiples.

(c) If $K\pi$ and $L\pi$ are both in the same box (where
$K > L$, say), then $K\pi - L\pi$ has a fractional part
between $\frac{n-1}{n}$ and 1 or between 0 and $\frac{1}{n}$.

(d) In either case described in (c), $K\pi - L\pi$ is within $\frac{1}{n}$
of an integer. Since $(K - L)\pi$ is one of the multiples
originally described, this is the desired conclusion.











• Claim 1. If $x$ is divisible by 3, then the sum of the
decimal digits of $x$ is divisible by 3.
Proof. Let a natural number $x$ which is divisible by
3 be given, and let $s$ be the sum of the decimal digits
of $x$. Since $x$ is divisible by 3, we know that $x = 3K$
for some integer $K$. By Proposition 3, we know that
$x - s$ is divisible by 9, so $x - s = 9L$ for some integer
$L$. From this it follows that $3K - 9L = s$, so
$s = 3(K - 3L)$. Hence, $s$ is divisible by 3. ∎
• Claim 2. If the sum of the decimal digits of $x$ is divisible
by 3, then $x$ is divisible by 3.
Proof. Let a natural number $x$ whose digits sum
to a number $s$ that is divisible by 3 be given. That
is, $s = 3K$ for some integer $K$. By Proposition 3, we
know that $x - s = 9L$ for some integer $L$, and so
$x = s + 9L = 3K + 9L = 3(K + 3L)$. Hence, $x$ is divisible
by 3. ∎
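Both claims lean on the fact quoted from Proposition 3, that $x - s$ is divisible by 9 when $s$ is the digit sum of $x$. A finite check of that fact and of the resulting "if and only if" is sketched below; `digit_sum` is our own helper.

```python
def digit_sum(x):
    """Sum of the decimal digits of a natural number x."""
    return sum(int(ch) for ch in str(x))


for x in range(1, 5000):
    s = digit_sum(x)
    assert (x - s) % 9 == 0                  # the fact quoted from Proposition 3
    assert (x % 3 == 0) == (s % 3 == 0)      # Claims 1 and 2 together
```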
9.0 1or4 
11. (a) and (c) 
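12. Let the integer $n$ be represented as $d_4 d_3 d_2 d_1 d_0$ in base
$b$. This means that

$$n = d_4 \cdot b^4 + d_3 \cdot b^3 + d_2 \cdot b^2 + d_1 \cdot b + d_0, \text{ and so}$$
$$n \cdot b = d_4 \cdot b^5 + d_3 \cdot b^4 + d_2 \cdot b^3 + d_1 \cdot b^2 + d_0 \cdot b + 0.$$

Hence, $n \cdot b$ is represented as $d_4 d_3 d_2 d_1 d_0 0$ in base $b$.
13. Nine is a counterexample.
14. Seven is a counterexample.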


Section 2.7 Exercises 


L @)a=2,b=2,n (©) A trivial 
example is n = 2 since 1? s 5 and 5 does not divide 
2.1, A more substantial example is 1 = 19 noting that 
® my 5, but 5 does not divide 19 18, 

2. (a) The missing steps are: This means $a - b = Kn$ and
$c - d = Ln$ for some integers $K$ and $L$. Adding these
equations, we get

$$a - b + c - d = Kn + Ln$$

which can be rearranged as $(a + c) - (b + d) = (K +
L) \cdot n$. Since $K + L$ is an integer, this means that $(a +
c) - (b + d)$ is divisible by $n$.

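3. The mod 7 tables are given below.

+ | 0 | 1 | 2 | 3 | 4 | 5 | 6
0 | 0 | 1 | 2 | 3 | 4 | 5 | 6
1 | 1 | 2 | 3 | 4 | 5 | 6 | 0
2 | 2 | 3 | 4 | 5 | 6 | 0 | 1
3 | 3 | 4 | 5 | 6 | 0 | 1 | 2
4 | 4 | 5 | 6 | 0 | 1 | 2 | 3
5 | 5 | 6 | 0 | 1 | 2 | 3 | 4
6 | 6 | 0 | 1 | 2 | 3 | 4 | 5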
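Tables of this kind are fully determined by the modulus, so they can be regenerated rather than copied. The sketch below builds both an addition and a multiplication table for arithmetic mod 7; that the exercise asks for exactly these two operations is our assumption, and `mod_table` is our own helper.

```python
def mod_table(n, op):
    """Table whose entry in row a, column b is op(a, b) reduced mod n."""
    return [[op(a, b) % n for b in range(n)] for a in range(n)]


add7 = mod_table(7, lambda a, b: a + b)
mul7 = mod_table(7, lambda a, b: a * b)

for name, table in (("+", add7), ("*", mul7)):
    print(name, *range(7))
    for a, row in enumerate(table):
        print(a, *row)
    print()
```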




















0 ———  ——————————— 
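Tables of this kind are easy to regenerate. The sketch below assumes the exercise asks for both the addition and the multiplication table mod 7; the helper names mod_table and format_table are hypothetical.

```python
import operator


def mod_table(n, op):
    """Return the n-by-n table whose (a, b) entry is op(a, b) reduced mod n."""
    return [[op(a, b) % n for b in range(n)] for a in range(n)]


def format_table(n, op, symbol):
    """Render the table as plain text with a header row and row labels."""
    header = f"{symbol} | " + " ".join(str(b) for b in range(n))
    rows = [f"{a} | " + " ".join(str(v) for v in row)
            for a, row in enumerate(mod_table(n, op))]
    return "\n".join([header] + rows)


print(format_table(7, operator.add, "+"))
print()
print(format_table(7, operator.mul, "*"))
```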







17. Outline. First use induction to prove Proposition 1: For all n ≥ 0, 8^n − 1 is divisible by 7. The proof is similar to Exercise 5 in Section 2.

Next prove Proposition 2: If s is the sum of the digits in the octal representation of x, then x − s is divisible by 7. The proof is similar to Proposition 3 in this section.

Finally, to prove the "if and only if" statement of Exercise 17, we must prove two separate things: Claim 1. If a natural number x is divisible by 7, then the sum of its octal digits is divisible by 7; and Claim 2. If a natural number x has the sum of its octal digits divisible by 7, then x is divisible by 7. These are similar to the proof of Exercise 8 in this section.
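The outline above can be sanity-checked by brute force before writing the proof. This is a sketch only; octal_digit_sum is a name invented for the example.

```python
def octal_digit_sum(x: int) -> int:
    """Sum of the digits of x written in base 8."""
    return sum(int(d) for d in oct(x)[2:])  # oct(x) looks like '0o17'


def proposition_2_holds(limit: int) -> bool:
    """x minus its octal digit sum should always be divisible by 7."""
    return all((x - octal_digit_sum(x)) % 7 == 0 for x in range(limit))


def iff_holds(limit: int) -> bool:
    """x is divisible by 7 exactly when its octal digit sum is."""
    return all((x % 7 == 0) == (octal_digit_sum(x) % 7 == 0)
               for x in range(limit))
```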

19. (b) #FFFF00 is yellow; (c) 16777216

20. We use T for ten and E for eleven. (a) 4321; (c) 371













































































263!'245!-97!-810 = 10 


8. Proposition 3 tells us that x − s_x is divisible by 9. This means x − s_x = 9K for some integer K, which means x ≡ s_x (mod 9).
10. (a) Every x 2: 2 will work: (c) Every x i dors ej 8 





will work; (e) Every x = 4 or x =y 13 will work 

11. p can be any element of {2, 5, 13, 17, 29, 37, 41, 53, 61, 73, 89, 97}. Except for 2, these are the primes of the form 4K + 1, where K is an integer.
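The description of the list can be confirmed directly. The sketch below only checks that the twelve listed values are 2 together with the primes below 100 of the form 4K + 1; it makes no assumption about the exercise statement itself. The names are illustrative.

```python
LISTED = {2, 5, 13, 17, 29, 37, 41, 53, 61, 73, 89, 97}


def is_prime(n: int) -> bool:
    """Trial division; adequate for the small range used here."""
    if n < 2:
        return False
    return all(n % d != 0 for d in range(2, int(n ** 0.5) + 1))


def primes_of_form_4k_plus_1(limit: int) -> set[int]:
    """All primes p < limit with p = 4K + 1 for some integer K."""
    return {p for p in range(limit) if is_prime(p) and p % 4 == 1}
```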

13. Outline of proof. This is equivalent to the claim, "n^5 ≡ n (mod 10) for all n ∈ N." Let n ∈ N be given. By the Remainder Theorem, n ≡ d (mod 10) for some d ∈ {0, 1, 2, 3, 4, 5, 6, 7, 8, 9} (d is the units digit of n), so you can argue in ten cases (for example, if n ≡ 5 (mod 10), then n^5 ≡ 3125 ≡ 5 (mod 10)).
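The ten cases mentioned in the outline can be tabulated mechanically. This sketch assumes the claim is n^5 ≡ n (mod 10), as the 3125 example suggests; fifth_power_cases is a made-up name.

```python
def fifth_power_cases() -> dict[int, int]:
    """Map each possible units digit d to the units digit of d**5."""
    return {d: pow(d, 5, 10) for d in range(10)}


# Each line shows one of the ten cases of the argument.
for d, result in fifth_power_cases().items():
    print(f"n = {d} (mod 10)  ->  n^5 = {d ** 5} = {result} (mod 10)")
```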
14. (a) 7; (b) Meet at Moe's; (c) (1) Making n = 77 and the exponent 43 publicly available makes it easy to determine p = 7 and q = 11 and hence that k = 60. From this, it is pretty easy to find the decryption key. Hence, using a small value of n is not very secure. (2) Encrypting one character at a time means there will be a lot of repetition of numbers in your message. A long message can be easily broken using knowledge about common letters and letter patterns. Hence, encrypting one letter at a time can be broken without using RSA methods at all.
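Point (1) can be made concrete with a few lines of code. This is a minimal sketch, assuming that 43 is the public encryption exponent and that k = (p − 1)(q − 1); the function names are hypothetical, and pow with a negative exponent needs Python 3.8 or later.

```python
def factor_semiprime(n: int) -> tuple[int, int]:
    """Find p <= q with p * q == n by trial division (fine for tiny n)."""
    for p in range(2, int(n ** 0.5) + 1):
        if n % p == 0:
            return p, n // p
    raise ValueError("n has no nontrivial factor")


def recover_decryption_key(n: int, e: int) -> int:
    """Recover d with e * d = 1 (mod k), where k = (p - 1)(q - 1)."""
    p, q = factor_semiprime(n)
    k = (p - 1) * (q - 1)
    return pow(e, -1, k)  # modular inverse of e mod k
```

With n = 77 the factoring loop finishes after a handful of steps, which is exactly why such a small modulus offers no protection.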














Section 3.1 Exercises 


1. (a) {2}
(b) {1, 2, 4, 8}
(c) {3, 4, 7, 8, 9}
(d) {8}
(e) {5, 6, 10}

Aw (rez: 





2y for some y € Z} 
(b) (x € Nix = 2 for some y e N) 
(©) xe Nix = 2y- 1 forsome y € N) 
(@) (x € Nix = (2y +1} for some y € N) 
& (a) (7, 13, 19,25,31, 37,43...) 
(b) (...— 17,1, —5.1,4,7,10,13,16,...] 
(c) (3,5,9,11,15,17,21, 23,27, 0.) 
4) (4. 10, 16,22, 28, 34, 40,46, $2...) 
7. (a) rix eZ] 
O (esx eZ) 
(©) (I0c7:x € N) 
@ (peQ:-bca- b] 
8, (a) Z because 0 € Nand 0 ¢ Z*: (c) Z because =} € Q 
and-b£R'i(e) c 
9. (a) B 
10, (a) [2r + 1: x € ZI: () 2? sy € N} 
11. (@) (], $. $. $. 8, .. J, universal set = Q; 
40 (1,2, ],4, 4.8, £.9,1.27, 2... J. universal set = Q 
12. (a) all positive odd numbers (2x + 1 : x € NJ; (c) start- 




















ing at 1, increased by 8 every time (8x +1 :x € N} 
13. (a) True.

(b) False. A = {1, 2}, B = {a, b}.

(c) True.

(d) False. A = {1, 2}, B = {a, b, c}.

1& à) AN (BUC) - (An B)U(AnC) 
(AUBY =A'NB' = (6.7.8.9. 10}; (e) A 
A= (1.3.5) 





Sk © 








17. (a) A computer search yields 341, 561, and 645 as the only three possibilities; (c) A computer search yields 124, 217, 561, and 781 as the only four possibilities.
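A search of the kind mentioned in part (a) might look like the sketch below. It assumes, without the exercise statement to confirm it, that the search is for composite n below 1000 satisfying 2^(n−1) ≡ 1 (mod n); under that assumption it returns exactly the three values listed. The function names are invented for illustration.

```python
def is_composite(n: int) -> bool:
    """True when n > 1 has a divisor strictly between 1 and n."""
    return n > 3 and any(n % d == 0 for d in range(2, int(n ** 0.5) + 1))


def fermat_pseudoprimes(base: int, limit: int) -> list[int]:
    """Composite n < limit with base**(n-1) congruent to 1 mod n."""
    return [n for n in range(2, limit)
            if is_composite(n) and pow(base, n - 1, n) == 1]
```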

18. (a) Proposition. If n ≡ 5 (mod 6), then n ≡ 1 (mod 2) and n ≡ 2 (mod 3).

Proof. Let n be given with n ≡ 5 (mod 6). This means that we can write n = 6k + 5 for some integer k. So n = 6k + 5 = 2(3k + 2) + 1. Since 3k + 2 is an integer, this shows that n ≡ 1 (mod 2). Also, n = 6k + 5 = 3(2k + 1) + 2, and since 2k + 1 is an integer this shows that n ≡ 2 (mod 3). ∎
19. (a) x = 43; (c) No solution; (e) No solution.








16. Only the final Venn diagrams illustrating the property 
are shown, 














17. The Venn diagram for each side of the equation is given. For each pair that does not match, an example is given to illustrate the difference.
































[Venn diagrams for A − (B ∩ C) and (A − B) ∪ (A − C)]
[Venn diagrams for (B ∩ C) − A and B ∩ (C − A)]


19. (a) Any partial sum of the harmonic series, in lowest terms, has an even denominator.

(b) Any partial sum of the harmonic series is between 1 and 2.

(c) No rational number between 1 and 2 has an even denominator (in lowest terms).


Section 3.2 Exercises 
1. (a) {(2, 1), (2, 2), (2, 8), (4, 1), (4, 2), (4, 8)};
(c) {∅, {1}, {2}, {8}, {1, 2}, {1, 8}, {2, 8}, {1, 2, 8}}
3 (4694 
4 a) (1), (12). (2). (23), (AI), (112). (123). 
(23), (123) 





s 
Ti 


AU BUC 4 Z;forexample,3 ¢ AU BUC. 
#AUBUCUD and so AU RUCUD & Z 
We know the three sets are non-empty since 0 ∈ A, 1 ∈ B and 3 ∈ C. The Remainder Theorem tells us that every integer can be written in one of the forms 4k, 4k + 1, 4k + 2 or 4k + 3. This means that any given integer is either in A (if it's of the form 4k or 4k + 2) or in B (if it's of the form 4k + 1) or in C (if it's of the form 4k + 3). No pair of these sets overlap, as can be shown with a simple proof by contradiction. For example, to show that A ∩ B = ∅, we simply assume that there is an x ∈ A ∩ B, and argue that x = 2K and x = 4L + 1 for some integers K and L. From this it will follow that K − 2L = 1/2, a contradiction to the closure of the integers.
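The partition described above can be modelled in a few lines. The sketch assumes A is the set of even integers, B the integers of the form 4k + 1, and C those of the form 4k + 3, as the argument indicates; block_of is a hypothetical helper.

```python
def block_of(n: int) -> str:
    """Name the unique block of the partition that contains the integer n."""
    r = n % 4  # Python's % returns 0..3 even for negative n
    if r in (0, 2):
        return "A"  # forms 4k and 4k + 2, i.e. the even integers
    return "B" if r == 1 else "C"


def blocks_seen(lo: int, hi: int) -> set[str]:
    """The block names hit by the integers in range(lo, hi)."""
    return {block_of(n) for n in range(lo, hi)}
```

Because block_of returns exactly one name for every integer, the three sets cover Z and are pairwise disjoint.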



















«cca 
20. (a) (-5,4,—3, 2, —1,0,1,2,3] 
(b) [31,4] 
oo 
22. (a) 0.12]: (e) [2.27] 








23. (a) n(A) = 8, n(B) = 9, n(C) = 12
(b) (i) 4, 13; (ii) 5, 15; (iii) 7, 14; (iv) 3; (v) 16

26. (a) 668 

27. (a) 4200; (c) 560 

29. As the diagram shows, there are three students that have 
completed all three stations, 

















32. (a) True; (c) False. A = {1}, B = {2}

10. (a) Let A=(1), B= (QC = (3); () Let A= 
112). 3) 

13. (a) 8; (b) 16; (c) 12; (d) 4096; (e) 128


18. 10 
16. (a) Not a partition, because not all elements of S are subsets of A; (c) Not a partition, since 6 is not an element of any part.
17. (a) {{1, 2, 3, 4}, {5, 6, 7, 8}}; (b) {{2, 4, 6, 8}, {1, 3, 5, 7}}
19. Proof. Let S(n) be the statement, "If n ≥ 1 is an integer, A = {1, 2, 3, 4, 5}, B = {1, 2, 3, ..., n}, then n(A × B) = 5n." Statement S(1) is true since

A × B = {1, 2, 3, 4, 5} × {1} = {(1, 1), (2, 1), (3, 1), (4, 1), (5, 1)}.

Now let m ≥ 2 be given such that statements S(1), S(2), ..., S(m − 1) have all been verified to be true. In particular, S(m − 1) states that {1, 2, 3, 4, 5} × {1, 2, ..., m − 1} is a set containing 5(m − 1) elements (ordered pairs). Now consider the next statement, S(m). The set (Cartesian product) {1, 2, 3, 4, 5} × {1, 2, ..., m − 1, m} clearly contains all of the elements (ordered pairs) in the set {1, 2, 3, 4, 5} × {1, 2, ..., m − 1}, as well as the five elements (1, m), (2, m), (3, m), (4, m) and (5, m), for a total of 5(m − 1) + 5 = 5m elements. This establishes statement S(m), completing the induction. ∎
21. Let A = {1, 2}, B = {b_1, b_2}, C = {x}.
Then,

A × B × C = {(1, b_1, x), (1, b_2, x), (2, b_1, x), (2, b_2, x)},





Section 3.3 Exercises 


1. (a) Proof. Dear Reader, Recall that N = {0, 1, 2, 3, ...} and that a natural number x is odd if and only if it can be written as x = 2y + 1, for some y ∈ N. Now, you must agree that each of 1, 3, 5, 7 and 9 is a natural number. Also, note that 1 = 2·0 + 1, 3 = 2·1 + 1, 5 = 2·2 + 1, 7 = 2·3 + 1 and 9 = 2·4 + 1, which establishes that each of 1, 3, 5, 7 and 9 is an odd natural number. Hence, each element of {1, 3, 5, 7, 9} is also an element of {k ∈ N : k is odd}. That is, {1, 3, 5, 7, 9} ⊆ {k ∈ N : k is odd}. Your old friend, Author
(c) This is false since 2 is a prime number that is not odd.

2. (a) Proof. Let x ∈ {4m : m ∈ Z}, so that x = 4m for some m ∈ Z. We can write x = 4m = 2(2m). Since 2m ∈ Z, x ∈ {2n : n ∈ Z}, and so {4m : m ∈ Z} ⊆ {2n : n ∈ Z}. ∎

(c) Proof. Let x ∈ Z. We can write x = x/1. Since 1 and x are both in Z, it follows that x ∈ Q, and so Z ⊆ Q. ∎
(e) Proof. Let x ∈ {2n + 1 : n ∈ Z} ∩ {5m + 4 : m ∈ Z}, so that x = 2n + 1 and x = 5m + 4, for some n, m ∈ Z. Now the fact that 2n + 1 = 5m + 4 implies that 2n + 2 = 5m + 5 = 5(m + 1), and so 2 divides m + 1. Therefore, m + 1 = 2k or m = 2k − 1 for some k ∈ Z, and so

x = 5m + 4 = 5(2k − 1) + 4 = 10(k − 1) + 9.

Since (k − 1) ∈ Z, it follows that x ∈ {10k + 9 : k ∈ Z}. Thus we have established that ({2n + 1 : n ∈ Z} ∩ {5m + 4 : m ∈ Z}) ⊆ {10k + 9 : k ∈ Z}. ∎
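The inclusion proved in part (e) can be checked on a window of integers. The sketch below is illustrative only; the predicate names are assumptions.

```python
def in_intersection(x: int) -> bool:
    """x = 2n + 1 for some integer n, and x = 5m + 4 for some integer m."""
    return x % 2 == 1 and x % 5 == 4


def in_target(x: int) -> bool:
    """x = 10k + 9 for some integer k."""
    return x % 10 == 9


def inclusion_holds(lo: int, hi: int) -> bool:
    """Every x in range(lo, hi) lying in the intersection is in the target."""
    return all(in_target(x) for x in range(lo, hi) if in_intersection(x))
```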
4. (a) The missing step: This means that x ∈ A and x ∈ B.
(b) The missing step: It follows that x ∈ A or x ∈ B.
(c) The missing steps:
• Case 1: If x ∈ A, then since A ⊆ B, we can infer that x ∈ B.
• Case 2: If x ∈ B, then we can also infer that x ∈ B.
5. The missing steps: Since A ⊆ B, we know that x ∈ B, and since A ⊆ C, we know that x ∈ C.
6. (a) Proof. Let x ∈ {10n − 1 : n ∈ Z} be given. This means that x = 10n − 1 for some n ∈ Z. We can


and

(A × B) × C = {((1, b_1), x), ((1, b_2), x), ((2, b_1), x), ((2, b_2), x)}.

The difference between these sets is that the first is a set of ordered 3-tuples, whereas the second set contains ordered pairs for each of which the first coordinate is itself an ordered pair.


write x = 10n − 1 = 2(5n − 1) + 1, and since (5n − 1) ∈ Z, it follows that x ∈ {2k + 1 : k ∈ Z}. This proves that {10n − 1 : n ∈ Z} ⊆ {2k + 1 : k ∈ Z}.

In a similar manner, we can write x = 10n − 1 = 5(2n − 1) + 4, and since (2n − 1) ∈ Z, it follows that x ∈ {5m + 4 : m ∈ Z}. This proves that {10n − 1 : n ∈ Z} ⊆ {5m + 4 : m ∈ Z}. Now by Exercise 5, we can conclude that {10n − 1 : n ∈ Z} ⊆ {2k + 1 : k ∈ Z} ∩ {5m + 4 : m ∈ Z}. ∎
7. The missing step for Case 1: Since A ⊆ C, we can infer that x ∈ C. The missing step for Case 2: Since B ⊆ C, we can infer that x ∈ C.

(a) Proof. Let x ∈ {4k + 1 : k ∈ Z} so that x = 4k + 1 for some k ∈ Z. Since we can write x = 4k + 1 = 2(2k) + 1, and 2k ∈ Z, we have that x ∈ {2n + 1 : n ∈ Z}. Hence, {4k + 1 : k ∈ Z} ⊆ {2n + 1 : n ∈ Z}. Now let y ∈ {4m + 3 : m ∈ Z}, so that y = 4m + 3 for some m ∈ Z. Since we can write y = 4m + 3 = 2(2m + 1) + 1, and (2m + 1) ∈ Z, we have that y ∈ {2n + 1 : n ∈ Z}. Hence {4m + 3 : m ∈ Z} ⊆ {2n + 1 : n ∈ Z}. By Exercise 7, we now have that ({4k + 1 : k ∈ Z} ∪ {4m + 3 : m ∈ Z}) ⊆ {2n + 1 : n ∈ Z}. ∎

(a) The missing steps:

Case 1. If x ∈ A, then we can truthfully say that x ∈ A or x ∈ B, which can be written x ∈ A ∪ B. But we can also truthfully say that x ∈ A or x ∈ C, which can be written x ∈ A ∪ C. Since both of these inferences are valid, we can conclude x ∈ (A ∪ B) ∩ (A ∪ C).

Case 2. If x ∈ B ∩ C, then x ∈ B and x ∈ C, so we can say that x ∈ A or x ∈ B, which can be written x ∈ A ∪ B. We can also say that x ∈ A or x ∈ C, which can be written x ∈ A ∪ C. Since both of these inferences are valid, we can conclude x ∈ (A ∪ B) ∩ (A ∪ C).

(a) Proof. Let sets A and B be given such that A ∪ B = B. To show that A ∩ B = A, we must show that A ∩ B ⊆ A and A ⊆ A ∩ B. Proposition 1 establishes that A ∩ B ⊆ A is always true, so we only need to establish A ⊆ A ∩ B using an element-wise proof.

Let x ∈ A be given. Since Practice Problem 1 tells us that A ⊆ A ∪ B, we can infer that x ∈ A ∪ B. Since we are given that A ∪ B = B, we know that x ∈ B. Since x ∈ A and x ∈ B, we know that x ∈ A ∩ B.

This establishes that A ⊆ A ∩ B, and hence we conclude that A = A ∩ B, as desired. ∎
(c) Proof. Let sets A, B and C be given such that A ∩ B = A and B ∩ C = B. To show that A ∩ C = A, we must show that A ∩ C ⊆ A and A ⊆ A ∩ C. Proposition 1 establishes that A ∩ C ⊆ A is always true, so we only need to establish A ⊆ A ∩ C using an element-wise proof.

Let x ∈ A be given. Since we are given that A ∩ B = A, we know that x ∈ A ∩ B, which implies that x ∈ B, since A ∩ B ⊆ B by Proposition 1. Since we are given that B ∩ C = B, we know that x ∈ B ∩ C, which implies that x ∈ C, since B ∩ C ⊆ C by Proposition 1 again. Hence, x ∈ A and x ∈ C, so x ∈ A ∩ C. This establishes that A ⊆ A ∩ C, and hence we conclude that A = A ∩ C, as desired. ∎
(a) Proof by contradiction. Assume that there is an element a ∈ N in both sets {2k + 1 : k ∈ N} and {4k : k ∈ N}. This means that a = 2K + 1 for some K ∈ N and a = 4L for some L ∈ N. Combining these facts leads us to conclude that 2K + 1 = 4L, which implies that

2(2L − K) = 1

or 2L − K = 1/2. We know (from closure properties of Z) that it is impossible to subtract integers and get a result that is not an integer, so this last statement is absurd. Hence, there is no such number a ∈ N. That is, {2k + 1 : k ∈ N} ∩ {4k : k ∈ N} = ∅. ∎
(©) Proof by contradiction, Assume that there i an cl. 

cement (a, b) € R x R in both the set ((x, y) € R x R 

y) and the set ((x, y) e Rx Ria — 

y]. This means that 














0, which the 
quadratic formula tells us has no real solutions, a con- 
tradition to the fact that a € R. " 

13. (a) Proof. Let sets A and B of elements in U be given such that A ∩ B = A. To show that A' ∪ B = U, we must show that A' ∪ B ⊆ U and U ⊆ A' ∪ B. Since all sets are subsets of the universal set U, we know that A' ∪ B ⊆ U is true, so we only need to establish U ⊆ A' ∪ B using an element-wise proof.

Let x ∈ U be given. By the definition of complement, we know that either x ∈ A or x ∈ A', so we consider each case separately to establish that the given x must be in A' ∪ B.

• If x ∈ A, then since we are given that A ∩ B = A, we know that x ∈ A ∩ B, from which it follows that x ∈ B. But Practice Problem 1 can be used to establish that B ⊆ A' ∪ B, so we conclude that x ∈ A' ∪ B, as desired.

• If x ∈ A', then since A' ⊆ A' ∪ B (again by Practice Problem 1), we know that x ∈ A' ∪ B, as desired.

In either case, we see that x ∈ A' ∪ B.

This establishes that U ⊆ A' ∪ B, and hence we conclude that A' ∪ B = U. ∎
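The statement just proved can be tested exhaustively on a small universal set. This is a sketch under the assumption that a four-element universe is representative enough for a sanity check; subsets and claim_holds are hypothetical names.

```python
from itertools import combinations


def subsets(universe):
    """All subsets of the given finite set, as frozensets."""
    items = sorted(universe)
    return [frozenset(c)
            for r in range(len(items) + 1)
            for c in combinations(items, r)]


def claim_holds(universe) -> bool:
    """Whenever A ∩ B = A, check that A' ∪ B equals the universe."""
    U = frozenset(universe)
    for A in subsets(U):
        for B in subsets(U):
            if A & B == A and (U - A) | B != U:
                return False
    return True
```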

14. (a) The properties to use are distributive, identity or universal bound, and identity.

(c) The properties to use are distributive, commutative, and identity.

15, (a) (ANH)U(ANU) =A 
(b) A ∪ (A' ∩ B) = A ∪ B
(c) A ∩ (A' ∪ B) = A ∩ B
(d) (A ∩ B) ∪ (B ∩ C) = (A ∪ C) ∩ B
(©) (AN BUA UCY = AN(BUC) 

16, (a) Use properties d, e, c, e, and d for the steps. 
b) Use properties d, e, c, a, d, and e for the steps. 
e) Use prope a i, and d for the steps 

18. (a) If A ∪ B = B, then A ∩ B = A.

Proof. Let sets A and B be given such that A ∪ B = B. Then the following holds true:
B = A ∪ B                  given
A ∩ B = A ∩ (A ∪ B)        substitution
A ∩ B = A                  absorption

(b) If A ∩ B = A, then A ∪ B = B.

Proof. Let sets A and B be given such that A ∩ B = A. Then the following holds true:
A = A ∩ B                  given
A ∪ B = (A ∩ B) ∪ B        substitution
A ∪ B = B ∪ (B ∩ A)        commutative
A ∪ B = B                  absorption


(c) If A ∩ B = A, then A' ∪ B = U.
Proof. Let sets A and B be given such that A ∩ B = A. Then the following holds true:
A ∩ B = A                      given
A' ∪ (A ∩ B) = A' ∪ A          substitution
A' ∪ (A ∩ B) = U               negation
(A' ∪ A) ∩ (A' ∪ B) = U        distributive
(A ∪ A') ∩ (A' ∪ B) = U        commutative
U ∩ (A' ∪ B) = U               negation
(A' ∪ B) ∩ U = U               commutative
A' ∪ B = U                     identity


21. (a) Proof. Suppose A ⊆ B, and let (a, c) ∈ A × C, for some set C. By definition of A × C, a ∈ A and c ∈ C.


Section 3.4 Exercises 

1. (a) (a:b) +a =a; (e) (a -bY -(a- b) =a; (e) (a 
by +(a+b)=1 

2 (a) as (b +a) (c) (a-b)-(a c) =a + (bi cy 
de) (a bY ea + (ab) 

3 (a) Use properties i, a, d, and d. (b) Use properties c, c, 











a and d. 
4. (a) Claim: (a + b)·(b + c) = a·c + b
Proof. Version 1.
(a + b)·(b + c) = (a + b)·b + (a + b)·c    Distributive (c)
                = b·(b + a) + c·(a + b)    Commutative (a)
                = b + c·(a + b)            Absorption
                = b + c·a + c·b            Distributive (c)
                = b + b·c + a·c            Commutative (a)
                = b + a·c                  Absorption
                = a·c + b                  Commutative (a)
Version 2.
a·c + b = b + a·c                          Commutative (a)
        = (b + a)·(b + c)                  Distributive (c)
        = (a + b)·(b + c)                  Commutative (a)
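The identity in the claim can be checked on the two-element Boolean algebra {0, 1}, reading + as OR and · as AND. This is only an illustration on one particular Boolean algebra, not a substitute for the property-based proof; lhs, rhs, and identity_holds are names chosen for this sketch.

```python
from itertools import product


def lhs(a: int, b: int, c: int) -> int:
    """(a + b) · (b + c) with + as OR and · as AND."""
    return (a | b) & (b | c)


def rhs(a: int, b: int, c: int) -> int:
    """a · c + b with + as OR and · as AND."""
    return (a & c) | b


def identity_holds() -> bool:
    """Compare both sides on all eight assignments of 0/1 to a, b, c."""
    return all(lhs(*v) == rhs(*v) for v in product((0, 1), repeat=3))
```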


(c) Claim: (a + b)(a'c')' = a + bc


Since A ⊆ B, a ∈ B, and (a, c) ∈ B × C, proving that (A × C) ⊆ (B × C).

(b) Proof. Suppose A ⊆ B, and let C ∈ P(A); that is, let C be a subset of A, written C ⊆ A. Since we are assuming A ⊆ B, we have that C ⊆ B as well. Hence, C ∈ P(B), proving that P(A) ⊆ P(B).

23. (a) Proof. Let C ∈ P(A) ∩ P(B); that is, let C ⊆ A and C ⊆ B. Thus, C ⊆ A ∩ B and so C ∈ P(A ∩ B), establishing that P(A) ∩ P(B) ⊆ P(A ∩ B). Conversely, let C ∈ P(A ∩ B), so that C ⊆ A ∩ B. Well then, C ⊆ A and C ⊆ B, so C ∈ P(A) and C ∈ P(B), from which it follows that C ∈ P(A) ∩ P(B). This establishes the inclusion P(A) ∩ P(B) ⊇ P(A ∩ B), so that we now have that P(A) ∩ P(B) = P(A ∩ B).
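The power-set identity can be seen on a concrete pair of sets. The sketch below uses frozensets so that subsets can themselves be set elements; power_set is a name introduced for the example.

```python
from itertools import combinations


def power_set(s) -> set:
    """The set of all subsets of the finite set s, each as a frozenset."""
    items = sorted(s)
    return {frozenset(c)
            for r in range(len(items) + 1)
            for c in combinations(items, r)}


A = {1, 2, 3}
B = {2, 3, 4}
# P(A) ∩ P(B) and P(A ∩ B) should be the same collection of subsets.
print(power_set(A) & power_set(B) == power_set(A & B))
```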





Proof.

(a + b)(a'c')' = (a + b)(a'' + c'')    DeMorgan (h)
               = (a + b)(a + c)        Double Negation (f)
               = a + bc                Distributive (c)


5. (a) Use properties d, c, c.
6. (a) Proof. Let a and b be given such that a + b = b. Then

a·b = a·(a + b)    Since b = a + b
    = a            Absorption





(c) Proof. Let a and b be given such that a' + b = 1. Then

a·b' = (a')'·b'    Double Negative (f)
     = (a' + b)'   DeMorgan (h)
     = 1'          Since a' + b = 1
     = 0           Complements (k)





7. (a) becomes "If AU B= B, then AU(B- A) = 
and 
(b) becomes "If AC B = A, then B — (B — A) 
Itis easier to prove with the abstract properties than 
using definitions of sets 











9. (a) The completed tables are given below: 









































11. Properties (a), (c), (d), and (e) are all true, so this structure is a Boolean algebra.
12. (a) A check mark indicates those pairs where L(a, b) 
is true. 




















(b) L(a, b) is true precisely when a evenly divides b. 
13. (a) Proof.
a = a·1            (d) Identity
  = a·(a + a')     (e) Negation
  = a·a + a·a'     (c) Distributive
  = a·a + 0        (e) Negation
  = a·a            (d) Identity



Section 3.5 Exercises 



























































































































































1. (a) Distributivity: The columns for a·(b + c) and (a·b) + (a·c) are identical.

a b c | b+c | a·(b+c) | a·b | a·c | (a·b)+(a·c)
0 0 0 |  0  |    0    |  0  |  0  |     0
0 0 1 |  1  |    0    |  0  |  0  |     0
0 1 0 |  1  |    0    |  0  |  0  |     0
0 1 1 |  1  |    0    |  0  |  0  |     0
1 0 0 |  0  |    0    |  0  |  0  |     0
1 0 1 |  1  |    1    |  0  |  1  |     1
1 1 0 |  1  |    1    |  1  |  0  |     1
1 1 1 |  1  |    1    |  1  |  1  |     1

[Truth tables for the remaining parts of Exercise 1]






































ipo[o|1]o[| 1 





























alitatotol 1 


4. (2) a + b; (e) c'b + abe; (e) a 

5. (a) able +abe; (b) abe +a'be + be deat 
(abe +a'be + abc + abe 

8. (a) Equivalent; (c) Equivalent. 


——————— 







9. (a) zi (e) (e) yz 1L (a) xy x yz yews (stay ey 


10.6) rzw tays tya () wryw 12 (a) xy + aw’ Riyctrzu 
xy w^ e) Y y yu Y zu xy w' + y zw 13 (a) ye y+ ew 





Section 4.1 Exercises 














[Arrow diagram for f]


6. The complete diagram follows 





2 Answers will vary (a) Domain #-1)} 


Codomain R; (e) Domain (x € R : x = —|], Codomain 0 
ly € R: y= O); (e) Domain R, Codomain [y € R 0 - —a iki 


yell} fab 
3, The complete diagram is shown below o -n 























[Arrow diagram for f]

4. The complete diagram is shown below. Once again, a "loop" at a value indicates that the function maps that value to itself.




8. The diagrams follow. 








[Arrow diagrams]








10. (a) Yes, this is a function.
(b) No. The element 1 in the domain is mapped to 2 and 5 in the codomain.
(c) No. The rational number 0.5 can be written as 1/2 or 2/4 (as well as many other ways), hence the domain element 0.5 is mapped to 1 and 2 (among other values) in the codomain.
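The failure in the last part can be demonstrated concretely. The sketch below assumes the proposed rule sends a fraction written as p/q to its numerator p, which is what the values 1 and 2 suggest; numerator_as_written is a hypothetical name.

```python
from fractions import Fraction


def numerator_as_written(p: int, q: int) -> int:
    """The proposed rule: send the written fraction p/q to p."""
    return p


# The two written forms name the same rational number...
same_number = Fraction(1, 2) == Fraction(2, 4)
# ...but the rule gives them different outputs, so it is not a function on Q.
outputs = {numerator_as_written(1, 2), numerator_as_written(2, 4)}
print(same_number, outputs)
```

A rule becomes a genuine function only if it gives the same output for every way of writing the same input, for instance by always reducing to lowest terms first.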
12. (a) and (c)


16. Fill in the blanks as follows, Claim (i): Q; Q; 3a +9; 
a= i gb) =a; - 3b 
Ma+3)=30+9, 

17. (a) s) = hy +2 
W s) -iv-4 














A | " m m l 6 [ u2 | uaa | ea | 023 
wo aa i aza | e | us Taa elu m 4 
p p (©) g(y) =3y+3 
pd a @ a) 7-5 
19. (a) Let a ∈ Q and b ∈ R be given such that f(a) = b. That is, √a = b. From this it follows that a = b^2 and hence a = g(b).
(b) One counterexample is a = 4 and b = −2.
(c) There are g-arrows, like from −2 to 4, that when reversed do not correspond to any f-arrow.
21. The diagrams follow.

[Arrow diagrams]






















23. (a) [Diagram]
(b) [Diagram]


26. (a) yes; (b) yes; (c) no; (d) yes


Section 4.2 Exercises 











L (a) 22° — 15 by dy +3; (e) 2 Put of 9, for example. The diagrams for f, fo f, and 
3 (9 f(1) = 1 and g(11) =5 f s f © f are given below. (In each case, the "loop" at 
(b) The diagrams for / and g appear below 4 indicates an arrow pointing from 4 to itself.) 








N o, 
10 m 
tet .5 

2e] 26 1 Mos th 

LEE pPT—CP 
M 

a 6 8 D 
(c) Only g f is defined. wr 





(@) The function (g o f) :(1.2,3] — (4.5, 6] has the 
following arrow diagram: 





DS 





gef 


5. [is not invertible since /(0) = f(9) = f(5)= f(4) = 
4. for example, and there is no input that has the out- 







ace ren 
NC ue R 
w teror 





6. The three pictures below provide the missing diagrams. 
In each case, there is only one possible answer, 





(go Nforpan(a)  gtorpart (0) Hor part (o) 
15, Here is the complete picture: 
8, Parts (b) and (c) are impossible. For part (a), go 

f has domain [1, 2,3,4}, codomain (x. y, z), and rule 

11.9.0. 3) 3). 4.9). 

16. (a) f(11011) = 4, f(01101) = 3, and f(11000) = 2. This function is not invertible because it is not one-to-one. For example, f(11011) = f(11110).

(b) g(0) = 00000, g(2) = 11000, and g(4) = 11110. This function is not invertible because it is not onto. For example, there is no value n for which g(n) = 10101.

(c) • (f ∘ g)(2) = f(g(2)) = f(11000) = 2
• (f ∘ g)(0) = f(g(0)) = f(00000) = 0
• (g ∘ f)(11010) = g(f(11010)) = g(3) = 11100
• (g ∘ f)(11100) = g(f(11100)) = g(3) = 11100
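The listed values are consistent with one natural reading of the two rules: f counts the 1s in a five-bit string, and g(n) writes n ones followed by zeros. That reading is an assumption inferred from the answers, not quoted from the exercise. Under it, the computations can be replayed as follows.

```python
def f(bits: str) -> int:
    """Assumed rule: the number of 1s in the bit string."""
    return bits.count("1")


def g(n: int, length: int = 5) -> str:
    """Assumed rule: n ones followed by zeros, padded to the given length."""
    return "1" * n + "0" * (length - n)


print(f("11011"), f("01101"), f("11000"))  # values from part (a)
print(g(0), g(2), g(4))                    # values from part (b)
print(f(g(2)), g(f("11010")))              # compositions from part (c)
```

Under these rules f ∘ g returns its input unchanged, while g ∘ f rearranges a string so that its 1s come first, which is why neither function is invertible on its own.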

(a) No. 

11. Fill in the blanks with: a; f(a); 
13. Here are the diagrams for R: 















b; a; b; fog. 











17. (x, y) ∈ R ∘ R means x beat at least one team that beat y.

19. Student x is in instructor b's course.
21. (u, v) ∈ R means that production facility u produces machine part v.


Section 4.3 Exercises 


1. (a) There is no x ∈ R for which f(x) = −4 since the equation x^2 + 4x + 1 = −4 has no real solutions, by the quadratic formula; (c) There is no x ∈ [1, ∞) for which
h(x) =2 since the equation z1 = 2 can only have so- 
lution ~$, which is not in [1, oc). 

2. (a) −1 and −3; (e) 0 and −1

3. (a) One-to-one and onto, and hence invertible; (c) One- 
to-one but not onto 

4. (a) not 1 —1 but onto 

S. (a) yes (b) yes (e) c? =e 

7. Fillin the blanks with: h(x) = h(a2);*f is one-to-one"; 

10. (a) Consider any player of the game. Since the cards were evenly divided, the players all have at least 17 cards (they have 17, 17, and 18). Let A be the set of cards the player has, and let B be the set of possible values (i.e., {ace, king, ..., 4, 3, 2}). Define f by f(x) = the value of the card x. Since A is size 17 or 18, and B is size 13, f is not 1 − 1. So there are two cards in A of the same value.






















2 (a) 6 M 6 5 
26. (a) (x. y) € Ry © Rj" if classes x and y have a student in 
common, and (x. y) € Rj! © Rj if student x and student. 
y have a class in common. 
(©) (x. y) € Rs c Rs! if integers x and y are such that 
x= y, and Ry! c Ry relates each nonnegative perfect 
square to itself. 





(c) Let A = {a, b, c, d, e} be the set of five positive integers. Let B = {7, 9, 3, 1}. Define f by f(x) = the ones' digit of 7^x. Since A has size 5 and B has size 4, f is not 1 − 1. Thus there are two numbers y, z in the set A for which 7^y and 7^z have the same ones' digit.

(e) Let A be the set of numbers in the phone number, and let B = {0, 1, 2, 3, 4, 5}. Define f by f(x) = x mod 6 (that is, the remainder when x is divided by 6). Since A is size 7 and B is size 6, f is not 1 − 1. For the two numbers x and y for which f(x) = f(y), x − y is divisible by 6.
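The pigeonhole argument for powers of 7 can be acted out on any five exponents. The sketch is illustrative; ones_digit_of_power and find_collision are hypothetical names, and the sample exponents are arbitrary.

```python
def ones_digit_of_power(x: int) -> int:
    """The ones' digit of 7**x, computed without building the huge power."""
    return pow(7, x, 10)


def find_collision(exponents):
    """Return the first pair (y, z) whose powers share a ones' digit."""
    seen = {}  # ones' digit -> first exponent that produced it
    for x in exponents:
        d = ones_digit_of_power(x)
        if d in seen:
            return seen[d], x
        seen[d] = x
    return None
```

Since only four digits are possible, any list of five exponents must produce a pair; with [2, 3, 5, 8, 11] the pair found is (3, 11).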

11. Proof. Let x_1, x_2 ∈ Z. Assume that f(x_1) = f(x_2). Then 2x_1 + 3 = 2x_2 + 3. This implies that 2x_1 = 2x_2 and it follows that x_1 = x_2. Therefore f is 1 − 1.

14. (a) Let x_1, x_2 ∈ [0, ∞) be given such that f(x_1) = f(x_2). That is, x_1^2 + 4 = x_2^2 + 4, so x_1^2 − x_2^2 = 0, hence either x_1 = x_2 or x_1 = −x_2. Since x_1 and x_2 are both from [0, ∞), they must have the same sign, so only the option x_1 = x_2 is possible.

























(b) Let y ∈ [4, ∞) be given, and set x = √(y − 4). We can verify with algebra that

g(x) = (√(y − 4))^2 + 4
     = (y − 4) + 4 = y.
Hence, y is an output of the function g. Therefore g is onto.
(c) g^{-1}(y) = √(y − 4)





16. (a) Proof. Let x_1, x_2 ∈ R be given such that f(x_1) = f(x_2). That is, x_1^3 − 2 = x_2^3 − 2, or x_1^3 = x_2^3. Taking the cube root of both sides tells us that x_1 = x_2, as desired. ∎

(b) Proof. Let y ∈ R be given, and set x = ∛(y + 2), a real number. Then f(x) = x^3 − 2 = (y + 2) − 2 = y. Hence, y is an output of the function f. ∎

(c) g(y) = ∛(y + 2) is the inverse of f.
18. (a) Proof. Let x_1, x_2 ∈ (1, ∞) be given such that f(x_1) = f(x_2). That is, x_1/(x_1 − 1) = x_2/(x_2 − 1). Multiplying both sides by (x_1 − 1)(x_2 − 1) gives us x_1x_2 − x_1 = x_1x_2 − x_2, from which it follows that x_1 = x_2, as desired. ∎

(b) Proof. Let y ∈ (1, ∞) be given, and set x = y/(y − 1). Since y > y − 1 > 0, it follows that y/(y − 1) > 1. Hence, x ∈ (1, ∞), and

f(x) = [y/(y − 1)] / [y/(y − 1) − 1] = [y/(y − 1)] / [1/(y − 1)] = y.

Therefore, y is an output of the function f. ∎

(c) f^{-1}(y) = y/(y − 1)


Section 4.4 Exercises 


(a) This relation is transitive, antisymmetric and reflex- 
ive, 





8 


(©) This relation is transitive, but not antisymmetric 
since (2,8) € R and (8,2) € R, and not reflexive since 
6.5) gR. 


19. (a) Let (x_1, y_1), (x_2, y_2) ∈ A be given such that f(x_1, y_1) = f(x_2, y_2). That is, 2^{x_1}·5^{y_1} = 2^{x_2}·5^{y_2}, which implies that 2^{x_1 − x_2} = 5^{y_2 − y_1}. The only number that is a power of 2 and a power of 5 is the number 1, so (x_1, y_1) = (x_2, y_2).

(b) Let n be a positive factor of 500. Since 500 = 2^2 × 5^3, then n must have the form 2^a × 5^b where a ∈ {0, 1, 2} and b ∈ {0, 1, 2, 3}. That is, n = f(a, b). Hence, n is an output of the function f.
(c) Theorem 7
(d) n(A) = 3 × 4 = 12
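The bijection between exponent pairs and positive factors of 500 can be checked by direct enumeration. This sketch assumes A = {0, 1, 2} × {0, 1, 2, 3} and f(a, b) = 2^a · 5^b, as the answer indicates; the helper names are made up.

```python
def f(a: int, b: int) -> int:
    """Send the exponent pair (a, b) to the number 2**a * 5**b."""
    return 2 ** a * 5 ** b


def positive_factors(n: int) -> set[int]:
    """All positive divisors of n, found by trial division."""
    return {d for d in range(1, n + 1) if n % d == 0}


def images() -> set[int]:
    """The outputs of f over all pairs with a in 0..2 and b in 0..3."""
    return {f(a, b) for a in range(3) for b in range(4)}
```

The twelve pairs give twelve distinct outputs, and those outputs are exactly the factors of 500.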
21. Since f(abc) = f(acb) = ab, f is not 1 − 1. However, f is onto. Given a word y of length 4 or less over the alphabet {a, b}, form the word x by putting the letter c at the end of the word y. (For example, if y is abab, then x would be ababc.) Then x is a word of length 5 or less over the alphabet {a, b, c}, and f(x) = y. The set A is bigger.

24 (u) Let $ € P((1,2}), Set T= S U {3} Since $ 

it follows that T € (1,2, 3). Moreover, f(T) 
(3) =$. Hence, set ove ee Snap 
© f(1,2]) = £(01,2,3) 2) 
(e) Given S e (0.2), epe PSU) = fS) 















w Since every element of the codomain has exactly 
two arrows pointing to it, we conclude that the domain has twice as many elements as the codomain.
28. The function f : R⁺ → R with the rule f(x) = x is one-to-one, and the function g : R → R⁺ with the rule g(y) = 2^y is one-to-one. Hence, by Theorem 9, R⁺ and R have the same size.

29 B= cl 








2. (a) Reflexive 


3. (a) Since (1, 3) ∈ R_1 and (3, 1) ∈ R_1, R_1 is not antisymmetric.


4. (a) Yes 
5. (a) The blanks might be filled in as follows. 





ee 




9 Ri = (a,b) €A x A;a isa proper factor of, — (e) Not reflexive, since (1) N (1) # V. Not antisymmet- 
RA proper Echos OE 











| where A = (1.2,3,4,6, 2] Tic, since (1}R(2) and (2}R(1}. Not transitive, since 
© R-(abeBxBiasb, where B= (REQ) and (2) R(1),but not [1] RTT. 
12.3.45] 


(b) Neither relation is reflexive, We can extend these 


relations to the reflexive relations R; and Ri; ore esr wi minio 


not transite 





17. (a) If R is reflexive, then R^{-1} is reflexive.
Proof. Let the reflexive relation R on the set A be given. Let a ∈ A be given. Since R is reflexive, we know (a, a) ∈ R, from which it follows (by reversing the coordinates) that (a, a) ∈ R^{-1}. Hence, R^{-1} is reflexive. ∎

(b) If R is antisymmetric, then R^{-1} is antisymmetric.
Proof. Let the antisymmetric relation R on the set A be given. Let a, b ∈ A be given such that (a, b) ∈ R^{-1} and (b, a) ∈ R^{-1}. By definition of R^{-1}, this means that (b, a) ∈ R and (a, b) ∈ R, and since R is antisymmetric, it follows that a = b. Hence, R^{-1} is antisymmetric. ∎

(c) If R is transitive, then R^{-1} is transitive.
Proof. Let the transitive relation R on the set A be given. Let a, b, c ∈ A be given such that (a, b) ∈ R^{-1} and (b, c) ∈ R^{-1}. This means that (b, a) ∈ R and
































(c, b) ∈ R, and the transitivity of R tells us that (c, a) ∈ R. From this, it follows that (a, c) ∈ R^{-1}. Hence, R^{-1} is transitive. ∎

can be interpreted as follows: (x
is "x divides y"; and (x, y) ∈
(c) Both of the relations are transitive.
7. (a) irreflexive, antisymmetric; (b) irrefles s 20. (a) (C0, 1), (1,2). (0.3). (1,4). (0,0), (1, 1). (2.1), (2, 2)]: 
antisymmetric b) (C. 2)}5 €) ((0, 1), (0,3), 1,4), (0,0)] 
9. (a) reflexive, transitive; (b) transitive, antisymmetric; 21- (a) Proof. Let R and Rs be reflexive relations on the 
(c) antisymmetric, transitive, irreflexive set A, Leta € A be given. Since R, and A; are rellexive, 





we know that (a, a) € Ri and (a, a) € Rs, from which it 
follows that (a, a) € Rj U Rs, Hence, Ri U Ry is reflex: 





11. transitive. 
(a) There is a loop at every point. (b) There are no 2- 





Way streets (e) If there's an arrow fromatobandone — "^ s 
from b to c, then there must be an arrow from a to c 
15, 

Arrow diagram - 

rollexve, anteymmetr T a 

transitve Fes saci 

(12) ea 
[7 
a 
w 0 










(e) This statement is false, as we can see from an example of relations on the set A = {1, 2, 3}. Let R_1 = {(1, 2), (1, 3), (2, 3)} and R_2 = {(2, 1), (3, 1), (3, 2)}. Both of these relations are antisymmetric, while their








Section 4.5 Exercises 


1. Relation R is symmetric and reflexive as indicated by 
the "double arrows” and the “loops” respectively, in the 
figure below. The figure also shows that relation R isnot 
symmetric since (1, 3) e R, but (3, 1) ¢ Rs for example, 
and Rs is not reflexive since (3, 3) ¢ Rs for example. 





oo 


Relation A; 


Relation As 
2. (a) Yes: (c) No, since (2, 1) € Rs but (1,2) ¢ R. 
3. (a) symmetric; (c) symmetric 
4. (a) This is not a partition since not every element is a 
set. (c) This is a partition of A. 
6. {{ABCD, BCDA, CDAB, DABC}, {ABDC, BDCA, DCAB, CABD}, {ACBD, CBDA, BDAC, DACB}, {ACDB, CDBA, DBAC, BACD}, {ADBC, DBCA, BCAD, CADB}, {ADCB, DCBA, CBAD, BADC}}
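The six classes above are what one gets by grouping the 24 arrangements of A, B, C, D so that two arrangements are related when one is a rotation of the other. That reading of the relation is an assumption inferred from the listed classes. A sketch that regenerates them:

```python
from itertools import permutations


def rotations(word: str) -> frozenset:
    """All cyclic rotations of the word, e.g. ABCD -> BCDA -> CDAB -> DABC."""
    return frozenset(word[i:] + word[:i] for i in range(len(word)))


def rotation_classes(letters: str) -> set:
    """Partition of all arrangements of the letters into rotation classes."""
    return {rotations("".join(p)) for p in permutations(letters)}
```

Each class has four members, so 24 arrangements split into 24 / 4 = 6 classes.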

8. (a) Reflexive, not symmetric, transitive.

[Arrow diagram]


(c) Not reflexive, symmetric, not transitive. 


union
R_1 ∪ R_2 = {(1, 2), (1, 3), (2, 3), (2, 1), (3, 1), (3, 2)}

is clearly not antisymmetric, having (1, 2) and (2, 1) both within, for example.
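The counterexample can be verified mechanically. The sketch below uses the two relations from the answer; is_antisymmetric is a helper written for this illustration.

```python
def is_antisymmetric(rel: set) -> bool:
    """True when (a, b) and (b, a) both in rel forces a == b."""
    return all(a == b for (a, b) in rel if (b, a) in rel)


R1 = {(1, 2), (1, 3), (2, 3)}
R2 = {(2, 1), (3, 1), (3, 2)}
# Each relation passes on its own, but the union contains (1, 2) and (2, 1).
print(is_antisymmetric(R1), is_antisymmetric(R2), is_antisymmetric(R1 | R2))
```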





[Arrow diagram]


10. (a) Proof. Let a ∈ A be given. Since a − a = 0 and 0 is divisible by 3, it follows that (a, a) ∈ R.
(b) Proof. Let a and b ∈ A be given such that (a, b) ∈ R. This means that a − b = 3·K for some integer K. In this case, b − a = 3·(−K), so b − a is divisible by 3. This means that (b, a) ∈ R.
(c) Proof. Let a, b, c ∈ A be given such that (a, b) ∈ R and (b, c) ∈ R. That means a − b = 3·K and b − c = 3·L for some integers K and L. In this case,

a − c = (a − b) + (b − c)
      = 3K + 3L
      = 3(K + L).

Hence, a − c is divisible by 3, so (a, c) ∈ R.
(d) {{0, 3, 6}, {1, 4}, {2, 5}}
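The equivalence classes can be computed directly from the relation. This sketch assumes A = {0, 1, ..., 6}, which matches the classes listed; related and equivalence_classes are names introduced for the example.

```python
def related(a: int, b: int) -> bool:
    """(a, b) is in R exactly when a - b is divisible by 3."""
    return (a - b) % 3 == 0


def equivalence_classes(A) -> list:
    """Group the elements of A into classes of mutually related elements."""
    classes = []
    for a in sorted(A):
        for cls in classes:
            # Any member works as a representative since R is an equivalence.
            if related(a, next(iter(cls))):
                cls.add(a)
                break
        else:
            classes.append({a})  # a starts a new class
    return classes
```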

12. Each person who has a son is related to himself/herself, 
and to his/her spouse (if in the set £). This relation is 
not reflexive, but it is symmetric and transitive. 

13. (a) A relation is reflexive if there is a loop at every node.
(b) A relation is symmetric if only double arrows are necessary because all arrows point both ways.

(c) A relation is transitive if every time you see [diagram] you must also see [diagram], and every pair of nodes connected by "double arrows" have a loop at each node.

15. (a) Proof. Let the symmetric relation R on the set A
be given. Let $a, b \in A$ be given such that $(a, b) \in R^{-1}$.
This means that $(b, a) \in R$, and since R is symmetric, it
follows that $(a, b) \in R$. But this means that $(b, a) \in R^{-1}$.
Hence, $R^{-1}$ is symmetric.


(c) Proof. Let R be any relation on A. Let $a, b \in A$
be given such that $(a, b) \in R \circ R^{-1}$. This means that, for
some $c \in A$, we have $(a, c) \in R^{-1}$ and $(c, b) \in R$. This
implies that $(c, a) \in R$ and $(b, c) \in R^{-1}$, from which we
can conclude that $(b, a) \in R \circ R^{-1}$. Therefore, $R \circ R^{-1}$
is symmetric.

16. (a) ((0.0). (0, 1), (0.2). (1,2). (2,1), 3,3). (3,4), (4,3), 

(4. Dk ©) {0, 2). B, 4). (4, 3); © {(0. 0). 2.1), 

G3, 3), (4. 4)): @ ((0, 1), (1, 2). (0,2). (3. 4), (4,3), 

G. 3), (4. 4)}; €) (0. 0). (1. 1). (1,2). (2, 1), (2, 2). 
(3,3). (3.4), (4, 3), (4.4) 

18. (a) $R_1 = \{(1,1), (2,2), (3,3), (4,4), (5,5)\}$ and $R_2 = \{(1,2), (2,1)\}$. Note that the empty set is technically

















Section 4.6 Exercises 


1. The two columns at the right are identical as predicted 
by the Example. 











x p =e 
os [as | a 
wa | 04 | n | 








-4 A 4 4 




















a7 a 17 


2 (a) False, x =23, y= 
448 





: (c) False. x = 1.5 











6. 955 
8. 19729 
10. We have

$$\frac{\log_{10}(3^n)}{\log_{10}(2^n)} = \frac{n \cdot \log_{10}(3)}{n \cdot \log_{10}(2)} = \frac{\log_{10}(3)}{\log_{10}(2)} \approx 1.58$$





Section 4.7 Exercises 


1. (a) 34; 96. (b) For example, 1 is not in the range. For
f(n) to be 1, n would have to be spelled "A," and no
number has that spelling. (c) Since f(255) = 240 and
f(240) = 216, this completes the cycle. (d) It takes too
long to compute f for one thing, but more importantly
f has no fixed point so you cannot know the value at
any time.

2. (a) If a card is in position x, then after the deal into
three face up piles, the card will be in position $\lceil x/3 \rceil$ of





Selected Answers 659 





a symmetric (and transitive) relation, but it is not 


reflexive. 






(©) Ri = (01. 1). (2.2). (2,3), 3.3). (4,4).(5.5)] and 
Ra = (1,2). 2.1) 
© R=...) and R= 
14.1). (1.3). G. 1). G.3)] 
19. (a) {(0. 1). (1.0). (0.0). (1,2). (2, 1), (0.3). (3.0) 





(2. 2): (b) Yes: (€) If Ris symmetric, then R = R^ 


RUR” =R. 
22. Proof. Let R be a relation on a set A with the prop- 
erty that R = R”, and let a, 6 € A be given such that 
(a. b) € R. From this we can conclude that (b. a) e R^. 
However, since R= R^! this tells us that (b,a) € R. 
Therefore, Ris symmetric. = 








so the number of digits in $3^n$ is roughly 1.6 times the
number of digits in $2^n$.

11. Let $a = \log_2 10$ and $b = \log_{10} 2$. This means $2^a = 10$
and $10^b = 2$. So $(2^a)^b = 10^b = 2$, which means $a \cdot b = 1$.
Hence, $a = \frac{1}{b}$, as desired.

13. (a) 666: (b) 400; (c) 285 

15. (a) 933: (c) 1200: (e) 1466 

18. Since 100! = 100 -99!, 1001 is 99! with 2 more zeroes on 
the right. 

19. 158 

zn 

24, 992 

27. (a) 494 














28. (a) 3768 


11, 13,17, 19, 23, 29.and 31 
) True 
(a) Pow(x. Sum 
(a) False. x 









.3)) = Prod(Pow(x. y), Pow(x, z)) 
y=2:(€) False. x — 5, y 


3L 
x. 
ES 





its particular pile. To see this, just note that cards 1, 2,
and 3 will be in position 1 of their piles; cards 4, 5, and 6
will be in position 2; and so on. The formula $\lceil x/3 \rceil$ yields
precisely these results. Since the magician places a pile
with 11 cards on top after dealing all the cards, the card
in position x before the deal is in position $11 + \lceil x/3 \rceil$
after the deal.
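The position formula can be iterated to see where a card ends up after several deals. The sketch below is illustrative only: the deck size of 33 (three piles of 11) is an assumption inferred from the "11 cards on top," and the function names are hypothetical.

```python
TOP = 11        # cards placed on top of the chosen pile (from the answer above)
DECK_SIZE = 33  # assumed deck size: three piles of 11 cards


def new_position(x):
    """Position after one deal of the card that was in position x."""
    return TOP + -(-x // 3)  # -(-x // 3) is the integer ceiling of x / 3


def trajectory(x, deals):
    """Positions of a card before the first deal and after each of `deals` deals."""
    positions = [x]
    for _ in range(deals):
        positions.append(new_position(positions[-1]))
    return positions
```

Under these assumptions, position 17 is a fixed point of the formula, and every starting position reaches it within four deals.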

3. The only two cycles are 1, 6, 3, 8, 4, 2, 1 and 5, 10, 5. Formally
we prove the following by induction: For every
$n \ge 1$, the iterated function sequence (i.f.s.) for g starting
with n leads to one of these two cycles.
Proof. Let P(n) be the statement, "The i.f.s. for g
starting with n leads to one of these two cycles."
The numbers in the two given cycles illustrate that
P(1), P(2), ..., P(6) are all true. Let $m \ge 7$ be given
such that the statement P(m) is the first one not yet
checked. We argue in two cases depending on whether
m is even or odd.

Case 1: If m is even, then $g(m) = m/2$, and the i.f.s.
starting with m consists of m followed by the
i.f.s. starting with m/2. Since $m/2 < m$, we know
statement P(m/2) has already been checked to
be true, so the i.f.s. starting with m/2 leads to
one of the two given cycles.

Case 2: If m is odd, then $g(m) = m + 5$ and
$g(m + 5) = \frac{m+5}{2}$, and the i.f.s. starting with m
consists of m followed by m + 5 followed by
the i.f.s. starting with $\frac{m+5}{2}$. Since $\frac{m+5}{2} < m$ (since
$m > 5$) we know statement $P\left(\frac{m+5}{2}\right)$ has already
been checked to be true, so the i.f.s. starting with
$\frac{m+5}{2}$ leads to one of the two given cycles.

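The induction above can be spot-checked numerically. The following sketch assumes, as the two cases of the proof suggest, that g halves even numbers and adds 5 to odd numbers; the names `g` and `reaches_cycle` are illustrative.

```python
CYCLE_A = {1, 6, 3, 8, 4, 2}  # the cycle 1, 6, 3, 8, 4, 2, 1
CYCLE_B = {5, 10}             # the cycle 5, 10, 5


def g(n):
    """Assumed rule: n/2 for even n, n + 5 for odd n."""
    return n // 2 if n % 2 == 0 else n + 5


def reaches_cycle(n, max_steps=1000):
    """Return 'A' or 'B' for the cycle reached from n, or None if none is found."""
    for _ in range(max_steps):
        if n in CYCLE_A:
            return "A"
        if n in CYCLE_B:
            return "B"
        n = g(n)
    return None
```

For example, `reaches_cycle(7)` follows 7, 12, 6 into the first cycle, while `reaches_cycle(15)` follows 15, 20, 10 into the second.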
5, (a) There isa fixed point at 1 and cycles 2 > 3+ 2and 
452129 6-4. 

6 ES Ig and GA 8 

& 1-041 

10, 132 — 1532 and 44 — 4. There are no cycles. 
with period two. 








Section 4.8 Exercises 
1. The completed table is shown below. 



































2. (a) doubled; (c) roughly doubled; (e) roughly multiplied
by 4; (g) roughly squared.

4, (a) 2-1-5 2n41<25-n for all n> 2; (6) linen 
vii <2-nforalln> 1. 

S (b) 101-7 < 2-1? for alln > 6, 

6. (a) 3n—7 > 2-nforallnz 7. 

7. Taking the ratio a, /7? and simplifying yields 1 — $. 
Weshowthat | < 1— 4 < } foralln = 2. Sincen > 0, 











13. $\sqrt{z} \to \sqrt{z}$ and $-\sqrt{z} \to -\sqrt{z}$. (Remember that the letter
z is a fixed positive number throughout the process.)
There are no cycles with period two.

15. $0 \to 0$, $2/3 \to 2/3$, and $2/5 \to 4/5 \to 2/5$

17. Regardless of the starting value (between 0 and 1),
the iterated function sequence in the spreadsheet will
eventually be 0 even when we should see the cycles
discussed in Exercise 16. Since computers typically
use "base two" representation of numbers, the values
of these sequences as fractions with odd denominators
cannot be represented exactly. Hence even though we
set an initial value like 2/3 in cell A1, the computer
stores something very close to but not exactly equal
to 2/3 in its memory. This very small initial difference
gradually changes the very nature of the sequence as it
converges to 0 instead of staying in a cycle.
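The effect described here can be reproduced outside a spreadsheet. The sketch below assumes the function in question is the "tent" map (2x on the lower half of the interval, 2 − 2x on the upper half), which is consistent with the fixed points and the 2-cycle listed in Exercise 15 but is not stated in this answer. It compares exact rational arithmetic with binary floating point.

```python
from fractions import Fraction


def tent(x):
    """Assumed map: 2x when x < 1/2, and 2 - 2x otherwise."""
    return 2 * x if 2 * x < 1 else 2 - 2 * x


def iterate(x, steps):
    """Apply the map `steps` times, using whatever number type x has."""
    for _ in range(steps):
        x = tent(x)
    return x


exact = iterate(Fraction(2, 3), 100)  # exact arithmetic stays at 2/3
rounded = iterate(2 / 3, 100)         # binary floating point drifts to 0
```

With exact fractions the value 2/3 is a genuine fixed point, while the floating-point value loses one binary digit per step until nothing but 0 is left.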

(a) Proof Letn > 2be given. Since Fy = Fri + Fy 
it follows (by dividing through by Fy.) that 











By definition of rq, this is the same thing as ry = 
*u 
b g(x) = lel 
(e) 434 Ls and E35 — 15, The values ofr, are 
the ratio of successive Fibonacci numbers, and these 
ratios "converge" to 155 as n is increased, 








then $ > 0, and hence | — $ < |. On the other hand, 

since n > 2, we know that 2n > 4, and therefore $ < $, 

so} — d > | ~1 = J, By Proposition 3, it follows that 

a, € OP). 

9. 17; Thus for n = 1, we have $ < a,/n* < 22, By Propo- 
sition 3, it follows that a, € O(P). 

(a) Taking the ratioa, / rand simplifying yields — £ 

B-À-5-(E + 3).Forn> 12 eachofthe 

3, and $ isles than |, Thus forn > 12, 





u. 





fractions 
we have d < a,/m = 

(b) Taking the ratio,/n*and simplifying yields 5 + 5 — 
$ + 2. This satisfies the inequality 

















EEE 


From part (a), we know that for n = 12, $4 $ + 
351. Hence the inequality, 


(aeaa 


implies the inequality, 4 < a, < 6. Thus for n > 12, 
we haved < ay/n <6, 
(©) Taking the ratioa /n'and simplifying yields — $ — 


4+ jr This satisfies the inequality 


4 
n 


6 


6 
" 


1 





i 








and the result follows just as in part (b) 
13. (a) 11, 21, 51, 101; (c) 16, 26, 56, 106 
14. (a) Leta, = n. Let positive numbers K and N be given 
We must find an integer n > N, for which 4 > K. Since 
$ = m, we may take any integer for m thats larger than 
both K and N. If we do, we will have n > N and also 
HK 
18. Answers can vary in each case 
(4) For a, = 2a,.,-- n and ay = 1, it appears in the 
spreadsheet that | n < a, < f-m forall n = 10. 
(b) For ay = ay, (+17) and ay = 10, it appears 
in the spreadsheet that | +1? < a, < $ +n? for all 
n> 69, 
(©) For a, = a, , + ri and ai = 3, it appears in the 
































spreadsheet that $ in? < ay < $n? for all n 
13. 

18, (a) Proof by induction, We will prove that a, < 3n? 
for alln > 1, Since a; = 1, the first statement, "a, < 
3” is true. Let m > 2 be given such that the state- 
ment “an < 3m" is the first one not yet checked 





to be true, In particular, the statement, "a, , < 
3(m — 1" has already been checked to be true. 
From this we can infer ih. 





D 
3-1 3m 
=n? -3(m-1) 
nf since m—1 > 0 


Hence a, < 3n, which is precisely statement 
Pini). . 
Proof by induction, We will prove that d, > n for 
all = 1, Since d; = 1, the first statement, “d, > 1” 
is true, Let m > 2 be given such that the statement 
“dy 2 m is the first one not yet checked to be true. 
In particular, the statement, "d, , > m — 1" has al- 
ready been checked to be true. >From this we ean 


n) 







infer that 
dn = dy + [ril 
= (m-1) [mi] 
meni - 1) 
= msince [fpi] > 1 


Hence dy =m, which is precisely statement 
Pini). " 

20. (a) Proof. Since for all $n \in \mathbb{N}$, $1 \cdot f(n) \le f(n) \le 1 \cdot f(n)$,
it follows that $f(n) \in \Theta(f(n))$.
(b) Proof. Let functions f and g be given such that
$f(n) \in \Theta(g(n))$. This means that there is a natural
number M such that for all $n \ge M$, $K \cdot g(n) \le f(n) \le L \cdot g(n)$
for some positive real numbers K and L. From this it
follows that for all $n \ge M$,
$\frac{1}{L} \cdot f(n) \le g(n) \le \frac{1}{K} \cdot f(n)$, and so $g(n) \in \Theta(f(n))$.
(c) Proof. Let functions f, g, and h be given such that
$f(n) \in \Theta(g(n))$ and $g(n) \in \Theta(h(n))$. This means
that there is a natural number $M_1$ such that for all
$n \ge M_1$, $K_1 \cdot g(n) \le f(n) \le L_1 \cdot g(n)$ for some positive
real numbers $K_1$ and $L_1$, and there is a natural
number $M_2$ such that for all $n \ge M_2$,
$K_2 \cdot h(n) \le g(n) \le L_2 \cdot h(n)$ for some positive real numbers $K_2$
and $L_2$. We choose M to be the larger of $M_1$ and
$M_2$. In this case, when $n \ge M$, then

$$f(n) \le L_1 g(n) \quad \text{since } n \ge M_1$$
$$\le L_1 L_2 h(n) \quad \text{since } n \ge M_2$$

and

$$f(n) \ge K_1 g(n) \quad \text{since } n \ge M_1$$
$$\ge K_1 K_2 h(n) \quad \text{since } n \ge M_2$$

It follows from this that $f(n) \in \Theta(h(n))$.
23. (a) We can show that $3^n \notin O(2^n)$ as follows: Let any
$K > 0$ be given, and consider an integer n that is greater
than $\log_{1.5} K$. In this case,

$$\frac{3^n}{2^n} = (1.5)^n > (1.5)^{\log_{1.5} K} = K.$$

Since this can be done for any $K > 0$, this shows (by
Proposition 3) that $3^n \notin O(2^n)$.

(©) f(n) = 2 and gin) = n? 

25. Proof by induction on k. The proposition contains the
predicate we wish to prove, but it is a bit of a mouthful.
Therefore, we will use the notation W(k) to stand for the
statement, "Algorithm COUNTERFEIT DETECTOR requires
k weighings to find a counterfeit among $2^k$ coins." This
statement is clearly true when k = 1 since you need one
weighing (i.e., weigh either one and see if it is one ounce
or not) when you have two coins. That is, W(1) is true.
Now let $m \ge 2$ be given such that statements
W(1), ..., W(m − 1) have all been checked to be true.
To verify W(m) we must envision the algorithm being
performed on a collection of $2^m$ coins, one of which
is counterfeit. In the first step of the algorithm, the
coins are broken into two groups, each consisting of
$2^{m-1}$ coins, and one of the groups is weighed to determine
whether it or the other group contains the counterfeit.
After this step, the algorithm operates on this
remaining collection of $2^{m-1}$ coins, and so by the
previously checked statement W(m − 1), it requires m − 1
more weighings. Therefore, a total of 1 + (m − 1) = m
weighings are needed to find the counterfeit among the
original $2^m$ coins. This is precisely statement W(m) that
we had hoped to verify, completing the induction.
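The halving strategy in this proof can be simulated directly. The sketch below is one possible reading of the algorithm, not the book's own pseudocode: it assumes a scale that reports total weight, genuine coins of a known common weight, and a counterfeit of some other weight. The function name is hypothetical.

```python
def find_counterfeit(weights, genuine):
    """Return (index of counterfeit, number of weighings used).

    Each step weighs the first half of the remaining coins; if that half
    does not weigh what genuine coins would, the counterfeit is in it,
    otherwise it is in the other half.
    """
    lo, hi = 0, len(weights)  # the counterfeit lies in weights[lo:hi]
    weighings = 0
    while hi - lo > 1:
        mid = (lo + hi) // 2
        weighings += 1
        if sum(weights[lo:mid]) != genuine * (mid - lo):
            hi = mid
        else:
            lo = mid
    return lo, weighings
```

Integer weights (for example, tenths of an ounce) keep the comparison exact. With $2^k$ coins the loop runs exactly k times, in agreement with statement W(k).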

26. The completed table is shown below, 


























Value of flogs (ny) | 








(a) C: (b) Br (e) A 





33. (a) Proof by induction. 




















512 | 1024 








When n = 1, the inequality is
"$b_1 \le 2$," which is true since we are given that $b_1 = 1$
as part of the recursive description. Let $m \ge 2$ be
given such that $b_m$ is the first element of the sequence
not yet checked to satisfy the inequality.
Then

$$b_m = b_{\lfloor m/2 \rfloor} + m \le 2\lfloor m/2 \rfloor + m \le 2(m/2) + m = 2m$$
































































te s [19 | os |an | ss [zs] es swos | 175099 
n | a |w |n| 2107 19683 | s 
24 | 26 | 22 | 28] 2 s | aw 
© 
i Ee en e nn [o 28 5 7 8 9 10 
alaj 4] 3s] w | 2] e| 1s] ase sa 1024 
= —— dE 
«|| s| se | 296 | 1400 | 6248 | 26936 | 113576 | 471800 | 1939688 | 7916216 
w| a | 16 | o4 | 256 [wn | e | 16384 | esee | 262194 | 1048576 
rao | 2 | 35 | 46 | ss | e| ee 69 72 74 75 























Section 5.1 Exercises 


1. (a) List two choices for candy bar in alphabetical order:
{kk, kl, km, ll, lm, mm}

(b) Since the first bar is given away, order matters this
time, so we get {kk, kl, lk, km, mk, ll, lm, ml, mm}

(c) Number the shirts from 1 to 9, and list the 3 chosen
in numerical order: {123, 124, ..., 789}

(d) The set has all the entries from the previous problem,
plus entries such as 111, 334, 667, etc.

(e) Number the flavors 1 to 8, and list two choices of
flavors in numerical order. Here are a few: {11, 12,
13, ..., 18, 22, 23, ..., 77, 78, 88}

(f) Assuming you think chocolate on top of vanilla is
different from vanilla on top of chocolate, you get
all the entries from the previous problem, plus their
opposites (for example, 21 along with 12).
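Lists like these can be generated mechanically, which is a handy way to check that nothing was missed. The sketch below uses Python's `itertools`; the variable names and the reading of each part (bag, ordered list, or set) are illustrative assumptions based on the answers above.

```python
from itertools import combinations, combinations_with_replacement, product

bars = "klm"  # the three kinds of candy bar

# (a) two bars, order irrelevant, repeats allowed (a bag)
part_a = ["".join(p) for p in combinations_with_replacement(bars, 2)]

# (b) two bars, order matters, repeats allowed (an ordered list)
part_b = ["".join(p) for p in product(bars, repeat=2)]

# (c) three different shirts out of nine, order irrelevant (a set)
part_c = list(combinations(range(1, 10), 3))

# (e) two scoops from eight flavors, order irrelevant, repeats allowed
part_e = list(combinations_with_replacement(range(1, 9), 2))
```

The three `itertools` functions correspond to the three kinds of selection: `combinations` for sets, `combinations_with_replacement` for bags, and `product` for ordered lists.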

3. In Exercise 1, we have (a) unordered list (bag), (b) ordered
list, (c) set, (d) bag, (e) bag, (f) ordered list

4. (a) set; (c) unordered list (or perhaps ordered list if you
care which number was on which die); (e) unordered
list (or set if duplicate toppings are not allowed)

5 (a) 36; (b) 6:2; 7 is more likely: (e) 10; J 289 

7. (a) 8 0) 3: (0) 7:60) 3 

9. (a) 6; (b) 2; (c) 2; (d) A win in three games for player A
must have 2 A's and 1 B, and one of the A's must come
last. We define a correspondence from these to
the ordered lists of length 2 containing 1 A by simply
dropping the final A.

11, (a) 24; yes, by counting rows and colum 

12 




















b) 12:2; (0) 


12, (2) 6; (©) 81; (0) 24 
13. (a) 12; (c) 120; (e) 21 


Section 5.2 Exercises 


1. (a) 60,466,176; (b) 9,765,625
3. (a) 175,760,000; (b) 174,790,000

5. 24

6. (a) 4080; (b) 3840; (c) 3855; (d) 240

8. (a) 1776; (b) 8760; (c) 8256

9. (a) 216; (b) 120

11. (a) 311,875,200; (b) 18,345,600; (c) 617,760
13. (a) 40,320; (b) 10,080

14. (a) 16; (b) 384; (e) 51

16. (a) 43,680; (b) 3640; (c) 28,756; (d) 1800
18. (a) 362,880; (b) 2880

20. (a) 2,985,984; (b) 665,280; (c) 77.72%

22. (a) 128; (b) 128; (c) 16; (d) 240







14. (a) 8; (c) 8; (e) 8

16. (a) 18; (b) 24: (c) 48; (d) i. Sri 16 

18. (a) Given the set, write a sequence of 1's and 0's with
1's in the positions given by the set.

01100   00011   00101   10001
{2,3}   {4,5}   {3,5}   {1,5}

10010   00110   10100   01001
{1,4}   {3,4}   {1,3}   {2,5}

20. (a) This is simply a matter of changing the A's and B's
in an arrangement from (i) into 1's and 0's, respectively.
This will change them (one-for-one) into the
binary sequences described in (ii).

(b) Items described by (i) look like {x, y} where x and y
are two different elements from {1, 2, 3, 4, 5}. Each
of these can be "linked" with the set {a, b, c} from
{1, 2, 3, 4, 5} of three entries which are neither x nor
y. In this way, each item described by (i) is linked
(one-for-one) with an item described by (ii).

(c) For each permutation of all 5 elements we associate
the permutation of length 4 that comes from taking
the first 4 elements in the same order. For example,
to 13ax9 we associate 13ax. It is easy to see that this
association is one-to-one since the first four entries
in the list determine the fifth exactly.

(d) If we represent a sequence of 5 coin tosses as a list
of length 5 with entries taken from {H, T} then the
items in (i) will look like HHTTH, THHHT, etc.
To each of these we can associate the set {x, y, z}
of the three positions of the H's. For example,
HHTTH is associated with {1, 2, 5}, THHHT is
associated with {2, 3, 4}, and so on.




















23. (a) 5292; (b) 56; (c) 3276

25. (a) 159; (b) 960

26. (a) $19 \cdot 18 \cdot 17 \cdot 16 \cdot 15 = 1{,}395{,}360$; (b) $6 \cdot 5 \cdot 4 \cdot 3 \cdot 2 \cdot 1 = 720$;
(c) $120 \cdot 119 \cdot 118 \cdot 117 \cdots 106 \cdot 105 \cdot 104 \approx 6.755 \times 10^{34}$

27. (a) P(9, 4); (c) P(365, 28); (e) P(k, m + 1)

29. The problem is that, by the time you get to vice
president, there could be either 7 or 6 choices, depending
on the sex of the president. You must put the more
restrictive steps before the less restrictive steps in your
algorithm: There are 7 choices for the vice-president,
10 for the secretary, and 15 for the president, yielding
$7 \cdot 10 \cdot 15 = 1050$ possible results.

31. This algorithm "double-counts" each outcome that has
a math major for both of the offices. One way to fix
















it is to properly apply the Inclusion-Exclusion Principle,
by subtracting the overlap. The overlap is given
by this algorithm: "8 choices for president, 7 for
vice-president, and 24 for secretary," so the total count
is $9600 - 8 \cdot 7 \cdot 24 = 8256$. Another possible algorithm
counts three non-overlapping sets: (1) president is math
major and vice-president is not; (2) vice-president is
math major and president is not; (3) both are math
majors [$8 \cdot 18 \cdot 24 + 18 \cdot 8 \cdot 24 + 8 \cdot 7 \cdot 24 = 8256$]. A
third algorithm counts the complement (neither are








Section 5.3 Exercises 


1. (a) ab, ac, ad, ae, ba, bc, bd, be, ca, cb, cd, ce, da,
db, dc, de, ea, eb, ec, ed; (c) 2 equivalence classes, each
of size 10
2. (a) 60; (c) {ade, aed, dae, dea, ead, eda}; (e) 10
3. (a) {ACDBFE, CDBFEA, DBFEAC, BFEACD,
FEACDB, EACDBF}
5. 96
7. 144
10. 32
12. 120
14. 35
1S, (a) 5,985; (0) 2,475 
1 (a) 220; (e) 10 

19. (a) 256; (c) 56 

. Second solution: Solve the complementary problem, 
giving 8,008. 

23, 189,303.411,990 

24, Hint: Break the committee selections up into five 
‘eases based on how many married women are chosen, 

25, 886,656 

27, (a) 74613 
(b) 7,560 

29. (2) 10,477,677,064,400; (c) About 9,626,413 to 1 

32. 35 is the coefficient of; the coefficient of is 0, 

38. Proof by induction. Let P(n) be the statement,
"$\sum_{k=0}^{n} C(n, k) = 2^n$." It is easy to check that P(0) (which
says "$C(0, 0) = 2^0$") and P(1) (which says "$C(1, 0) +
C(1, 1) = 2^1$") are both true. Now let $m \ge 2$ be given
such that statements P(0), P(1), ..., P(m − 1) have all
been checked, and P(m) is the next statement to be
checked. So

$$\sum_{k=0}^{m} C(m, k) = C(m, 0) + \sum_{k=1}^{m-1} \bigl(C(m-1, k-1) + C(m-1, k)\bigr) + C(m, m)$$


math majors) and subtracts from the overall count
[$26 \cdot 25 \cdot 24 - 18 \cdot 17 \cdot 24 = 8256$].
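All three counting strategies can be compared against a direct enumeration. The sketch below assumes the setting implied by the arithmetic above: a club of 26 members, 8 of them math majors, choosing a president, vice-president, and secretary (all different people) so that at least one of the first two offices goes to a math major. These details are inferred, not quoted from the exercise.

```python
from itertools import permutations

MATH, OTHER = 8, 18  # assumed make-up of the 26-member club
members = ["M"] * MATH + ["O"] * OTHER

# Direct enumeration of (president, vice-president, secretary) triples
brute_force = sum(
    1
    for pres, vice, sec in permutations(range(len(members)), 3)
    if members[pres] == "M" or members[vice] == "M"
)

inclusion_exclusion = 8 * 25 * 24 + 8 * 25 * 24 - 8 * 7 * 24
three_cases = 8 * 18 * 24 + 18 * 8 * 24 + 8 * 7 * 24
complement = 26 * 25 * 24 - 18 * 17 * 24
```

The enumeration visits all $26 \cdot 25 \cdot 24 = 15{,}600$ ordered triples, so it runs quickly and gives an independent check on the three formulas.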

34. 013440 

36. 3168 

37, 25 

39. (a) 375; (b) 303; (c) 216 

41, 955,514,880 

A2. 10 subsets, which is half the number of permutations. 


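$$= C(m-1, 0) + \sum_{k=1}^{m-1} C(m-1, k-1) + \sum_{k=1}^{m-1} C(m-1, k) + C(m-1, m-1)$$
$$= \sum_{k=1}^{m} C(m-1, k-1) + \sum_{k=0}^{m-1} C(m-1, k)$$
$$= 2^{m-1} + 2^{m-1} = 2^m \quad \text{by statement } P(m-1)$$

Hence statement P(m) is also true.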
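Both ingredients of this induction, the row-sum identity and the Pascal's-triangle rule used to split C(m, k), are easy to confirm for small values. The sketch uses `math.comb` for C(n, k); the helper names are illustrative.

```python
from math import comb


def row_sum(n):
    """Sum of C(n, k) for k = 0, ..., n."""
    return sum(comb(n, k) for k in range(n + 1))


def pascal_rule_holds(m):
    """Check C(m, k) = C(m-1, k-1) + C(m-1, k) for 1 <= k <= m-1."""
    return all(comb(m, k) == comb(m - 1, k - 1) + comb(m - 1, k)
               for k in range(1, m))
```

A check of this kind does not replace the proof, but it guards against an index slip in the summations.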
39. The 3" entry of Row 7. 
A1. (a) The coefficient of. in (1 + (1 +x) + (1 +x)? + 
+ (1 +x)") is the same as the coef 
(14 (12) (L3) + CI+ x)", which can be 
found by summing the coefficients of x in the compo- 
ment terms 1, (1 9-2), (1 +x)?, «++ (1 +a)". From the 
binomial theorem, this is simply 











C(1.1) + CQ, 1) FEB) (n1) = 
+2430 


Of course, the coefficient of x? in (1 x)" is C(n- 
12) 

43. This problem uses the mod 10 arithmetic from Section
2.7. Let A, B, C, D and E be the values of the original
cards. The rows formed are

[A, B, C, D, E]

[A + B, B + C, C + D, D + E]

[A + 2B + C, B + 2C + D, C + 2D + E]

[A + 3B + 3C + D, B + 3C + 3D + E]

[A + 4B + 6C + 4D + E]

where each operation uses mod 10 arithmetic. Since
$6 \equiv -4 \pmod{10}$, we know that

$$A + 4B + 6C + 4D + E \equiv A + 4B - 4C + 4D + E \pmod{10}$$

The right-hand side of this equation is precisely the
calculated value A + E + 4(B + D − C) in the magic trick.
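The claim can be tested by building the rows explicitly and comparing the last entry with the shortcut. In this sketch the function names are hypothetical; card values are taken to be the digits 0 through 9.

```python
from itertools import product


def final_card(cards):
    """Repeatedly add neighbours mod 10 until a single value remains."""
    row = list(cards)
    while len(row) > 1:
        row = [(x + y) % 10 for x, y in zip(row, row[1:])]
    return row[0]


def shortcut(cards):
    """The magician's calculation A + E + 4(B + D - C), reduced mod 10."""
    a, b, c, d, e = cards
    return (a + e + 4 * (b + d - c)) % 10
```

Python's `%` always returns a value from 0 to 9 here, even when B + D − C is negative, so the two functions can be compared directly over all 100,000 possible rows.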





Section 5.4 Exercises 


1. (a) 28; (c) 247

3. (a) 24; (c) 4,096

5. (a) 1,048,576; (c) 489,888

7. 4200

9. C(19, 2) (18,2)c (6.2) 
105 

13. (a) 24 weird arrangements look like ALAAMAB once
the goofiness is removed.

(b) 210

15. (a) The equations are 2 + 2 + 2 + 2 + 2 + 2 = 12,
6 + 0 + 0 + 0 + 6 + 0 = 12, and 0 + 0 + 0 + 12 + 0 + 0 = 12.
The binary sequences are 00100100100100100,
00000011110000001, and 11100000000000011.
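The correspondence between solutions and binary sequences is mechanical: each value becomes a run of 0's, and 1's separate consecutive runs. A minimal sketch, with hypothetical function names:

```python
from math import comb


def encode(solution):
    """Turn a list of nonnegative integers into its 0/1 sequence."""
    return "1".join("0" * value for value in solution)


def decode(bits):
    """Recover the list of values from a 0/1 sequence."""
    return [len(block) for block in bits.split("1")]


# Six values summing to 12 give sequences of twelve 0's and five 1's,
# so the number of solutions is the number of ways to place the 1's.
number_of_solutions = comb(17, 5)
```

Because `decode` undoes `encode`, the correspondence is one-to-one, which is what justifies counting sequences instead of solutions.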





+C(4, 2)C(2, 2) = 2.13829 x 





Section 5.5 Exercises 


1. Let $a_n$ be the number of n-digit numbers which do not
use "0" for a digit. Any such n-digit number can be made
by choosing a non-zero digit as the leading digit, and then
following it with an (n − 1)-digit number with the same
property. This means that

$$a_n = 9 \cdot a_{n-1}$$

This along with the fact that $a_1 = 9$ completely solves the
problem.


3. Proof by induction. From Example 5 we know $T_1 = 0$,
$T_2 = 2$, and $T_n = 2 \cdot T_{n-2}$ for $n > 2$. Let P(n) be the statement, "If
n is even, then $T_n = 2^{n/2}$; if n is odd, then $T_n = 0$."
Since $T_1 = 0$ and $T_2 = 2$ from the example,
we know statements P(1) and P(2) are true. Now
suppose P(1), P(2), ..., P(m − 1) have
all been checked to be true for some integer $m \ge 3$. We
have two cases to consider depending on whether m is
even or odd.

Case 1. If m is odd, then

$$T_m = 2 \cdot T_{m-2} \quad \text{from Example 5}$$
$$= 2 \cdot 0 \quad \text{by statement } P(m-2) \text{ since } m - 2 \text{ is also odd}$$
$$= 0$$

Case 2. If m is even, then

$$T_m = 2 \cdot T_{m-2} \quad \text{from Example 5}$$
$$= 2 \cdot 2^{(m-2)/2} \quad \text{by statement } P(m-2) \text{ since } m - 2 \text{ is also even}$$
$$= 2^{1 + (m-2)/2}$$
$$= 2^{m/2}$$



mam 
19.78 

21. Using cases based on the value of 
23. 120 

25, Using four cases, we get a total of 16 solutions, 

27. 1,391,975,640, 

29. C(210, 99) — C(166, 99) = 6.4319 x 10" 

31, Taking the number of integer solutions to the equation 





we get 161 solutions 








$$x_1 + x_2 + x_3 + x_4 = 12$$

that have each $x_i \ge 1$ and subtracting the number of
solutions that have any of the $x_i \ge 7$ leaves the number
of desired solutions as 125.


In either case, we have confirmed the truth of statement
P(m), completing the induction.

5. (a) $P(n, 1) = n \cdot P(n-1, 0) = n$; (c) $P(n, 3) = n \cdot P(n-1, 2)
= n \cdot (n-1) \cdot (n-2)$ by part (b)

7. Let $c_n$ be the number of ways to cover a $2 \times n$ chessboard
with $1 \times 2$ dominoes. For example, $c_3 = 3$ since
we can cover a $2 \times 3$ chessboard in the following ways:

[Figure: the three coverings of a 2 × 3 chessboard]

Similarly one can verify that $c_1 = 1$ and $c_2 = 2$. To see the
recursive model, just observe that the left side of every
covering looks like one of the following:

[Figure: the possible left ends of a covering]

By thinking of how to complete the covering in each case,
we can conclude that $c_n = c_{n-1} + c_{n-2}$.
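The recurrence can be compared with a direct count of coverings. The sketch below is an illustration with hypothetical function names: one function applies the recurrence with $c_1 = 1$ and $c_2 = 2$, and the other counts coverings of the board by backtracking, without using the recurrence at all.

```python
def tilings_by_recurrence(n):
    """c_n from c_1 = 1, c_2 = 2 and c_n = c_(n-1) + c_(n-2)."""
    a, b = 1, 2
    if n == 1:
        return a
    for _ in range(n - 2):
        a, b = b, a + b
    return b


def tilings_by_search(n):
    """Count domino coverings of a 2 x n board by exhaustive backtracking."""
    filled = [[False] * n for _ in range(2)]

    def place(pos):
        # Cells are scanned column by column: pos -> (row, col) = (pos % 2, pos // 2)
        while pos < 2 * n and filled[pos % 2][pos // 2]:
            pos += 1
        if pos == 2 * n:
            return 1  # every cell is covered
        row, col = pos % 2, pos // 2
        count = 0
        # Vertical domino covering both cells of this column
        if row == 0 and not filled[1][col]:
            filled[0][col] = filled[1][col] = True
            count += place(pos + 1)
            filled[0][col] = filled[1][col] = False
        # Horizontal domino covering this cell and the one to its right
        if col + 1 < n and not filled[row][col + 1]:
            filled[row][col] = filled[row][col + 1] = True
            count += place(pos + 1)
            filled[row][col] = filled[row][col + 1] = False
        return count

    return place(0)
```

Agreement between the two functions for small n supports the recursive model; the values are the Fibonacci-style sequence 1, 2, 3, 5, 8, ....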











9. Proof by induction. Since Y Md - 
and d, — 0, the statement is true when n= 1. Since 
ADi- 20-141) -1 and di — 1, the 


statement is true when n = 2, Let m > 3 be given such 
that the first statement that has not yet been checked. 
Then 


d, = (M—A) (dy ds s) 


= (m= m= LIE + m=) 







4-25 0 


= mim SI Bm EH 
mR & 





XT 
Hmm Mo 


= mEt 





pea! 
SnD) a 


[i 


em 
=m Sone eC 


[zl 


=m Pong 


Section 5.6 Exercises 


1. (a) ay = 7 +3; (€) dy = 1+ 20+ nin - 1) 
3 (a) n+ 2n(n — 1); (e) 6n + Ernie — 1) + $n(n — n — 
2) 


6 (B) dy = 1-3 —3;(€) dy = 172) 1 
7. The mortgage will be paid off in the 139th month.








% 8) = Sands, = 3 -34-3 —2form = 1 has closed formula 
m=i P+ 

11. Since a, = L- I + >, if ay = 1, we must have L= 
1-15. 


13. Proof by induction. If n = 0, the statement is "$s_0 =
s_0 + K \cdot 0$," which is certainly true. Let $m \ge 1$ be given
such that all of the statements up to "$s_{m-1} = s_0 + K \cdot
(m - 1)$" have been verified to be true. Now

$$s_m = s_{m-1} + (s_m - s_{m-1})$$
$$= s_{m-1} + K \quad \text{since } s_n \text{ has constant first difference } K$$
$$= (s_0 + K(m - 1)) + K$$
$$= s_0 + Km$$

18. (Qa = 7:2 =n? —4n—6 

17. (8) ig = C3 + KA (eu, e CA + K (23) 

18. (8) dy = ^ —Y (e) an = $4" —(-3)") 


Section 6.1 Exercises 


LF 

2 a} 

REO EOE 

4i 

Some have more than one legitimate answer. (a) The 
set of ordered lists of length 3 using elements from 
















12 fox = forall k> 0, fay 
=D fara 


14. Let $a_n$ be the number of positive n-digit numbers with
no consecutive 1's. We observe that $a_1 = 9$ and $a_2 = 89$
(any number from 10 to 99 except 11). Assume that
we understand these sorts of numbers when they have
less than n digits but we are now asked to count how
many are like this with n digits. To form these we can
add 0, 2, 3, ..., 8, 9 to the right end of an (n − 1)-digit
number without consecutive 1's, or we can add one of
{01, 21, 31, 41, 51, 61, 71, 81, 91} to the right end of an
(n − 2)-digit number with no consecutive 1's. This means
that $a_n = 9a_{n-1} + 9a_{n-2}$.
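A recurrence like this is easy to test against a direct count for small n. In the sketch below (function names are illustrative), one function scans every n-digit number for the substring "11" and the other applies the recurrence from the starting values 9 and 89.

```python
def count_direct(n):
    """Count n-digit positive integers whose decimal form has no '11'."""
    return sum("11" not in str(v) for v in range(10 ** (n - 1), 10 ** n))


def count_recursive(n):
    """a_1 = 9, a_2 = 89, and a_n = 9 a_(n-1) + 9 a_(n-2) afterwards."""
    a, b = 9, 89
    if n == 1:
        return a
    for _ in range(n - 2):
        a, b = b, 9 * b + 9 * a
    return b
```

For instance, both functions give 882 for three-digit numbers: 900 in total, minus the 18 that contain two adjacent 1's.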


1 for all n> 1, and 


f. 








19, (a) ay =C- (HA) c (13) 

aa an= (32). (44) « (i2) (142) 

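21. Proof. Suppose that the equation $x^2 = cx + d$ has
distinct solutions x = r and x = s, and consider the
closed formula $a_n = C \cdot r^n + K \cdot s^n$. From this formula,
it follows that $a_{n-1} = C \cdot r^{n-1} + K \cdot s^{n-1}$ and
$a_{n-2} = C \cdot r^{n-2} + K \cdot s^{n-2}$. Now we can combine this information
to get

$$c \cdot a_{n-1} + d \cdot a_{n-2} = c\,(C r^{n-1} + K s^{n-1}) + d\,(C r^{n-2} + K s^{n-2})$$
$$= C r^{n-2}(c \cdot r + d) + K s^{n-2}(c \cdot s + d)$$
$$= C r^{n-2}(r^2) + K s^{n-2}(s^2)$$
$$= C r^n + K s^n$$
$$= a_n$$

This verifies the relation $c \cdot a_{n-1} + d \cdot a_{n-2} = a_n$.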
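A concrete instance makes the argument tangible. The sketch below picks hypothetical coefficients c = 5 and d = −6, chosen only because $x^2 = 5x - 6$ has the convenient integer solutions 2 and 3, and checks that the closed formula satisfies the recurrence for several choices of C and K.

```python
c, d = 5, -6  # hypothetical coefficients: x^2 = 5x - 6
r, s = 2, 3   # the two distinct solutions of that equation


def closed_form(n, C, K):
    """a_n = C r^n + K s^n for the chosen roots r and s."""
    return C * r ** n + K * s ** n


def satisfies_recurrence(C, K, terms=15):
    """Check a_n = c a_(n-1) + d a_(n-2) for n = 2, ..., terms."""
    return all(
        closed_form(n, C, K)
        == c * closed_form(n - 1, C, K) + d * closed_form(n - 2, C, K)
        for n in range(2, terms + 1)
    )
```

The constants C and K play no role in whether the recurrence holds; they are fixed afterwards by the initial values of the sequence.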


22. (a) ay = (C+ Kon) 
23, (a) a, = (1- 4n)-2" 





{1, 2, 3, 4, 5, 6}; (b) The set of permutations of length
2 with elements from the set of club members (Treat
the first person listed as president, the second as vice
president.); (c) The set of ordered lists of length 6 with
elements from {H, T}; (d) The set of sets (combinations)
of size 3 with elements from the set of 52 cards, OR the
set of permutations of length 3 with elements from the
set of 52 cards
7.) ji (e) Ss) § 
9. There are a total of C(23,3) 
tees possible (a) $2; (c) 2 
"d 
12. Our sample space will be the set of all combinations of 
size 5 with elements from the deck of S2 card (a) 255: 
© ain 
18, The probability is approximately 0.753. 
17. The probability that two people in a group of 500 have
the same last four digits of their Social Security number
is approximately 0.9999969.
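This is a "birthday problem" with 10,000 possible values in place of 365 days. The sketch below computes the probability through the complement (all values distinct), assuming each four-digit ending is equally likely and independent from person to person; the function name is hypothetical.

```python
def prob_shared_value(people, values):
    """Probability that at least two of `people` share one of `values`
    equally likely outcomes (computed via the complement)."""
    p_all_distinct = 1.0
    for k in range(people):
        p_all_distinct *= (values - k) / values
    return 1.0 - p_all_distinct
```

Calling `prob_shared_value(500, 10_000)` reproduces the figure quoted above to the digits shown.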
19. (a) $; (© 5} (assuming n > m) 
ny 
Fem 
25, Answers will vary but should support the theoretical 
probability of |, 


= ITN different commit 





Section 6.2 Exercises 


1. (a) A draw which includes the Ace of Clubs is in both
events, so these events are not disjoint; (c) These events
are disjoint.

2. (a) Let $E_1$ be the set of outcomes where the card is an
Ace, and $E_2$ be the set of outcomes where the card is a
Jack. Then since these events are disjoint,

$$\text{Prob}(E_1 \text{ or } E_2) = \text{Prob}(E_1) + \text{Prob}(E_2)$$





3. (a) Let E, be the set of outcomes where the card is an 
ace, and E; be the set of outcomes where the card is an 
heart. Then 

Prob(E, or Ej 


Prob(E;)- Prob(E;) 





4 
-Prok(E, and Ej) = = 


(©) Let E; be the set of outcomes where the card is a 
diamond or a club, and E be the set of outcomes where 
the card is a king. Then 


Prob(E, ot Es) = Prob(E;) + Prob(E;) 
4 
-Prob(E and Ej) = 5 


A. (a) These events are not independent. Let E, be the set 
of outcomes where the first card isan Ace, and E; be the 
set of outcomes where the second card is a Ten, Jack, 
Queen or King. Then Prob(E;) 





a 




26. Answers will vary but should support the theoretical 
probability of approximately 0.22, 

27. To see why this works, imagine one half (consisting of 13
red cards and 13 black cards) came from a blue-backed
deck while the other half (consisting of 13 red cards
and 13 black cards) came from a green-backed deck.
The final deck will alternate colors of their backs even
though the faces of the cards are fairly shuffled. As the
solitaire game is played, cards are removed from the
deck in adjacent pairs which share the same face-color.
Hence at any point in the process, (i) the number of
blue-backed red cards is equal to the number of
green-backed red cards, (ii) the number of blue-backed black
cards is equal to the number of green-backed black
cards, and (iii) the blue-backed and green-backed cards
alternate. Given these three properties that remain
invariant as the game is played, it is impossible that the
game should ever end in a loss. This is true because a
losing final position would consist of cards alternating
in face colors, and property (iii) then dictates that the
red cards and black cards should have different back
colors, contrary to properties (i) and (ii).














16 

wa 
€) These events are independent. 

5. Because the first card is replaced and the deck is shuf- 
fled before the second card is drawn, in each case, the 
events described are independent. (a) $: (e) 8) 





Prob(E, and E; 




















ue 

n 

13. Prob(sumiscven) = 1; Prob(sumisa multiple of3) = | 

In d 

17. Jd 

19. (a) Let E, be the set of outcomes where the faceup card 
is an Ace, and let E: be the set of outcomes where the 





facedown card is a Ten, Jack, Queen or King. Then 
Prob, 





16 
5) eos 


46) Let E; be the set of outcomes where the opponent's 
faccup card is an Ace and the next two cards (mine) are 
a Four and a Five, in either order, and let F; be the set 
of outcomes where the opponent's facedown card is a 
Ten, Jack, Queen or King. Then 


16 ss 
Probl E:1E;) = 35 = 0327 




21. (a) Let E, be the set of outcomes where the opponent's 
faceup cards are {2D,6C}, and let E> be the set of out- 
comes where the opponent's facedown cards have val- 
ues (34,5), Then 

 Prob(EE)) = 0.0112 
(c) Let E; be the set of outcomes where the opponent's 
faceup cards are (5D.6C), and let £z be the set of out- 
comes where the opponent’s facedown cards have val- 
ues (234) or (34/7) or (478) or (7,89). Then 
Prob(E;\E\) ~ 0.0449 


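23. We use the numbers already calculated in Example 9.
(a) $\text{Prob}(P \text{ and } \overline{S})$ is the probability that the steroid
result is positive and the athlete has not used steroids.

$$\text{Prob}(P \text{ and } \overline{S}) = \text{Prob}(\overline{S}) \cdot \text{Prob}(P \mid \overline{S})$$
$$= (1 - \text{Prob}(S)) \cdot \text{Prob}(P \mid \overline{S})$$
$$= 0.0006$$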








Section 6.3 Exercises 


1. gig = 0.001875 

3% 

5 MIS 0323 

7. d ~ 0.196 

9. 4() 0" +6(4) (4) 120) Q) (0) (0 * 
AQ) + (1)! s 0325 

11. d 04226. 

14. The probability of winning the series is at least 0.9 when
$p \ge 0.8042$.

16. The following graph was made in Excel: 


[Graph: vertical axis from 0 to 0.3; horizontal axis labeled "Number of Heads (k)"]


(b) Events $P \cap S$ and $P \cap \overline{S}$ are disjoint and $(P \cap S) \cup
(P \cap \overline{S}) = P$, so by the sum rule,

$$\text{Prob}(P) = \text{Prob}(P \text{ and } S) + \text{Prob}(P \text{ and } \overline{S}) = 0.02985 + 0.0006 = 0.03045$$

(c) $\text{Prob}(S \mid P)$ is the probability that an athlete has used
steroids given that the result of the test was positive.
$\text{Prob}(S \mid P) = 0.9803$.
(d) $\text{Prob}(S \mid \overline{P})$ is the probability that an athlete has used
steroids given that the result of the test was negative.
$\text{Prob}(S \mid \overline{P}) = 0.000155$.
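The conditional probabilities in parts (c) and (d) follow from the two joint probabilities by division. In the sketch below, the joint probabilities are the ones quoted in parts (a) and (b); the value Prob(S) = 0.03 is an assumption (it is not stated in this answer) chosen because it is consistent with the quoted figures. Variable names are illustrative.

```python
p_pos_and_user = 0.02985    # Prob(P and S), quoted in part (b)
p_pos_and_nonuser = 0.0006  # Prob(P and not S), from part (a)
p_user = 0.03               # assumed Prob(S); not given in this answer

# Sum rule over the two disjoint ways of testing positive
p_pos = p_pos_and_user + p_pos_and_nonuser

# Conditional probabilities: joint probability divided by the condition's probability
p_user_given_pos = p_pos_and_user / p_pos
p_user_given_neg = (p_user - p_pos_and_user) / (1 - p_pos)
```

The numerator in the last line is Prob(S and not P), obtained by removing the users who tested positive from all users.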
24, $ 0.296 
26. 4; i = 0.5078 





18, $5 = 0036 

19. (w) Ej. (©) fa = 0.59. 

20, BE 093. 

22, (a) d: 0) Us (0 HB 

23, Hint: This is harder because there is no obvious sim- 
ple case structure, List all outcomes and find the prob- 
ability of ach, 





Section 6.4 Exercises 


1. Letting X denote the sum of a pair of dice, we will have
E[X] = 7, which is twice the expected value of a single
die roll.


312 
8, = 0.385 
6. Hint: Let X be the sum of the values of the pair of
cards. X can be any number from {4, 5, 6, ..., 22}. The
probability of X having any particular one of these values
must be calculated separately, leading to the final
answer, E[X] = 14.88. (This is the value of the average
starting hand in Blackjack, counting Ace as 11.)

9. The expected payoff is approximately $5.42. If 10,000
people play, the casino will take in $100,000 and expect
to pay out about $54,200, so the casino expects to make
a profit of about $45,800 per day on this game.

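11. Using the hint, we recall the identity from Exercise 36
of Section 5.3:

$$\sum_{k=0}^{6} k \cdot C(6, k) = 6 \cdot 2^5$$

Now using X to denote the number of H's we see in our
tossing of six coins, we have

$$E[X] = (0)\,C(6, 0)\left(\tfrac{1}{2}\right)^6 + (1)\,C(6, 1)\left(\tfrac{1}{2}\right)^6 + \cdots + (6)\,C(6, 6)\left(\tfrac{1}{2}\right)^6$$
$$= \left(\tfrac{1}{2}\right)^6 \sum_{k=0}^{6} k \cdot C(6, k)$$
$$= \left(\tfrac{1}{2}\right)^6 (6 \cdot 2^5) = 3$$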


13, Since the gencral identity from Exercise 37 of Section 
53is 





EUDE ces dee 


Section 6.5 Exercises 


L HX] - 6. 
3. AIX] = 
5. (a) Use p, to denote the probability that the r^ 
the first "Ace? Then for example, p, = 
n-8 











E- A and in general, 








1 
19. 


aL 


23. If we "zoom in"




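we can use x = 1/5 and n = 10 to get

\[ \sum_{k=0}^{10} k \cdot C(10,k)\left(\tfrac{1}{5}\right)^{k} = 10\left(\tfrac{1}{5}\right)\left(\tfrac{6}{5}\right)^{9} \]

and we can multiply both sides of this equation by (5/6)^10 to get

\[ \sum_{k=0}^{10} k \cdot C(10,k)\left(\tfrac{1}{6}\right)^{k}\left(\tfrac{5}{6}\right)^{10-k} = 10\left(\tfrac{1}{5}\right)\left(\tfrac{5}{6}\right) = \tfrac{5}{3} \]

Now if X is the number of 1's that occur in the ten rolls, then

\[ E[X] = \sum_{k=0}^{10} k \cdot C(10,k)\left(\tfrac{1}{6}\right)^{k}\left(\tfrac{5}{6}\right)^{10-k} = \tfrac{5}{3} \]

by the identity above.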
If the sample space is {MF, FM, MM}, the expected number of boys is 4/3. If the sample space is {MF, MM}, the expected number of boys is 3/2.

4 396 

Following the method of Example 7 with X representing the number of games in the best-of-three series, we have E[X] = 6p^4 − 12p^3 + 3p^2 + 3p + 3. The graph of this expression shows that the maximum length of the series occurs when p = 1/2.
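The degree-4 expression can be cross-checked by direct enumeration. The sketch below assumes the series stops as soon as one side has won three games (so it lasts at most five games), which is the reading under which the polynomial above is reproduced exactly; the function names are illustrative.

```python
from fractions import Fraction
from math import comb

def expected_series_length(p, wins_needed):
    """Expected number of games when play stops as soon as one side
    reaches `wins_needed` wins; p is the chance side A wins any game."""
    q = 1 - p
    total = 0
    for games in range(wins_needed, 2 * wins_needed):
        # The winner takes the last game and wins_needed - 1 of the earlier ones.
        ways = comb(games - 1, wins_needed - 1)
        losses = games - wins_needed
        prob = ways * (p**wins_needed * q**losses + q**wins_needed * p**losses)
        total += games * prob
    return total

def polynomial(p):
    return 6 * p**4 - 12 * p**3 + 3 * p**2 + 3 * p + 3

# The closed form and the enumeration agree at several exact values of p.
for p in (Fraction(1, 10), Fraction(1, 3), Fraction(1, 2), Fraction(4, 5)):
    assert expected_series_length(p, 3) == polynomial(p)

print(expected_series_length(Fraction(1, 2), 3))
```

The same function with `wins_needed=4` gives the expected length of a first-to-four series, which is the quantity behind a winning-margin calculation.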

The solution to Example 7 and the fact that the winner of a series must win 4 games together tell us that the average winning margin is 2.1875 games.

on the graph in the solution to Example 7, we can see that when p is less than about 0.215 or more than about 0.785,




















(c) The solution to Exercise 4 assumes that the trials are independent. "Drawing cards without replacement" does not have this property.

7. We have the recurrence relation fy — 2 and 


h= (3) Otm (3)0 +m 


or more simply, Ay 
9. p=81/82. 





2+ M-i foralln = 2. 





Which is Equation (62). 
15. p ≈ 0.82

1. (Qu po Sand - 9

19. The expected length of this game is 48 coin tosses.

21. In the Hank and Ted game played with a fair coin, if Hank starts with X markers and Ted starts with Y markers, then the probability that Hank wins is X/(X + Y), the probability that Ted wins is Y/(X + Y), and the expected length of the game is (X)(Y) moves.

23. Let $d_n = (n)(M - n)$. The conditions $d_0 = 0$ and $d_M = 0$ are obviously satisfied by this definition. Since

\[ d_{n+1} + 1 = (n+1)(M - (n+1)) + 1 = nM - n^2 - 2n + M \]

and

\[ d_{n-1} + 1 = (n-1)(M - (n-1)) + 1 = nM - n^2 + 2n - M, \]

we conclude that

\[ \tfrac{1}{2}(d_{n+1} + 1) + \tfrac{1}{2}(d_{n-1} + 1) = \tfrac{1}{2}(2nM - 2n^2) = nM - n^2 = (n)(M - n) = d_n. \]

25. (a) Let $d_n = c\left(1 - \left(\tfrac{1}{2}\right)^n\right) - 3n$. Then $\tfrac{2}{3}(d_{n+1} + 1) + \tfrac{1}{3}(d_{n-1} + 1)$ can be simplified as $c\left(1 - \left(\tfrac{1}{2}\right)^n\right) - 3n$. This final simplification is precisely the definition of $d_n$, so the relationship holds.
(b) If $d_n = c\left(1 - \left(\tfrac{1}{2}\right)^n\right) - 3n$, then $c \approx 60.00005722$.
(c) Using the previous result, $d_{10} \approx 29.94$.
(d) Looking at all values of $d_n$ for 0 < n < 20, we see that the game that begins with H = 4, T = 16 is expected to last the longest.

26. No matter how many total markers Ted has, Hank will win this game with probability greater than 0.999.
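The closed forms in these answers can be checked numerically against their recurrences. The sketch below makes two assumptions that are not stated in the answers themselves: the biased game uses 20 markers in total, and the recurrence weights are 2/3 (toward n + 1) and 1/3 (toward n − 1). Both are inferred from the values c ≈ 60.00005722 and the −3n term.

```python
from fractions import Fraction

def fair_game_length(n, M):
    """Candidate closed form d_n = n(M - n) for the fair-coin game."""
    return n * (M - n)

# Boundary values and the recurrence
# d_n = (1/2)(d_{n+1} + 1) + (1/2)(d_{n-1} + 1) for one choice of M.
M = 12
assert fair_game_length(0, M) == 0 and fair_game_length(M, M) == 0
half = Fraction(1, 2)
for n in range(1, M):
    rhs = half * (fair_game_length(n + 1, M) + 1) + half * (fair_game_length(n - 1, M) + 1)
    assert rhs == fair_game_length(n, M)

# Biased game (assumed: 20 markers in all, weights 2/3 and 1/3).
M_biased = 20
c = 60 / (1 - Fraction(1, 2) ** M_biased)   # chosen so that d_20 = 0

def biased_game_length(n):
    return c * (1 - Fraction(1, 2) ** n) - 3 * n

for n in range(1, M_biased):
    rhs = (Fraction(2, 3) * (biased_game_length(n + 1) + 1)
           + Fraction(1, 3) * (biased_game_length(n - 1) + 1))
    assert rhs == biased_game_length(n)

longest_start = max(range(1, M_biased), key=biased_game_length)
print(float(c), float(biased_game_length(10)), longest_start)
```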


Section 6.6 Exercises 


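1. For each i with 1 ≤ i ≤ 7, State i will be the game where Hank has i − 1 markers and Ted has 7 − i markers.

\[ M = \begin{bmatrix}
1 & 0 & 0 & 0 & 0 & 0 & 0 \\
2/3 & 0 & 1/3 & 0 & 0 & 0 & 0 \\
0 & 2/3 & 0 & 1/3 & 0 & 0 & 0 \\
0 & 0 & 2/3 & 0 & 1/3 & 0 & 0 \\
0 & 0 & 0 & 2/3 & 0 & 1/3 & 0 \\
0 & 0 & 0 & 0 & 2/3 & 0 & 1/3 \\
0 & 0 & 0 & 0 & 0 & 0 & 1
\end{bmatrix} \]

3. Define states as follows:

\[ \begin{bmatrix}
1 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\
2/3 & 0 & 1/3 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 2/3 & 0 & 1/3 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 2/3 & 0 & 1/3 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 2/3 & 0 & 1/3 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 2/3 & 0 & 1/3 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 2/3 & 0 & 1/3 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 2/3 & 0 & 1/3 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1
\end{bmatrix} \]

5. States 1 through 6 will refer to the game piece being on squares A through F, respectively. The transition matrix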


0 14 1/2 1/4 0 0 


1⁄4 0 1/4 1⁄4 1/4 0 
paf tnt 0 1214 0 
=l14 0 0 1/4 1/4 1/4 
0 0 0 1⁄4 0 3A 
9 5 m S f I 











7. Answers vary but should support Exercises 12 and 24.
11. Answers vary but should support Exercise 28.
13. The probability of going from State 5 to State 1 in no more than 16 moves is approximately 0.0642, since this is the entry in Row 5, Column 1 of the matrix M^16.
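Reading a multi-move probability off a power of the transition matrix can be sketched as follows. The matrix used here is only an illustration (a 7-state chain with absorbing end states and one-step probabilities 2/3 and 1/3); it is an assumption for the example and is not claimed to be the matrix this particular exercise refers to.

```python
from fractions import Fraction

def mat_mul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def mat_pow(M, k):
    n = len(M)
    result = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    for _ in range(k):
        result = mat_mul(result, M)
    return result

# Illustrative 7-state chain: states 1 and 7 absorb; from any other state
# the game moves down one state with probability 2/3 and up one with 1/3.
down, up = Fraction(2, 3), Fraction(1, 3)
M = [[Fraction(0)] * 7 for _ in range(7)]
M[0][0] = M[6][6] = Fraction(1)
for s in range(1, 6):
    M[s][s - 1] = down
    M[s][s + 1] = up

M16 = mat_pow(M, 16)
# Row 5, Column 1 in 1-based numbering is M16[4][0] with 0-based indices.
print(float(M16[4][0]))
```

Because the end states are absorbing, the entry for an absorbing column counts every game that has reached that state at any time up to the given number of moves, which is why it answers a "no more than k moves" question.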
15. Using the following fifteen states of this game (with A-B reflecting the series score), the matrix N for the transient states (1, 2, 3, 5, 6, 7, 9, 10 and 11) is shown below.


Is, 








614 1: 22 
eal 12 32 
kil 1x 03 
*02 1413 

We 12 18 23 
07101000020 
901010000 
oodo ooo 
o0o00f0} 00 
N=jo00000} 040 
0000000901 
000000010 
900000001] 
000002002046 


We can add 1 to the sum of the entries in the first row (corresponding to those games beginning in state 1) of N + N^2 + N^3 + N^4 + N^5, giving us 33/8 = 4.125 games expected in this series.
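The "add 1 to the first-row sum" computation can be reproduced with a short script. This sketch assumes a series that ends when one side has won three games and labels the undecided scores as pairs (a, b); that labeling is hypothetical and need not match the state numbering in the answer above.

```python
from fractions import Fraction

def expected_games(p, wins_needed=3):
    """1 + (first-row sum of N + N^2 + ...), where N holds the one-game
    transition probabilities among the undecided scores (a, b)."""
    q = 1 - p
    states = [(a, b) for a in range(wins_needed) for b in range(wins_needed)]
    index = {s: i for i, s in enumerate(states)}
    size = len(states)
    N = [[Fraction(0)] * size for _ in range(size)]
    for (a, b), i in index.items():
        if (a + 1, b) in index:
            N[i][index[(a + 1, b)]] = p
        if (a, b + 1) in index:
            N[i][index[(a, b + 1)]] = q
    # Start at 0-0, multiply by N repeatedly, and add up the remaining mass.
    row = [Fraction(0)] * size
    row[index[(0, 0)]] = Fraction(1)
    total = Fraction(1)
    for _ in range(2 * wins_needed - 1):
        row = [sum(row[i] * N[i][j] for i in range(size)) for j in range(size)]
        total += sum(row)
    return total

print(expected_games(Fraction(1, 2)))
```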
Emulating Exercise 15, there are 104/27 = 3.85 games 
Proof by induction. The first statement is, “A! = 
[ 13 AA Which is true by the given definition of 
M. Let m > 2 be given such that the first statement not 
yet proven is the one involving M". In particular, the 
previous statement, 


mi far o 
M -[ 0 aay] 


17, 
n. 


has already been checked to be true. In this case, we 
know 


amm o 
n 9 aa 


o 
aa 
Which verifies the next statement, completing the induc- 
tion, " 
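21. Proof by induction. The first statement is, "$M^{1} = \begin{bmatrix} 0 & 1/2 \\ 1/2 & 0 \end{bmatrix}$ and $M^{2} = \begin{bmatrix} 1/4 & 0 \\ 0 & 1/4 \end{bmatrix}$," which is true by the given definition of M and the computation

\[ M^{2} = M \cdot M = \begin{bmatrix} 0 & 1/2 \\ 1/2 & 0 \end{bmatrix} \begin{bmatrix} 0 & 1/2 \\ 1/2 & 0 \end{bmatrix} = \begin{bmatrix} (1/2)(1/2) & 0 \\ 0 & (1/2)(1/2) \end{bmatrix} = \begin{bmatrix} 1/4 & 0 \\ 0 & 1/4 \end{bmatrix} \]

Let m ≥ 2 be given such that the first statement not yet proven is the one involving $M^{2m-1}$ and $M^{2m}$. From this we know

\[ M^{2m-1} = M \cdot M^{2m-2} = \begin{bmatrix} 0 & 1/2 \\ 1/2 & 0 \end{bmatrix} \begin{bmatrix} (1/4)^{m-1} & 0 \\ 0 & (1/4)^{m-1} \end{bmatrix} = \begin{bmatrix} 0 & (1/2)(1/4)^{m-1} \\ (1/2)(1/4)^{m-1} & 0 \end{bmatrix} = \begin{bmatrix} 0 & 2(1/4)^{m} \\ 2(1/4)^{m} & 0 \end{bmatrix} \]

and so

\[ M^{2m} = M \cdot M^{2m-1} = \begin{bmatrix} 0 & 1/2 \\ 1/2 & 0 \end{bmatrix} \begin{bmatrix} 0 & 2(1/4)^{m} \\ 2(1/4)^{m} & 0 \end{bmatrix} = \begin{bmatrix} (1/4)^{m} & 0 \\ 0 & (1/4)^{m} \end{bmatrix} \]

which verifies the next statement, completing the induction.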
Proof by induction. Let M be an n × n transition matrix reflecting the one-move transition probabilities for States 1 through n of a game, and let P(k) be the statement, "The entry in Row i, Column j of the matrix $M^k$ is the probability of the game moving from State i to State j in k moves." The statement P(1) is true by the definition of the transition matrix M. Let m ≥ 2 be given such that all plays of the game consisting of less than m moves have been checked, and we are now considering m moves of the game. Let States i and j be given, and let's compute the probability that the game goes from State i to State j in exactly m moves.

In order to go from State i to State j, the game must go from State i to some State $\ell$ in m − 1 moves and then from that State $\ell$ to State j in one move. By the induction hypothesis, the probability of going from State i to State $\ell$ in m − 1 moves is $(M^{m-1})_{i,\ell}$, the entry in Row i, Column $\ell$ of $M^{m-1}$. The probability of going from State $\ell$ to State j in one move is $M_{\ell,j}$.

Hence by the product rule, the probability of going from State i to State $\ell$ in m − 1 moves followed by one move to State j is

\[ (M^{m-1})_{i,\ell} \cdot M_{\ell,j} \]

By the sum rule, we add all possible cases of which state is State $\ell$, to get the probability of going from State i to State j in m moves as

\[ \sum_{\ell=1}^{n} (M^{m-1})_{i,\ell} \cdot M_{\ell,j} \]

which is precisely the entry in Row i, Column j of $M^{m-1} \cdot M = M^{m}$.

This completes the induction step.


Section 7.1 Exercises 


1. (a) Nine nodes and eleven edges; (c) The sum of the degrees is 22, which is twice the number of edges; (d) ABDA, DEFIHGD, FIHF, DEFHGD

4. (a) 12346781 and 12643781; (c) 3487645; (e) 28732

S. (a) deg(l) = 4, deg(2] 
deg(5) =6, deg(6) = 6, deg(7) 








4, deg(10) = 4; (c) One possibilityis 1,3, 5, 2, 6, 10. 5.7.9. 


61 
7. (a) Here are two possibilities:

[Figure: two example graphs]

(d) If there are 7 nodes in a simple graph, the largest degree is six. (e) There cannot be an odd number of vertices of odd degree.

8. (a) Add edges [2, 7] and [3, 6], and parallel edge [4, 8]; (c) Add edges [1, 7], [2, 8], [3, 9], [4, 10], [5, 6]

9. (a) 3, 4, 8, 2, 6, 4, 5, 6, 7, 8, 1, 2, 3, 7 is an Eulerian trail. (c) 1, 2, 3, 4, 5, 1, 6, 2, 7, 3, 8, 4, 9, 5, 10, 6, 7, 8, 9, 10, 1 is an Eulerian circuit.
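A node sequence like the two above can be sanity-checked mechanically. The sketch below treats the edges used by the walk as the whole graph and assumes no parallel edges along the walk, since the exercise's figures are not reproduced here; it reports whether the edges are distinct, whether the walk is closed, and which nodes end up with odd degree.

```python
from collections import Counter

def trail_summary(walk):
    """Return (edges_are_distinct, is_closed, odd_degree_nodes) for a walk,
    treating the walk's own edges as the whole undirected graph."""
    edges = [frozenset(pair) for pair in zip(walk, walk[1:])]
    distinct = len(edges) == len(set(edges))
    degree = Counter(node for edge in edges for node in edge)
    odd = sorted(node for node, d in degree.items() if d % 2 == 1)
    return distinct, walk[0] == walk[-1], odd

trail_9a = [3, 4, 8, 2, 6, 4, 5, 6, 7, 8, 1, 2, 3, 7]
circuit_9c = [1, 2, 3, 4, 5, 1, 6, 2, 7, 3, 8, 4, 9, 5, 10, 6, 7, 8, 9, 10, 1]
print(trail_summary(trail_9a))
print(trail_summary(circuit_9c))
```

An open trail should leave exactly its two endpoints with odd degree, and a circuit should leave none.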

11. In this graph, there is an Eulerian trail since the "In" and "Out" nodes are the only ones with odd degree.











25. There are approximately 13.21 moves expected in this game.

27. There are approximately 8.33 moves expected in this game.

29. We use the matrix

\[ N = [5/6] \]

corresponding to the transient states from the matrix given in Exercise 6, and compute

\[ N + N^{2} + N^{3} + \cdots \approx [5.0] \]

Adding 1 to the sum of the entries in the first row (corresponding to games beginning in State 1) gives us approximately 6.0 moves expected in this game.


—— hallway 


out 


12. (a) There is only one connected graph with 0 edges with every vertex having even degree; namely, a single node with no edges. The trivial circuit consisting of just this node is an Eulerian circuit. (c) Here are both connected graphs with 2 edges with every vertex having even degree. The graph on the left has the Eulerian

A and the graph on the right has the 

Evian coco lb e 


Yes, it is possible. This will be possible in any connected graph with no more than four nodes of odd degree.
(2)2:()2 

Define the graph G having nodes labeled 0, 1, 2, 3, 4, 5, 6 (for the number of spots on the dominoes) and an edge for each domino with endpoints reflecting the number of spots on that domino. In this graph, all nodes have degree 6. When one domino (edge) is removed, the nodes labeled with the two numbers on that domino have degree 5. The resulting graph has an Eulerian path starting and ending at these two nodes of odd degree. The trick works because of this.
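The degree argument can be checked directly. This sketch assumes the doubles are left out of the graph (a double would be a loop, which adds 2 to a degree and so never changes its parity); the function name is illustrative.

```python
from itertools import combinations
from collections import Counter

def odd_nodes_after_removing(domino):
    """Build the graph on nodes 0..6 with one edge per non-double domino,
    remove the given domino, and return the nodes left with odd degree."""
    edges = set(combinations(range(7), 2))
    edges.remove(tuple(sorted(domino)))
    degree = Counter(node for edge in edges for node in edge)
    return sorted(node for node in range(7) if degree[node] % 2 == 1)

print(odd_nodes_after_removing((5, 2)))
```

Whichever domino is removed, the two odd-degree nodes are exactly its two numbers, so those are the ends of the resulting chain.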


Section 7.2 Exercises 
1. Refer to the three graphs in the figure below. 





(9 The walk We. cd tt 

(Ere est dee toy eine 

(0 Tem age tvi epee sent 
pum 


2 The three blanks should be filled im as follows: 
Gis connected: W itself: W with the edge [os i, ] 
replaced by the walk to, m. nup 00 

$ (9) 124: ©) 9.3.2.4 (e) 35.489.327 

3287 

Let G be a simple graph with n vertices, where n ≥ 2. Since G is simple, the degree of each vertex must be no more than n − 1. Moreover, if there is a vertex of degree n − 1, then that vertex is adjacent to every other vertex in the graph, making it impossible for there to also be a degree 0 vertex. Create boxes labeled 1 to n − 2 and another box labeled "0 or n − 1," and assign each vertex of the graph to a box based on its degree. Since there are n − 1 boxes and n vertices, the Pigeonhole Principle tells us that at least one box contains two vertices. These two vertices must have the same degree.
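For small n the claim can also be confirmed by brute force over every simple graph on n labeled vertices. This is only a sanity check of the statement, not a substitute for the pigeonhole argument.

```python
from itertools import combinations

def every_simple_graph_repeats_a_degree(n):
    """Check all 2^(n choose 2) simple graphs on n labeled vertices."""
    possible_edges = list(combinations(range(n), 2))
    for mask in range(1 << len(possible_edges)):
        degree = [0] * n
        for bit, (u, v) in enumerate(possible_edges):
            if mask >> bit & 1:
                degree[u] += 1
                degree[v] += 1
        if len(set(degree)) == n:      # all degrees different
            return False
    return True

print([every_simple_graph_repeats_a_degree(n) for n in range(2, 6)])
```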

Let G be a graph on n nodes, and let P be a path in G. Since a path cannot use the same node twice, P contains no more than n nodes. Since for every path the number of edges is one less than the number of nodes, the number of edges in P cannot be more than n − 1.

10. Hint: Consider the contrapositive statement, "For every connected graph G, if there exists a pair of vertices a, b in G with two (or more) paths from a to b, then G has at least one cycle."

11. Let G be a tree with at least one edge, let $e = [v_0, v_1]$ denote the deleted edge, and let G' denote the resulting graph. By the previous exercise, the only path from $v_0$ to $v_1$ in G is the one consisting of the edge e. Thus in G' there is no path from $v_0$ to $v_1$, and hence no walk from $v_0$ to $v_1$.

Define $H_0 = \{v \in V \mid \text{there is a walk from } v \text{ to } v_0 \text{ in } G'\}$ and $H_1 = \{v \in V \mid \text{there is a walk from } v \text{ to } v_1 \text{ in } G'\}$. Since for every vertex v in G there is a walk in G from v to $v_0$ and a walk in G from v to $v_1$, it follows that $H_0$ and $H_1$ are each connected and that every vertex is in one of the two sets. Also note that no vertex can be in both sets since otherwise we could build a path from $v_0$ to $v_1$ in G'. It follows that $H_0$ and $H_1$ are the connected components of G'.

19. A graph like this would have 10 vertices and (3 + 3 + 2 + 2 + 1 + 1 + 1 + 1 + 1 + 1)/2 = 8 edges. By Exercise 18, a simple graph like this cannot be connected.

21. C(n − 1, 2) = (n − 1)(n − 2)/2 edges

23. The connected graph has exactly two nodes of odd degree, and these two nodes have an edge connecting them.

34 The blanks should be filled in as follows: 
Any tree G with 0 edges has one node. The graph © 
Bas one vertex and no edges. Proposition 4: 7—1 
mnm Sthat G m= 1 

17. All relevant graphs with no edges, one edge, two edges or three edges are given in the figure below.











20. Proof by induction. Let P(k) stand for, "A simple graph with n vertices, k connected components, and no cycles has n − k edges." The first statement P(1) is true by Theorem 7 since if k = 1, such a graph is connected and therefore a tree.

Now let m > 1 be given such that statements P(1), ..., P(m − 1) have been checked. Let G be a graph with n vertices, m connected components, and no cycles. Choose any component H, and let G' be the graph with H removed. Let $n_1$ be the number of vertices in H and let $n_2$ be the number of vertices in G'. Observe that $n_1 + n_2 = n$.

By P(1), the component H has $n_1 - 1$ edges, and by statement P(m − 1), the graph G' has $n_2 - (m - 1)$ edges. Thus G has a total of

\[ (n_1 - 1) + (n_2 - (m - 1)) = n_1 + n_2 - m = n - m \]

edges. This establishes statement P(m), and the result follows by induction.
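The statement "n vertices, k components, no cycles implies n − k edges" can be spot-checked on randomly generated forests. The sketch below builds a forest with a union-find structure; the helper name and parameters are illustrative only.

```python
import random

def random_forest(n, attempts, seed=0):
    """Add random edges to n isolated vertices, skipping any edge that would
    close a cycle (union-find), and return (edge_count, component_count)."""
    rng = random.Random(seed)
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]   # path halving
            x = parent[x]
        return x

    edges = 0
    for _ in range(attempts):
        u, v = rng.sample(range(n), 2)
        ru, rv = find(u), find(v)
        if ru != rv:            # joining two components never creates a cycle
            parent[ru] = rv
            edges += 1
    components = len({find(x) for x in range(n)})
    return edges, components

for attempts in (0, 5, 20, 200):
    e, k = random_forest(30, attempts, seed=attempts)
    print(attempts, e, k, e == 30 - k)
```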

21. The blanks should be filled in as follows: Exercise 20; 

23. (a) There is only one, with edges [a, d], [a, c], [a, b] and a total weight of 6. (c) There is only one, with edges [a, b], [a, d], [a, e], [c, e] and a total weight of 18.

25. (a) We list the edges in the order [a, d], [a, c], [a, b], [b, d], [c, d], [b, c]. Edges are added at steps 1, 2 and 3, giving the minimal spanning tree with edges [a, d], [a, c], [a, b] and total weight 6. (c) The algorithm adds [a, b] and [a, d] in the first two steps, and [a, e] and [c, e] in the next two steps.
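The edge-by-edge process described in 25(a) can be sketched with a small implementation of Kruskal's algorithm. The weights below are hypothetical: they are chosen only so that the sorted order matches the order listed in the answer and the tree weight comes out to 6, since the exercise's actual weights are not reproduced here.

```python
def kruskal(nodes, weighted_edges):
    """Return (tree_edges, total_weight) using Kruskal's algorithm."""
    parent = {v: v for v in nodes}

    def find(v):
        while parent[v] != v:
            v = parent[v]
        return v

    tree, total = [], 0
    for weight, u, v in sorted(weighted_edges):
        ru, rv = find(u), find(v)
        if ru != rv:                 # edge joins two components: keep it
            parent[ru] = rv
            tree.append((u, v))
            total += weight
    return tree, total

# Hypothetical weights (an assumption for illustration only).
edges = [(1, "a", "d"), (2, "a", "c"), (3, "a", "b"),
         (4, "b", "d"), (5, "c", "d"), (6, "b", "c")]
tree, total = kruskal("abcd", edges)
print(tree, total)
```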

26. Outline of the proof. Let P(n) be the statement, "$T_n$ is included in a minimal spanning tree of G." In the induction step, we know from the algorithm that $T_m$ is formed by adding the edge $e_m$ to the tree $T_{m-1}$. By the inductive hypothesis P(m − 1), we know that $T_{m-1}$ is a subgraph of a minimal spanning tree of G. Call this minimal spanning tree T. The edge $e_m$ either is in T or it is not, so you can argue by the following cases: Case 1. If the edge $e_m$ is actually in T, then ...; Case 2. If $e_m$ is not an edge in T, then form the new graph H by adding $e_m$ to T. Graph H will have a cycle so we can let f be the edge on this cycle with the smallest weight. Show that the weight of f must be equal to the weight of $e_m$, and conclude that the tree obtained by removing f from H is a minimal spanning tree of G including $T_m$.

The blanks should be filled in as follows: z+; 

2(n+ 1) = 2n + 2; 2n; 2n 4+2 > 2n 





























n. 


Section 7.3 Exercises 


1, (a) Graph Gz has a nodes of degree 4 and graph Gy 
does not, 
(b) Graph H; hasa cycle of length S and graph Hs does 
not. 

3. If we write down the degrees of each node, we find that graph (B) has two nodes of degree 2, and the others have only one. Hence (B) is not isomorphic to any of the others. Since all the graphs have a unique node of degree 1, these would have to correspond to each other under an isomorphism. Note that the degree 1 node in graph (D) is adjacent to a degree 3 node, while in graphs (A) and (C), the degree 1 node is adjacent to a degree







29. Using Kruskal's algorithm, we add the edges in this order: ae, fj, ld, be, bf, gj, ij, im, H., ab, hi, jn, ad. The total cost is 3($100) + 6($200) + 3($300) + $400 = $2800.

31. (a) One solution is to remove cd from acda, then bd from abda, and finally bc from abca. This leaves the spanning tree ab, ac, ad with a total weight of 6. (c) One solution is to remove bc from abca, then de from cdec, then cd from acda, then ac from acea, then bd from abda, and finally be from abea. This leaves the spanning tree ab, ad, ae, ce with a total weight of 18.

33. Let P(n) denote the statement, "In the algorithm of Exercise 28, graph $G_n$ contains a minimal spanning tree of G." Since $G_0$ is G and G is given as being connected, P(0) is true. Now let m ≥ 1 be given such that statements P(0), ..., P(m − 1) have all been checked to be true. Recall that graph $G_m$ is formed from $G_{m-1}$ by removing the most expensive edge e from some cycle C in $G_{m-1}$. By statement P(m − 1), graph $G_{m-1}$ contains a spanning tree T that is a minimal spanning tree for G. Since trees don't have cycles, the cycle C must include some edge f that is not in T. There are two cases to consider:

Case 1. If e = f, then all edges of T are included in $G_m$, so $G_m$ contains the minimal spanning tree T of G.

Case 2. If e ≠ f, then edge e must weigh at least as much as edge f (since the algorithm deleted e to form $G_m$), so the new tree T' formed by adding f to T and removing e will weigh no more overall than T. Hence T' is a different minimal spanning tree for G, and T' is contained in $G_m$.

Since in either case, $G_m$ contains a minimal spanning tree for G, this establishes statement P(m), completing the induction.











4 node. Thus (D) is not isomorphic to (A) or (C). This leaves (A) and (C) as the only candidates. To see the isomorphism, imagine moving the bottom left node in (C) halfway toward the bottom right node, then dragging the node in the middle in (C) down to the bottom left.

5. The graph on the left has no cycles of length 3 while the graph on the right does. Another difference is that the graph on the left is $K_{3,3}$ so it is non-planar, while the graph on the right has no edge-crossings.








A 


7. The three graphs are $K_{3,3}$ and the two graphs shown in the figure below.

[Figure: graphs (B) and (C)]

All cycles in $K_{3,3}$ have an even number of edges. Graphs (B) and (C) contain cycles of length 3, so neither is isomorphic to $K_{3,3}$. Graph (B) has the property that every edge is on some cycle of length 3. In graph (C), edge [a, b] is not on such a cycle, hence graphs (B) and (C) are not isomorphic.














4] 
10, (9) 


Section 7.4 Exercises


01011 
00000 
w}oo11 of; 
01000 
00010 


0101 
911] 

oroo 
100 


La 


000 
10000 
@/00000 


1 
100 







11. (a) Suppose (A) is planar. (A) has 8 vertices, 24 edges, every edge is on a cycle, and the smallest cycle is length 3. By Euler's formula, (A) has exactly 24 + 2 − 8 = 18 faces. By Theorem 7, the number of edges is at least 3/2 times the number of faces, that is 24 ≥ (3/2)(18), or 24 ≥ 27. Since this is a contradiction, we conclude that (A) is not planar. (c) Suppose (C) is planar. (C) has 10 vertices, 15 edges, every edge is on a cycle, and the smallest cycle is length 5. By Euler's formula, (C) has exactly 15 + 2 − 10 = 7 faces. By Theorem 7, the number of edges is at least 5/2 times the number of faces, that is 15 ≥ (5/2)(7), or 15 ≥ 17.5. Since this is a contradiction, we conclude that (C) is not planar.
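The face-counting argument follows a fixed recipe, sketched below. It assumes the graph is connected, that every edge lies on a cycle, and that the bound being used has the form "edges ≥ (shortest cycle length / 2) × faces"; the function name is illustrative.

```python
from fractions import Fraction

def planar_face_check(vertices, edges, shortest_cycle):
    """Assume the graph is connected and planar, get the face count from
    Euler's formula n + F = m + 2, and test the bound m >= (g/2) * F."""
    faces = edges + 2 - vertices
    bound = Fraction(shortest_cycle, 2) * faces
    return faces, bound, edges >= bound

print(planar_face_check(8, 24, 3))    # graph with 8 vertices, 24 edges, triangles
print(planar_face_check(10, 15, 5))   # graph with 10 vertices, 15 edges, 5-cycles
print(planar_face_check(4, 6, 3))     # K4, which is planar
```

A `False` in the last position means the planarity assumption leads to a contradiction; a `True` does not by itself prove that a graph is planar.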

12. (a) This graph is non-planar because it contains $K_{3,3}$. Every node in the set {1, 2, 6} is connected to every node in {3, 4, 5}. The additional edges [4, 5] and [3, 4] only make it worse.

13. (a) For the given illustration, V 

and 12-8 = 18.2, In gener IF = 

A+2:2k+ (4-2) =3k-+2. (e) For the given illus 
8BF=5 and $4+5=842. In 
2K Fe Kee e (o 























18. Proof. By Theorem 7, m ≥ (3/2)F, hence F ≤ (2/3)m. By Euler's formula, n + F = m + 2. So m + 2 = n + F ≤ n + (2/3)m. Solving the inequality m + 2 ≤ n + (2/3)m, we get m ≤ 3n − 6.

In the figure below, the dashed line indicates a line on the opposite side of the surface. Each of the houses A, B and C are connected to every utility 1, 2 and 3.





n, 











3. (a) The degree of the i-th node is the sum of the i-th row.
(b) The degree of the i-th node is the sum of the i-th row plus the value of the (i, i) entry. (This means the (i, i) entry is counted twice.)

4. (a) The outdegree for the i-th node is the sum of the i-th row. The indegree is the sum of the i-th column.







$. (a) The matrices for R and RoR, respectively, are 


(aM 





nice moe 
ses-e- 
se-e-e 


M = 


EEE 
"ueocc- 


M = 


UNE NEN oSm am 


4 

3 
wee =| 
H 
i 


pra 
tuunan 


So there are three walks of length 3 or less from node 1 
to node 6. They are 146, 1346 and 1456. 

6 (aC, 3) (1, 9; 2. 1). 2.5), 3.2). 6, 4), (4.2), (4.5). 
(4.6). (5. 1). (5. 3). (5. 6). (6. 1). (6.2). (6.3) 


111011 
101111 
co PNP PEE 

(a) The matrices are || 1 10 qp 1 [> 
Lae tei A 
133379 1-1. 

000111 
000011 
000001 
000000 
000000 
000000 
000010 
000000 
amad 00000 
000000 
000000 
000000 


(b) For the first graph, let A = {1, 2, 3, 4, 5, 6}. Then R ∘ R ∘ R is (A × A) − {(1, 4)}. For the second graph R ∘ R ∘ R = {(1, 4), (1, 5), (1, 6), (2, 5), (2, 6), (3, 6)}. For the third graph, R ∘ R ∘ R = {(1, 5)}.











shown below, followed by the graphs of each. 


o 
o 
o 
o 
o 
o 


ecco+c 
oo-2oco 





o arm 


9. (a) It determines all pairs of nodes between which there is a walk of length 5 or less. (c) $M^{(1)} \vee M^{(2)} \vee M^{(3)} \vee M^{(4)} \vee M^{(5)} \vee M^{(6)} \vee M^{(7)} \vee M^{(8)} \vee M^{(9)}$

10. (a) 2 · 5 = 10; (c) 1 · 2 = 2; (e) 30

|. (a) There are ay; + bxy walks from 6 to 3 that begin by 
going to node 1, a bss that begin by going to node 2, 
and so on, for a total of Y, as, + bia € Y, iy + by 

12. (a) For every pair of integers i and j, the Row i, Column j entry of $M^1$ counts the number of 1-step walks from node i to node j.

(b) $M^1$ is the same as M. The Row i, Column j entry of M is the number of edges from node i to node j, and an edge is the same as a 1-step walk.

(c) For every pair of integers i and j, the Row i, Column j entry of $M^{k-1}$ counts the number of (k − 1)-step walks from node i to node j.

13. Let P(k) be the statement, "For every pair of integers i and j, the (i, j) entry of $M^k$ counts the number of k-step walks from node i to node j." In Exercise 12(b), we established P(1). Now let m ≥ 2 be given such that statements P(1), ..., P(m − 1) have already been established, and consider the next statement P(m). Let i and j be given, and let $w_t$ denote the number of the m-step walks from node i to node j whose first edge goes from node i to node t. Clearly there are $\sum_{t=1}^{n} w_t$ m-step walks from node i to node j, since every such walk must contain a first edge leading to exactly one of the nodes in the graph. Moreover by the product rule for counting, $w_t$ is the product of the number of 1-step walks from i to t and the number of (m − 1)-step walks from t to j. Using the induction hypothesis P(m − 1), this is the same as the product of the (i, t) element from M times the (t, j) element from $M^{m-1}$. In this case, $\sum_{t=1}^{n} w_t = \sum_{t=1}^{n} M_{i,t} \cdot (M^{m-1})_{t,j}$, which is precisely the (i, j) entry in $M^m$ by the definition of matrix multiplication. This establishes P(m), completing the induction proof.
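The statement proved here can be tested on a small example by comparing matrix powers with an explicit count of walks. The 4-node directed multigraph below is made up for illustration; it does not come from the exercises.

```python
from itertools import product

def mat_mul(A, B):
    n = len(A)
    return [[sum(A[i][t] * B[t][j] for t in range(n)) for j in range(n)] for i in range(n)]

def count_walks_brute_force(M, i, j, k):
    """Count k-step walks from i to j by listing every intermediate node sequence."""
    n = len(M)
    total = 0
    for middle in product(range(n), repeat=k - 1):
        nodes = (i, *middle, j)
        ways = 1
        for a, b in zip(nodes, nodes[1:]):
            ways *= M[a][b]          # number of edges a -> b (0 if none)
        total += ways
    return total

# A small directed multigraph chosen for illustration (entry = number of edges).
M = [[0, 1, 1, 0],
     [0, 0, 2, 1],
     [1, 0, 0, 1],
     [0, 1, 0, 1]]

power = M
for k in range(2, 5):
    power = mat_mul(power, M)
    assert all(power[i][j] == count_walks_brute_force(M, i, j, k)
               for i in range(4) for j in range(4))
print(power)
```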

15. In each graph, an edge without arrows indicates an edge in both directions.


el^ 
1000 = * 
ee 09) 
9 Iq 4.9.0 





16, (a) For the M in part (a) of Exercise 15, / < M is ob- 
viously true. (e) For the M in part (c) of Exercise 15, 
T £ M since my, = m3 = ma = mus = 0. (€) For the 
M in part (c) of Exercise 15, / £ M since mas = 0. 

17. For every subscript t, $I_{t,t} = 1$. Therefore the (t, t) entry of (I ∨ M) is always 1, and hence I ≤ (I ∨ M). Therefore, I ∨ M is reflexive.

18. (a) This relation is the same as R in Exercise 15(a).
(b) This relation is {(1, 1), (2, 2), (3, 3), (4, 4), (2, 3), (3, 2), (4, 1)}. The matrix and graph are shown below.

[Figure: matrix and graph]

(c) This relation is {(1, 1), (2, 2), (3, 3), (4, 4), (1, 2), (2, 3), (1, 3), (3, 1), (3, 2), (2, 1)}. The matrix and graph are shown below.

[Figure: matrix and graph]








a 


19. (a) $M = M^T$, so this relation is symmetric.
(b) $M^T \ne M$, since $(M^T)_{1,4} = 1$ but $M_{1,4} = 0$.
(c) $M = M^T$, so this relation is symmetric.

21, For parts (3).(c) (c). and (g) the relations (and hence 
the graphs and matrices) are the same as in Exercise 
(b) This relation is {(1, 1), (1, 4), (2, 3), (3, 2), (4, 1), (4, 4)}. The matrix and graph are shown below.

\[ \begin{bmatrix} 1 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 \\ 1 & 0 & 0 & 1 \end{bmatrix} \]

1000] [1000 

0100| J0100 
2/5 9 1 0/210 0 10/5 

0001]| [0001 

1000] [1000 

0100|,|0010 5 
&|o o :0|f|o 1 0 o] since 2&3 and 

1001 001 

3R2, but not 282. 


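23. (a) {(1, 2), (1, 3), (1, 4), (1, 5), (1, 6), (2, 3), (2, 4), (2, 5), (2, 6), (3, 4), (3, 5), (3, 6), (4, 5), (4, 6), (5, 6)}
(b) The set in the part (a) solution is the transitive closure.
(c)
\[ M = \begin{bmatrix}
0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 0 & 1 & 0 \\
0 & 0 & 0 & 0 & 0 & 1 \\
0 & 0 & 0 & 0 & 0 & 0
\end{bmatrix} \]
(d)
\[ M^{(1)} \vee M^{(2)} \vee M^{(3)} \vee \cdots = \begin{bmatrix}
0 & 1 & 1 & 1 & 1 & 1 \\
0 & 0 & 1 & 1 & 1 & 1 \\
0 & 0 & 0 & 1 & 1 & 1 \\
0 & 0 & 0 & 0 & 1 & 1 \\
0 & 0 & 0 & 0 & 0 & 1 \\
0 & 0 & 0 & 0 & 0 & 0
\end{bmatrix} \]
This matches the relation in part (b).

25. (a) The relation is already transitive, so the transitive closure is the same relation.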
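A transitive closure can be computed by joining Boolean matrix powers, as in the following sketch. It uses the chain relation 1 → 2 → ... → 6 as its example; the helper names are illustrative, and stopping at the n-th power is a standard sufficient bound rather than something stated in the answers.

```python
def bool_product(A, B):
    n = len(A)
    return [[int(any(A[i][t] and B[t][j] for t in range(n))) for j in range(n)]
            for i in range(n)]

def transitive_closure(M):
    """Join (entrywise OR) of the Boolean powers M^(1), ..., M^(n)."""
    n = len(M)
    closure = [row[:] for row in M]
    power = [row[:] for row in M]
    for _ in range(n - 1):
        power = bool_product(power, M)
        closure = [[closure[i][j] | power[i][j] for j in range(n)] for i in range(n)]
    return closure

# Chain 1 -> 2 -> ... -> 6: ones just above the main diagonal.
M = [[int(j == i + 1) for j in range(6)] for i in range(6)]
closure = transitive_closure(M)
pairs = {(i + 1, j + 1) for i in range(6) for j in range(6) if closure[i][j]}
print(sorted(pairs))
```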


Section 7.5 Exercises 


1. A solution corresponds to a path from 8,0,0 to 4,4,0 in the graph below. Note that in the graph, every node within the square also has edges pointing to two of the four corners of the square, but we have left these edges out for clarity. Note that the puzzle has two solutions, one slightly shorter than the other.
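Finding a path in the state graph is a natural job for breadth-first search. The sketch below assumes the containers hold 8, 5 and 3 units, which is inferred from the start and goal states rather than stated in this answer; the function name is illustrative.

```python
from collections import deque

def shortest_pours(capacities, start, goal):
    """Breadth-first search over container states; one move pours from one
    container into another until the source is empty or the target is full."""
    previous = {start: None}
    queue = deque([start])
    while queue:
        state = queue.popleft()
        if state == goal:
            path = []
            while state is not None:
                path.append(state)
                state = previous[state]
            return path[::-1]
        for src in range(len(state)):
            for dst in range(len(state)):
                amount = min(state[src], capacities[dst] - state[dst])
                if src == dst or amount == 0:
                    continue
                nxt = list(state)
                nxt[src] -= amount
                nxt[dst] += amount
                nxt = tuple(nxt)
                if nxt not in previous:
                    previous[nxt] = state
                    queue.append(nxt)
    return None

path = shortest_pours((8, 5, 3), (8, 0, 0), (4, 4, 0))
print(len(path) - 1, path)
```

Breadth-first search returns a shortest sequence of pours; a `None` result would mean the goal state cannot be reached at all.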








E 





5, Cubes 1,2, 3 and 4 are shown left-to- 


(c) This relation is {(1, 1), (1, 2), (1, 3), (2, 1), (2, 2), (2, 3), (3, 1), (3, 2), (3, 3)}. The matrix and graph are shown below.
















3. The graph model below shows that there is no path from (10,0,0) to a node labeled (5,5,0). Since the only transitions involve even numbers of quarts and the beginning state has an even number of quarts, we can never measure any odd number of quarts with these containers.






























7. (a) The figure below on the left shows the graph of this puzzle, and the graphs on the right show the two good subgraphs. Hence this puzzle can be solved.

(b) See the figure below. We can argue that the puzzle has no solution as follows. If there are two good subgraphs then one will have to include the loop from cube 4 at vertex R while the other must include the edges [W, R] from cube 3 and [R, G] from cube 1. Let's focus on the first of these. The only ways for this subgraph to have every node of degree 2 is to either have a B, G, W, B 3-cycle or a B, W, B 2-cycle along with a loop at G. The first of these is impossible since we would be forced to take the [B, W] edge from cube 1 (since cube 4 was already used with the loop at R) leaving us no possible edge [W, G]. The second is impossible since there are only two edges of the form [W, B] so we would need to use them both but one is labeled 4 which has already been used with the loop at R. Note that the latter subgraph originally described is possible, so there is a partial solution to this puzzle but not a complete solution.











9. The kernel is K = {(4, 7), (3, 5), (5, 3), (1, 2), (2, 1), (0, 0)}, so Player 1 has the winning strategy. On her first move, she should either remove 1 stone from the smaller pile or 4 stones from the larger pile.

11. (a) This game is identical to the stone removal game played in Exercise 9 but played on the grid of states (X, Y) where X and Y range from 0 to 8. Generalizing the process in that solution, we will have a kernel in this game of K = {(7, 4), (4, 7), (3, 5), (5, 3), (1, 2), (2, 1), (0, 0)}. Since the beginning position has the form (8, X) or (X, 8), it cannot be in the kernel, so Player 1 will have a winning strategy regardless of the queen's starting spot.
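A kernel like the one in 11(a) can be generated by working backward from (0, 0). The sketch below assumes queen-style moves: lower X by any amount, lower Y by any amount, or lower both by the same amount. The exercise's exact rules are not restated in this answer, so treat that move set as an assumption.

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def in_kernel(x, y):
    """True when the player about to move from (x, y) loses under best play.
    Assumed moves: lower x, lower y, or lower both by the same amount."""
    moves = [(x - k, y) for k in range(1, x + 1)]
    moves += [(x, y - k) for k in range(1, y + 1)]
    moves += [(x - k, y - k) for k in range(1, min(x, y) + 1)]
    # A position is in the kernel exactly when no move leads into the kernel.
    return not any(in_kernel(a, b) for a, b in moves)

kernel = {(x, y) for x in range(9) for y in range(9) if in_kernel(x, y)}
print(sorted(kernel))
```

Under these assumed rules no kernel position has a coordinate equal to 8, which is consistent with the observation that every starting square on the edge of the grid favors Player 1.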














Section 7.6 Exercises 


2. Answers will obviously vary, but they should be binary 
trees as long as only biological parents are listed. Here 
is one possibility 




(b) This is like a stone removal game combining the rules of Exercises 9 and 10 and played on the grid shown in the solution to part (a). We can follow the process as before to find that K = {(0, 0), (1, 3), (3, 1), (2, 6), (4, 5), (5, 4), (6, 2)}. Since the beginning position has the form (8, X) or (X, 8), it cannot be in the kernel, so Player 1 will have a winning strategy regardless of the queen's starting spot.

13. The game is already in a winning position. This means Player 1 will not have a winning strategy assuming that Player 2 has read this section.

        3 = 0 1 1
        4 = 1 0 0
        7 = 1 1 1
DirectSum = 0 0 0

15. One possibility is piles of 2, 8 and 10 as shown below, but any division of 20 pennies into three piles so that the direct sum is zero will give you the winning strategy.

       10 = 1 0 1 0
        8 = 1 0 0 0
        2 = 0 0 1 0
DirectSum = 0 0 0 0
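The column-by-column direct sum in these tables is the bitwise XOR of the pile sizes, so it is easy to compute and to search. The helper names below are illustrative, and the search assumes three nonempty piles.

```python
from functools import reduce
from operator import xor

def direct_sum(piles):
    """Bitwise XOR of the pile sizes (the column-by-column sum mod 2)."""
    return reduce(xor, piles, 0)

def zero_sum_splits(total):
    """All ways to split `total` into three nonempty piles a <= b <= c
    whose direct sum is 0."""
    return [(a, b, total - a - b)
            for a in range(1, total)
            for b in range(a, total)
            if b <= total - a - b and direct_sum((a, b, total - a - b)) == 0]

print(direct_sum([3, 4, 7]), direct_sum([10, 8, 2]))
print(zero_sum_splits(20))
```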






17. Player 1 should remove all 11 stones in the 8th pile. This will make the direct sum 0.
19. The complete kernel is
K = {000, 003, 006, 011, 014, 017, 022, 025, 030, 033,
036, 041, 044, 047, 052, 055, 101, 104, 107, 110,
113, 116, 131, 134, 137, 140, 143, 146, 202, 205,
220, 223, 226, 232, 235, 250, 2








21. The strategy for Player 2 to win is to mirror the moves of Player 1 using the 180° rotational symmetry of the clock face. That is, for each number Player 1 crosses out, Player 2 crosses out the number that is 6 hours later. On each move, Player 1 destroys the 180° symmetry of the picture and Player 2 restores it. Since the winning position has this symmetry, Player 2 must win.














[Figure: binary tree with Mom and Dad on the second level and Robert, Mary, Verna, Cecil on the third level]







4. Since a complete binary tree with height 3 has 2^3 − 1 = 7 nodes, we will need to use a tree with height 4 to store 8 values. The following diagram shows one of several possibilities:








torde: 





n w? 


7. (a) Prefix: +, 3, ×, 5, 4; Postfix: 3, 5, 4, ×, +
(©) Prefix: obs. 5, 7, 4, 2; Post 
4x2 

V) (5*4) 7) 359 

9% (0(6- (2x 3) x (24) =0 

10. Inorder: Robert, Mom, Mary, Me, Verna, Dad, Cecil;
Preorder: Me, Mom, Robert, Mary, Dad, Verna, Cecil;
Postorder: Robert, Mary, Mom, Verna, Cecil, Dad, Me
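The three orders can be reproduced with short recursive functions. The tuple representation of the tree below is an assumption made for the sketch; the tree itself (root Me, with subtrees for Mom and Dad) is read off from the traversal orders listed above.

```python
# Each node is (label, left_subtree, right_subtree); None marks an empty subtree.
def leaf(label):
    return (label, None, None)

tree = ("Me",
        ("Mom", leaf("Robert"), leaf("Mary")),
        ("Dad", leaf("Verna"), leaf("Cecil")))

def inorder(node):
    if node is None:
        return []
    label, left, right = node
    return inorder(left) + [label] + inorder(right)

def preorder(node):
    if node is None:
        return []
    label, left, right = node
    return [label] + preorder(left) + preorder(right)

def postorder(node):
    if node is None:
        return []
    label, left, right = node
    return postorder(left) + postorder(right) + [label]

for traversal in (inorder, preorder, postorder):
    print(traversal.__name__, traversal(tree))
```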

13. Proof by induction. Let P(n) be the statement, "The maximum number of nodes in a binary tree of height n is 2^n − 1." Statement P(1) refers to a binary tree of height 1, of which there is only one: a single root with empty left and right subtrees. Since such a tree has 2^1 − 1 = 1 node, we conclude that the first statement P(1) is true.

Now let m ≥ 2 be given such that statements P(1), P(2), ..., P(m − 1) have all been checked to be





Teme 





19. 














Section 7.7 Exercises 


1. 1, 2, 8, 7, 17, 16, 20, 13, 12, 11, 19, 18, 9, 10, 3, 4, 5, 14, 15, 6, 1

3. There are $ = 1,3 x 10^ different cycles to check. At 
a rate of 10! per second, it would take about 10^ sec- 
onds, 

S m22,n>2and m= 


6 
d 





true. Let T be a binary tree with height m. By definition of "height," each of the left subtree $T_L$ and right subtree $T_R$ of T have height less than or equal to m − 1. Letting $h_L$ and $h_R$ denote the respective heights of $T_L$ and $T_R$, we can cite statements $P(h_L)$ and $P(h_R)$, which tell us that tree $T_L$ has a maximum of $2^{h_L} - 1$ nodes and tree $T_R$ has a maximum of $2^{h_R} - 1$ nodes. Hence, the original tree T has a maximum of

\[ 1 + (2^{h_L} - 1) + (2^{h_R} - 1) \le 1 + (2^{m-1} - 1) + (2^{m-1} - 1) = 2^{m} - 1 \]

nodes. This confirms statement P(m), completing the induction.

Proof. Let T be any binary tree. Let L denote the maximum level of any node in T, and let H denote the height of T. Proposition 1 tells us that L ≤ H, and Exercise 14 tells us that H ≤ L. Therefore, L = H.


(a) Here is one of the trees: 





short 





wal, 






indcate closed section 


find 


student 


46) 3; 7:63; 1023; (e) f(n) = 1+ Logon]. 
Proof by induction. Let P(n) be the statement, "$(n!)^2 > n^n$." Since $(3!)^2 = 36$ and $3^3 = 27$, it is clear that statement P(3) is true. Now let m ≥ 4 be given such that statements P(3), P(4), ..., P(m − 1) have all been checked to be true. In particular, it has been checked that $((m-1)!)^2 > (m-1)^{m-1}$ is true. From this it follows that

\[ (m!)^2 = m^2 \cdot ((m-1)!)^2 \]
\[ > m^2 \cdot (m-1)^{m-1} \quad \text{by statement } P(m-1) \]
\[ \ge m^m \quad \text{by the given fact} \]


(a) a, d, b, c, a has weight 13
If we start with vertex e, the Hamiltonian cycle generated by the algorithm is e, c, d, a, b, f, e and has weight 21, but the cycle a, b, c, d, e, f, a has weight 14.











8. (a) a, d, b, c, a has weight 13

9. (a) [a, d], [a, c], [b, d], [b, c] yields a, d, b, c, a with weight 13; (e) [a, b], [a, d], [c, e], [b, c], [d, e] yields a, b, c, e, d, a with weight 28

13. [a, e], [a, d], [e, b], [c, d], [c, f], [b, f] yields a, e, b, f, c, d, a with weight 42.

14. (b) 000, 100, 101, 111, 110, 010, 011, 001, 000 is a Hamiltonian cycle in G_3 shown in the figure below.
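The cycle in 14(b) can be verified mechanically. This sketch assumes the graph in question has the eight 3-bit strings as nodes, with an edge between two strings exactly when they differ in one position; that description is an assumption, since the figure is not reproduced here.

```python
def is_hamiltonian_cycle_on_cube(labels):
    """Check a closed walk of bit strings: it must return to its start, visit
    every string of that length exactly once, and change one bit per step."""
    bits = len(labels[0])
    closed = labels[0] == labels[-1]
    visited = labels[:-1]
    covers_all = len(set(visited)) == len(visited) == 2 ** bits
    one_bit_steps = all(
        sum(x != y for x, y in zip(a, b)) == 1
        for a, b in zip(labels, labels[1:])
    )
    return closed and covers_all and one_bit_steps

cycle = ["000", "100", "101", "111", "110", "010", "011", "001", "000"]
print(is_hamiltonian_cycle_on_cube(cycle))
```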
















15. (b) The graph on the right below shows the representation of the Knight's Tour on the 4 x 4 chessboard shown on the left.







17. The graph shown below has a Hamiltonian cycle, but deg(a) + deg(c) = 4 < 5.





e d 


19. Proof. Let $G$ be a simple connected graph on $n \ge 3$ nodes in which each node has degree at least $n/2$. This means that for every pair of non-adjacent vertices $u$ and $v$, we can be sure that

$$\deg(u) + \deg(v) \ge \frac{n}{2} + \frac{n}{2} = n.$$

By Ore's Theorem, $G$ is Hamiltonian. ∎
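Answers 17 and 19 together say that Ore's condition is sufficient but not necessary for a Hamiltonian cycle. The sketch below illustrates both directions on small graphs. It assumes the graph in answer 17 is a 5-cycle on a, b, c, d, e, which fits deg(a) + deg(c) = 4 < 5 but is a guess because the figure is not recoverable. All function names are hypothetical, and the brute-force search is only practical for very small graphs.

```python
from itertools import combinations, permutations

Graph = dict[str, set[str]]


def satisfies_ore(g: Graph) -> bool:
    """Ore's condition: deg(u) + deg(v) >= n for every non-adjacent pair."""
    n = len(g)
    return all(
        len(g[u]) + len(g[v]) >= n
        for u, v in combinations(g, 2)
        if v not in g[u]
    )


def has_hamiltonian_cycle(g: Graph) -> bool:
    """Brute-force search over vertex orderings (small graphs only)."""
    first, *rest = g
    for order in permutations(rest):
        cycle = [first, *order, first]
        if all(b in g[a] for a, b in zip(cycle, cycle[1:])):
            return True
    return False


def cycle_graph(labels: str) -> Graph:
    """Graph whose vertices form a single cycle in the given order."""
    n = len(labels)
    return {labels[i]: {labels[i - 1], labels[(i + 1) % n]} for i in range(n)}


# Assumed stand-in for answer 17: Hamiltonian, yet Ore's condition fails.
five_cycle = cycle_graph("abcde")
assert has_hamiltonian_cycle(five_cycle) and not satisfies_ore(five_cycle)

# K_{3,3}: every node has degree 3 = n/2, so answer 19 applies.
k33: Graph = {u: set("xyz") for u in "abc"} | {v: set("abc") for v in "xyz"}
assert satisfies_ore(k33) and has_hamiltonian_cycle(k33)
```

The second example satisfies the degree hypothesis of answer 19, so the proof guarantees the cycle that the search finds.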
20. Hint: Assume that the Petersen graph has a Hamiltonian cycle, and argue that a contradiction must result. If you consider there are five "outside edges," five "star edges," and five "connecting edges," then you can argue that there must be an even number of edges of each type in the Hamiltonian cycle. Each possible case will lead to a contradiction.











21. The cycle PA, NY, CT, RI, MA, NH, VT, B, PA has weight 1216 miles.

23. [B, VT], [RI, MA], [MA, NH], [RI, CT], [CT, NY], [NH, VT], [PA, B], [PA, NY] yields the cycle PA, B, VT, NH, MA, RI, CT, NY, PA with weight 1216 miles.



















Special symbols and notation 
24,207 
va 
^T 
C(n, k), 372, 399
[1327 
×, 198
deg(v), 510, 513
batt 
ALAS 
45, 183 
E[X], 467, 469
ia 
fos 
116 300,385, 93 
m 
mal 228 
1, 0.49, 620 
a 





n.186 

127 

K_n, 536, 598
K_{m,n}, 536, 607
lcm, 228
EI 




Index 


M_{i,j}, 482, 547, 618
mod (%), 105, 166
n(A), 191, 443
Nam 

T 

0.39 

0, 268.274 

us 

v.27 561 

Pua), 25 

Pu). te 

P(n, k), 372, 393, 419
(Prony 162), 432 
Prob(E), 443

Q, 182 

Qt, 182 

Qi 
RT 
45,182 
Rae 

R=, 182 
5.19,14 

cam 

aes 

9 

n 

zasam 







Absorbing state, 483, 487, 494 
Absorption property 
Boolean algebra, 225 





Abstraction, 221, 507
Addition of matrices, 491, 551, 619
Adjacency matrix, 547
binary relation, 553
Boolean, 555
directed graph, 549
Adjacent nodes, 508, 547, 604
Adleman, L., 172
Alabama, 370, 411, 417
Alcuin of York, 567
Algorithm 
base two conversion, 154 
binary search, 353 
complexity of, 346, 601
counterfeit detection, 354, 363 
for counting, 391 
dictionary search, Hé 
division, 103 
edge greedy (TSP), 602
efficient, 352
Euclidean, 173
finding kernel, 574
graph, 525
greedy, 600 








Algorithm (Contd.) 
Kruskal's, 532 
merge sort, 357 
minimal spanning tree, 527, 532 
Prim's, 527-529 
recursive, 356 
selection sort complexity, 350 
sorting, 589 
spanning tree, 526 
vertex greedy (TSP), 600 
Almost Eulerian, 518 
Alphabet, 300 
Always, 42 
And (∧), 27, 38
And gate, 230 
Antiprism, 545 
Antisymmetric relation, 302, 306, 321 
Apples, 382, 448 
Argument, 71 
conclusion, 74 
fallacy, 72 
invalid, 74 
premises, 74 
valid, 74 
Argument structure, 74 
modus ponens, 72, 76 
modus tollens, 72, 76 
Aristotle, 68
Arithmetic-logic unit, 230 
Arithmetic operations, 331 
Arithmetic progression, 22 
Arithmetic triangle, 403, 428 
Arrow diagram, 252, 505 
Associative property 
Boolean algebra, 225 
logic, 35, 222 
sets, 188, 215, 222 
AUTHOR, 87 
Average value, 467, 469 





Babylonians, 344 
Backgammon, 449, 465 
Bacon, K., 276 
Bag, 372, 411, 448 
Barnsley, M., 342 
Base b numeral, 156 
Base ten, 150 
Base two, 152 
conversion algorithm, 154 
Baseball, 5, 393, 455, 615 
batting average, 466, 615 
Basketball, 127 
Batting average, 466, 615 
Bernoulli, J., 455, 460 
Bernoulli trial, 455, 460, 469 
Bernstein, F., 295 
Biconditional (↔), 62
Binary magic cards, 165 
Binary number of a game position, 577 







Binary numeral, 152, 326, 333 
conversion algorithm, 154 
Binary relation (see Relation) 

Binary search algorithm, 353 

Binary search tree, 587 
Binary sequence, 383, 393, 408, 412, 420, 
475, 607 

Binary string, 266, 280, 300 
Binary tree, 583 

child in, 583 

complete, 595

empty, 583 

height, 586 

inorder traversal, 591 

internal node, 584 

leaf in, 583 

left subtree, 583 

level, 585 





parent in, 584 
postorder traversal, 591 
preorder traversal, 590 
right subtree, 583 
root of, 583 
sibling in, 583 
traversal, 590 
Binomial, 402 
Binomial coefficient, 403 
Binomial probability distribution, 465 
Binomial theorem, 403 
Bipartite graph, 536 
Birthday problem, 396, 444, 447 
Bit, 152, 327 
Blackjack, 373, 388, 396, 447, 454, 459, 614
Board games, 65 
Boole, G., 223 
Boolean adjacency matrix, 555 
Boolean algebra, 223 
Boolean algebra properties, 225 
DeMorgan's Law, 229 
Boolean arithmetic, 555 
Boolean expression, 224 
sum of products, 234 
Boolean matrix operations, 554 
Boolean power, 560 
Boolean product of matrices, 555 
Boolean sum of matrices, 561 
Bouton, C., 576 
Bridge, 372, 382, 616 
Bridges of Königsberg, 506, 564
Bulgarian solitaire, 345 
Butterfly effect, 341 





C(n, k), 372, 399
Cabbage, 567 
Calculus, 77 
Calinger, R., 140 
Candy, 381, 448 
Cantor, G., 295 




Cantor's Theorem, 296 
Cardano, G., 441 
Cards, 613 
deck, 161, 294, 315, 344, 345, 388, 399, 
405, 443, 445, 450, 613 
face, 458, 613 
hand, 294, 315, 372, 396, 447, 613 
suit, 294, 395, 400, 447, 454, 613 
tricks with, 2, 163, 344, 405 
value, 299, 613 
Cardinality of a set, 292, 295 
Carmichael numbers, 177 
Cartesian graph, 198 
Cartesian product, 198, 201 
Cases, proof by, 101 
Catfish, 432 
Cauchy, A., 539 
Ceiling function, 327 
Chaff, 3 
Chaos, 338 
Chaos game, 342 
Charlemagne, 567 
Chessboard games, 580 
Child in a tree, 583 
Chinese remainder theorem, 178 
Chokwe, 514 
Circuit 
Eulerian, 508, 511 
graph, 508, 511 
logic, 229 
simplification, 231, 235 
Circular arrangements, 400 
Clock game, 581 
Closed formula, 11, 111, 426 
summation, 114 
Closed interval, 196 
Closed trail, 508 
Closure 
property of integers, 91 
property of rationals, 100 
reflexive, 566 
symmetric, 566 
transitive, 323, 557, 566 
Codomain, 249 
Collatz, L., 344 
Collatz Problem, 338 
Combination, 372 
number of (C(n, k)), 399
Combinatorial equivalence, 376 
Combinatorics, 368 
Commutative property 
Boolean algebra, 225 
logic, 35, 222 
sets, 188, 215, 222 
Complement, 187 
Complementary event, 444 
Complementary problem, 390 
Complete binary tree, 595 
Complete bipartite graph (K_{m,n}), 536, 607




Complete graph (K_n), 536, 598
Complexity 

algorithm, 346, 601 

Master theorem, 358 

non-polynomial, 597 

O, 349

Ω, 349

selection sort, 350

Θ, 349
Composition of functions, 268 
Composition of relations, 274, 554 
Conclusion 

of an argument, 74 

of an implication, 53, 57, 83 
Conditional probability, 452 
Congruent mod n, 166
Connected component, 511 
Connected graph, 511, 560-562 
Contradiction, 35 
Contrapositive, 60, 84 

proof by, 94, 100, 132, 287, 520 
Converse, 60, 68 
Converse fallacy, 73 
Correspondence, 286, 376 
Countable set, 297 
Counterexample, 43, 51, 57, 83, 190
Counterfeit detection algorithm, 354, 363 
Counting algorithm, 391 
Cross (×), 198
Cryptography, 172 
Cube, 539 
Cycle 

in a graph, 508

Hamiltonian, 596, 603 

in a sequence, 336 


Deal, 613 
Decimal numeral, 150 
Decision tree, 457, 475, 581 
Deck of cards, 161, 294, 315, 344, 345, 388, 399, 405, 443, 445, 450, 613
Decryption, 172
Degree (deg(v)), 510, 513
Degree sequence, 516, 544
DeMéré, C., 441
Democrat, 78, 401
DeMorgan, A., 440
DeMorgan's Law
Boolean algebra, 225, 229
logic, 32, 35, 46, 222
sets, 215, 222
Derangement numbers, 423
Descartes, R., 77, 198
Dessert, 69
Deuce, 421, 476, 489
Diagram
arrow, 252
Hasse, 302
Venn, 189




Dice, 65, 414, 419, 
Difference of sets, 186
Difference table, 424-425 
Differences, 424-425 
Direct sum, 577 
Directed edge, 549 
Directed graph, 505, 509, 548 
adjacency matrix, 549 
in-degree, 564
out-degree, 564 
of a relation, 553 
walk, 549, 551 
Disjoint events, 448 
Disjoint sets, 186 
Distributive property 
Boolean algebra, 225 
logic, 35, 222
sets, 188, 215, 222 
Divide-and-conquer, 356, 364 
Dividend, 104 
Divisible, 82, 96








Divisor lattice, 229 
Dodecahedron, 539 
Dodecahedron puzzle, 596, 600 
Domain 
function, 249 
predicate, 42, 47, 116 
relation, 253 
Dominoes, 118, 514 
Double arrow, 254, 317 
Double negative property 
Boolean algebra, 225 
logic, 33, 35, 46, 222 
sets, 215, 222 
Drug testing, 454 
Dual expression, 216, 225 
Duality principle 
Boolean algebra, 226 
sets, 216 





E[X], 467, 469 
Edge, 506 
directed, 510, 548 


multiple, 508 
parallel, 508 

Edge crossing, 536 

Edge greedy algorithm, 602 

Eeny-meeny-miney-moe, 3

Element of a set, 183 





Empty binary tree, 583 
Empty set (∅ or { }), 183, 211
Encryption, 172 







Enumeration, 369 
Envelope puzzle, 6, 9, 513, 564 
Equality of sets (=), 183, 213 
Equally likely outcomes, 441 
Equivalence 

class, 398 

combinatorial, 376 

logical, 32, 41, 60 
Equivalence relation, 313, 319, 398 

induced by partition, 315 
Euclid, 68 

Elements, 140 
Euclidean algorithm, 173 
Euler, L., 6, 77, 423, 506, 538-541, 608 
Eulerian, 508 

circuit, 506, 511 

graph, 511, 525 

trail, 506, 513 
Euler's formula, 538, 541, 545 
Even number, 44, 90 
Event, 441 

disjoint, 448 

independent, 451 

mutually exclusive, 448 
Exclusive or, 29 
Existence proof, 90, 100, 137 
(∃), 43
Expected value, 467, 469 
Experiment, 441 
Expression tree, 582, 592 









Face of planar graph, 540, 542 
Face card, 458, 613 
Factor, 98 
Factorial, 16, 330, 385, 393, 589 
Failed trial, 460 
Fallacy, 72 
converse, 73 
inverse, 73 
False advertising, 68 
Family tree, 581, 594 
Fermat, P., 441 
Fermat numbers, 110, 332 
Fermat's little theorem, 171, 174 
Fibonacci, 
Liber Abacci, 150 
Fibonacci numbers, 16, 125, 131, 345, 433 
Finite set, 295 
Finite structures, 324, 369, 415 
Finitely progressive game, 572 
First differences (An), 424 
Floor function, 327 
Flush, 447, 459, 614 
For all (∀), 43
Formal proposition, 27 
Four of a kind, 474, 614 
Fractal, 342 
Fraction, 100 
Full house, 465, 614 





Function, 11, 249 
arithmetic operations, 331 
arrow diagram, 252 
as a binary relation, 255
ceiling, 327 
codomain, 249 
composition (∘), 268
domain, 249 
floor, 327 
growth hierarchy, 356 





increasing, 349 

inverse (f^{-1}), 261

invertible, 283, 286 

iterated, 336 

linear, 363 

map, 249 

one-to-one, 286 

one-to-one correspondence, 286, 376 

onto, 286 

rule, 249 

sequence, 347 

sublinear, 363 

superlinear, 363 
Fundamental theorem of arithmetic, 129 
Fundamental theorem of sums, 425 





Galilei, G., 441 

Gambler's ruin problem, 478

Gambling, 441, 474 

Game tree, 7, 25

Games 
backgammon, 449, 465 
baseball, 5, 393, 455 
basketball, 127
blackjack, 373, 388, 396, 447, 454, 459, 614
Bulgarian solitaire, 345
card, 614

chaos, 342 

chessboard, 580 

children's, 485, 495, 498, 500 
clock game, 581 

finitely progressive, 572 





Hank and Ted, 478-481, 482

tournament, 127, 419 

Josephus game, 3, 8, 22, 121, 128, 131, 
159 

kernel, 573, 577

Kevin Bacon, 276, 282 

Nim, 576, 580 

poker, 399, 447, 459, 614 

Scrabble, 335, 616

series of, 461, 471, 501 







softball, 470, 615 
solitaire, 445 
state, 482 
stone removal, 549, 572 
table tennis, 480, 491, 500, 616 
tennis, 5, 9, 382, 421, 463, 476, 480, 489, 501, 615
twenty-one, 614 
Gardner, M., 580 
Gate, 230
and, 230
inverter, 230
not, 230
or, 230 
Gauss, C., 373, 418 
General sum rule, 391, 450 
Geometry, 539
Geometry of position, 506 
Gergonne pile trick, 344 
Goat, 567 
Gödel, K., 296 
Goldbach, C., 539 
Golden ratio, 139, 182 
Goldilocks, 54, 82 
Golf, 72 
Goose, 394 
Graph, 505
acyclic, 521
adjacency matrix, 547, 549
algorithm, 525 
almost Eulerian, 518 
bipartite, 536 
circuit, 508 
closed walk, 508
complete (K_n), 536, 598
complete bipartite (K_{m,n}), 536, 607
connected, 511, 519, 560-562 
connected component, 511 
cycle, 508 
directed, 509, 548 
edge, 506 
edge-crossing, 536 
embedding, 536 
Eulerian, 506, 511 
face, 540
of a game, 573 
Hamiltonian. 
Hamiltonian 
induction proofs, 522 
isomorphism, 534 
kernel, 573, 577 
longest path, 530, 566 
loop, 508
minimal spanning tree, 527
multiple edge, 508
node, 508
parallel edge, 508
path, 508, 568, 596
Petersen, 545 
















planar, 536, 538 
of a relation, 553 
simple, 510 
spanning tree, 526
subgraph, 511, 569 
trail, 508 
traversal, 590 
trivial circuit, 508 
trivial walk, 508 
tree, 505, 521 
vertex, 506 
walk, 506, 551 
weighted, 527, 599
Graph theory, 505 
Greatest common divisor, 228 
Greedy algorithm, 600 
Greek, 129, 538 
Grid game, 6, 581 


Hall, M., 4





Hamiltonian graph, 596, 603 
Hand of cards, 294, 315, 372, 396, 447, 613 
Hank and Ted game, 478-481, 482 
Harmonic series, 131 
Hasse, H., 302 
Hasse diagram, 302 
Height of binary tree, 586 
Herring, 394 
Hexadecimal numeral, 158 

binary conversion, 159 
Hockey stick identity, 428 
Home field advantage, 465 
Hypothesis, 53, 57, 83 


Icosahedron, 539 
Idempotent property 
Boolean algebra, 225 





Identity function, 272 
Identity matrix (I), 497, 566, 620
Identity property 

Boolean algebra, 225 

logic, 35, 222 

sets, 215, 222
If and only if (↔), 62
If-then statements, 53, 83 








conclusion, 53, 57, 83
contrapositive, 60, 84 
converse, 60, 69 
counterexample, 57, 83 
hypothesis, 53, 57, 83 
inverse, 60, 69 
language of, 61 
negation of, 58 

Implies (→), 53




In-degree, 564 
In-shuffle, 162 
Incident, 510 
Inclusion-exclusion principle, 192, 415 
Incomparable, 309 
Increasing function, 349 
Independent, 401 
Independent events, 451 
Induced equivalence relation, 315 
Induction, 110, 117 
as a game, 111
Inductive reasoning, 110 
Infinite set, 295 
Infix notation, 254, 593 
Injection (See one-to-one function) 
Inorder traversal, 591 
Input, 249 
Instant Insanity, 568, 580 
good subgraph, 569, 571 
partial solution, 569 
Integer solutions, 411, 413, 448 
Integers (Z), 45, 182 
Integrated circuit, 537 
Internal node, 584 
Intersection (∩), 186
Interval notation, 184, 196, 290 
Invalid argument, 74 
Inverse, 60 
Inverse fallacy, 73 
Inverse function, 261 
Inverse matrix, 497, 621 
Inverse relation, 257 
Inverter, 230 
Invertible function, 283, 286 
Iota, 272
Irrational number, 100, 138 
Irreflexive relation, 308 
Isomorphic graphs, 534-535 
Isomorphism, 534 
Iterated function sequence, 336 


Jail, 65 
Jones, T., 54, 82 

Josephus, F., 3

Josephus game, 3, 8, 22, 121, 128, 131, 159 
Josephus permutation, 4 


Kabjlie, 579 
Karnaugh, M., 235 
Karnaugh map, 235 
Keillor, G., 149 

Kernel, 573 

Kernel algorithm, 574 
King, S., 77

Knight’s Tour puzzle, 608 
Königsberg, 506 

Kruskal’s algorithm, 532 
Kuratowski, K., 537 
Kuratowski's Theorem, 538 




Lagrange, J., 110 
Lattice, 229 
Leaf (leaves), 521, 583 
Least common multiple, 228 
Left subtree, 583 
Leibniz, G., 77, 506 
Length of walk, 508 
Level, 585 
Liars, 24 
Libertarian, 78 
Linear algebra, 485, 496, 498 
Linear first order recurrence, 430 
Linear function, 363 
List 
ordered, 371, 393 
unordered, 371, 413, 510 
Logarithm, 324, 356, 588 
base b, 326, 330 
base ten, 324, 608 
base two, 355 
properties, 325 
Logarithmic growth, 356 
Logic 
and, 27, 38 
circuit, 229 
exclusive or, 29 
if and only if, 62 
implies, 53 
not, 27, 38
or, 27, 38 
predicate, 40, 116 
propositional, 27, 68 
substitution rule, 36 
Logic properties, 35 
absorption, 39, 222 
double negative, 33 
Logic puzzles, 26-30, 37, 63, 67 
Logically equivalent, 32, 60 
Longest path, 530, 562 
Loop, 508, 510 
Lorayne, H., 405 
Lorenz, E., 340 
Lottery, 369, 407, 447, 474 


Magic 
binary magic cards, 165 
club trick, 2, 128, 250 
domino trick, 514 
Gergonne pile trick, 344 
number spelling trick, 334 
ordinal trick, 344 
perfect shuffle trick, 163 
Scrabble trick, 335 
triangle card trick, 405 
Map, 249 
Markov chain, 482, 498 
Master theorem, 358 
Mathematical induction, 110, 117 
Matrix 




addition, 491, 551, 619 
adjacency, 547 
arithmetic, 550, 619 
Boolean adjacency, 555 
Boolean operations, 554 
Boolean power, 560, 566 
Boolean product, 555 
Boolean sum, 561, 566 
entry (M_{i,j}), 482, 547, 618
identity (I), 497, 566, 620
inverse (M^{-1}), 497, 621
multiplication, 484, 550, 555, 619
row-column multiplication, 484, 555, 619
transition, 482
transpose (M^T), 566, 623
zero (0_n), 620
Meaningless, 373 
Member of a set, 41 
Merge sort algorithm, 357 
Minimal spanning tree, 527 
Kruskal's algorithm, 532 
Prim's algorithm, 527-529 
Mississippi, 410 
mod (%) notation, 105, 166 
Modular arithmetic, 168, 176 
Modus ponens, 72, 76 
Modus tollens, 72, 76 
Morris, B., 162 
Morton, M., 40 
Multiple, 54, 98, 328 
Multiple edge, 508
Multiple quantifiers, 46 
Mutually exclusive events, 448 
Mutually exclusive sets, 186 








n(A), 191, 443 
n-tuple, 201 
Natural numbers (N), 182 
Necessary, 70 
Negation 
of propositions, 27, 31, 39, 58
quantified statements, 45-46 
Negation property 
Boolean algebra, 225 
logic, 35 
sets, 215 
Networks, 6 
Never, 42 
New York City, 299 
Newton, I., 77
Nicholson, J., 276 
Nim, 576, 580 
Node, 508, 583 
Non-absorbing states, 494 
Non-existence proof, 137 
Non-planar graph, 537 
Not (¬), 27, 38
Not gate, 230 
NP-complete, 600 


Number puzzles, 9, 13, 23 
Number of set elements, 191 
Number theory, 140, 145, 171 
Numbers

Bell, 423 

binary, 152, 326 

Carmichael, 177 

derangement, 423 

even, 42, 90 

Fermat, 110, 332 

Fibonacci, 16, 125, 131, 345, 433 

fraction, 100 

Hindu-Arabic, 150 

integers (Z), 45, 182 

irrational, 100, 138 

natural (N), 182 

odd, 42, 90 

Pell, 131 

perfect cube, 98 

perfect square, 15, 88, 164, 176 

prime, 52, 82, 140 

rational (Q), 100, 182

real (R), 45, 182 

relatively prime, 138 

twin primes, 109 
Numeral, 150 

base b, 156, 330 

binary, 152, 326 

decimal, 150 

hexadecimal, 158 

octal, 164 

Roman, 150 


O notation, 349 
Octahedron, 539 
Octal, 164 
Odd number, 10, 90 
Omega notation (Ω), 349
One-set arrow diagram, 254 
One-to-one correspondence, 286, 376 
One-to-one function, 286 
Only if, 70
Onto function, 286 
Operations research, 369 
Optimization, 369 
Or (∨), 27, 38
Or gate, 230 
Oracle of Bacon, 276, 282 
Order 

of function growth, 350, 356 








strict partial, 308 
strict total, 309 
total, 309 
well-ordering, 140
Order relation, 301 
Ordered list, 371, 393 
Ordered n-tuple, 201 




Ordered pair, 198, 510, 548 
Ordered quadruple, 201 
Ordered triple, 201 

Ore's Theorem, 604 
Oresme, N., 131 
Out-degree, 564 
Out-shuffle, 162 

Output, 249 


P(A), 203 
P(n), 116 
P(n, k), 372, 393, 419
Pair, 614 
Paradox, 197 
Parallel edge, 508 
Parent in a tree, 584 
Parity, 314 
Part, 205, 315 
Partial order, 302 
strict, 308 
Partition, 205, 314, 423 
equivalence relation of, 315 
Pascal, B., 403, 441 
Pascal's triangle, 403, 428 
Path, 508, 568, 596 
Pedantic, 95 
Peirce, C., 68 
Pell sequence, 131 
Pennsylvania, 70, 83 
Perfect cube, 98, 109 
Perfect shuffle, 161 
how to fake, 163 
Perfect square, 15, 88, 164, 176 
Period of cycle, 336 
Permutation, 300, 372 
derangements, 423 
Josephus permutation, 4 
number of (P(n, k)), 393, 419 
perfect shuffle, 161 
Petersen graph, 545, 608 
Pigeonhole principle, 142, 530 
average version, 148 
basic version, 144, 293 
general version, 143 
Pittsburgh, 70 
Pizza, 38 
Planar graph, 536, 538 
application, 537 
Euler's formula, 538, 541, 545 
face, 540
face size, 542 
Platonic solids, 538 
Poker, 399, 447, 614 
Polyhedra 
antiprism, 545 
prism, 545 
pyramid, 545 
regular, 538 
Population model, 432 




Postfix notation, 593 
Postorder traversal, 591 
Potato, 3 
Power set, 203 
Precedence, 28 
Predicate, 40 
domain, 42, 47
induction, 116 
quantified, 43 
Prefix notation, 593 
Premise, 77 
Preorder traversal, 590 
Prerequisite, 308 
Prim's algorithm, 527-529 
Prime interest rate, 78 
Prime number, 52, 88, 140 
relatively, 138 
Principle of Inclusion/Exclusion, 192, 415 
Prism, 545 
Prob(E), 443 
Probability, 440, 443 
conditional (Prob(E_1|E_2)), 452
experiment, 441 
general product rule, 453 
general sum rule, 450 
product rule, 452 
sample space, 441 
sum rule, 449 
Product of matrices, 484, 550, 619
Product rule 
for counting, 387 
for probability, 452, 453 
Proof, 85 
of antisymmetric relation, 307 
by cases, 101 
by contradiction, 133 
by contrapositive, 94, 100, 132, 287, 520 
direct, 85, 89, 132 
element-wise set, 205 
existence, 90, 100, 137 
about functions, 287 
as a game, 87 
about graphs, 519 
by induction, 110, 117, 132 
induction on graphs, 522-524 
of invertible function, 292 
as a letter, 89 
non-existence, 137 
about numbers, 90, 96 
of one-to-one function, 289 
of onto function, 292 
of reflexive relation, 305 
set equality, 213 
of subsets, 210 
of symmetric relation, 318 
tracing, 89 
of transitive relation, 306 
Proper subset (⊂), 195
Proposition, 27 










Propositional connective, 30, 51, 65 
Propositional variable, 27 
Pseudocode, 391 
Pseudoprime, 177 
Psychology, 54 
Puzzles 
coin weighing, 354, 363 
envelope puzzle, 6, 9, 513, 564 
Hamilton's dodecahedron, 596, 600, 608 
higher or lower, 363 
Instant Insanity, 568, 580 
knight's tour, 608 
line tracing, 514 
logic puzzles, 24-30, 37 
number puzzles, 9, 13, 23 
river-crossing puzzle, 567, 579 
towers of Hanoi, 420 
utilities puzzle, 536, 543, 546 
water puzzle, 579 
Pyramid, 545 


Q, 182

Quadrilateral, 54 
Quantified predicate, 43 
Quantifier, 43 
Quotient, 103 


Random variable, 466 
Rational numbers, 100, 182 
Reaper, 87 
Real numbers (R), 45, 182 
Reciprocal, 170 
Rectangle, golden, 139 
Recurrence relation, 11, 423 
constant differences, 426 
divide-and-conquer, 356, 364 
linear first order, 430 
master theorem, 358 
second order, 432, 436 
Recursion, 475 
Recursive algorithm, 356 
Recursive formula, 11, 111 
Recursive model, 4, 418, 475, 511 
Reflexive closure, 566 
Reflexive relation, 302, 566 
Red herring (See wild goose) 
Regular polyhedra, 538 
Relation, 253 
adjacency matrix for, 553 
antisymmetric, 302, 306, 321 
binary, 253 
codomain, 253 





equivalence, 315, 319, 398 
function, 255 

graph of, 553 

identity (1), 272 

infix notation, 254 




inverse, 257 
irreflexive, 308 
order, 301 
partial order, 302 
reflexive, 302, 566 
rule, 253
strict partial order, 308 
symmetric, 317, 566 
transitive, 302, 323, 557 
Remainder, 103 
Republican, 78, 401 
Reverse, 257 
RGB color values, 165 
Right subtree, 583 
River-crossing puzzle, 567, 579 
Rivest, R., 172 
Roman numerals, 150 
Rook, 580 
Root of a tree, 583 
Roster method, 183 
Round robin tournament, 419, 435 
Row-column multiplication, 484, 619 
RSA cryptosystem, 172 
Rule, 249, 253 
Rule of complements, 390 
Rule of products, 387 
Rule of sums, 389, 391 
Russell, B., 197 
Russell's paradox, 182, 197 


Sample space, 441 
Schoolhouse Rock, 40 
Scrabble, 335 
Search tree, 587
Second differences, 425 
Second order recurrence relation, 432, 436 
Selection sort complexity, 350 
Selection tree, 388 
Self-similarity, 342
Senate, 399 
Sensitivity to initial conditions, 340, 345 
Sequence, 9 
binary, 383, 393, 408, 412, 420, 475, 607 
closed formula, 11, 111, 426 
differences, 424-425 
function, 347 
iterated function, 336 
notation, 11, 199 
recursive formula, 11, 111 
Series of games, 461, 471, 501 
Set, 181, 372 
cardinality, 295 
Cartesian product, 198, 201 
complement, 187 
countable, 297 
difference, 186 
disjoint, 186 
duality principle, 216 
element, 41, 183 







empty, 183 
equality, 183, 213 
finite, 295 
form description, 185 
infinite, 295 
integers (Z), 182 
intersection, 186 
membership (∈), 43, 183
natural numbers (N), 182 
number of elements, 191 
rational numbers (Q), 182 
real numbers (R), 182 
roster method, 183 
partition, 205, 314, 423 
power set, 203 
property description, 184 
subset, 183 
uncountable, 297 
union, 186 
universal, 183 
Set builder notation, 184 
Set operation properties, 215, 222 
absorption, 195, 220 
associative, 188 
commutative, 188, 220 
DeMorgan's law, 220 
distributive, 188, 220 
Shakespeare, W., xiii 
Shamir, A., 172 
Shuffle 
in-shuffle, 162 
out-shuffle, 162 
perfect, 161 
Sibling in a tree, 583 
Sierpinski Triangle, 343 
Sigma notation, 19, 114 
Simple graph, 510 
Simple interest, 431 
Single elimination tournament, 127 
Six Degrees of Separation, 276 
Smarty pants, 418 
Smullyan, R., 24 
Smullyan's Island, 24-30, 34, 37, 63, 67, 199 
Softball, 470, 615 
Solitaire, 345, 445 
Sometimes, 42 
Sorting algorithm, 350, 357, 589 
Spanning tree, 526 
algorithm, 527 
minimal, 526 
Spreadsheet, 345, 351, 360-364 
Stamps, 131 
State of a game, 482 
absorbing, 483 
transient, 494 
Statement for induction, 116 
Statistics, 454 
Steroids, 454 
Stone removal game, 549, 572 





Straight, 447, 459, 614 
Straight flush, 447, 614 
Strategy, 7, 572
Strict partial order, 308 
Strict total order, 309 
Strong induction, 119 
Subgraph, 511, 569 
Subset (⊆), 183
Substitution rule, 36 
Subtree, 583 
Successful outcomes, 443 
Successful trial, 460 
Sufficient, 70 
Suit of a card, 294, 395, 400, 447, 454, 613 
Sum of matrices, 491, 551, 619 
Sum notation, 19, 114 
Sum of products, 234 
Sum rule 

for counting, 389, 391 

for probability, 449, 450
Summation, 19 
Super Bowl, 150 
Surjection (see onto function) 
Symmetric closure, 566 
Symmetric relation, 317, 566 
Syracuse Problem, 338 


Table tennis, 480, 491, 500, 616 
Tartaglia’s triangle, 403 
Tautology, 35, 74
Telecommunications, 599 
Tennis, 5, 9, 382, 421, 463, 476, 480, 489, 501, 615

Ternary sequence, 417 
Tetrahedron, 539 
There exists (∃), 43
Theta notation, 349
Third differences, 425
Thoreau, H., xi
Three of a kind, 465, 614
TI-83 calculator, 326, 447, 559, 620, 623
Time complexity, 346
Topology, 538
Total order, 309 
Tournament 

first round, 423 

round robin, 419, 435 

single elimination, 127 
Towers of Hanoi, 420 










Tracing a proof, 89 
Trail, 508, 513 
Transient states, 494 
Transition matrix, 482 
Transitive closure, 323, 557, 566
Transitive relation, 302, 323, 557








Transportation network, 599
Transpose of a matrix, 566, 623
Traveling Salesperson Problem, 597
Traversal of a binary tree, 590
Traversal problem, 597
Tree, 505, 521 

binary, 581 

binary search, 587 

child in, 583 

decision, 457, 475, 581

expression, 582, 592 

family, 581, 594 

game, 7, 9, 25

height, 586 

inorder traversal, 591 

internal node, 584 

leaf (leaves), 521, 583 

level, 585 

minimal spanning, 527 

parent in, 584 

postorder traversal, 591 

preorder traversal, 590 

root of, 583 

selection, 388 





spanning, 526 
subtree, 583 
traversal, 590 

Trial, 455 

Triangle, 52 

Trick-or-treat, 373 





Trivial circuit, 508
Trivial walk, 508
Truth table, 24
and (∧), 30
for circuit, 230
as function, 250
implication (→), 55
not (¬), 30
or (∨), 30
Truth-tellers, 24
TSP, 597







Tukey, J., 152 
Tuple, 201 

Twenty-one game, 614
Twin primes, 109

Two pair, 465, 614 

Two-set arrow diagram, 254 


U.S. presidents, 585
Ulam's Problem, 338 
Uncountable set, 297 
Unemployment, 78 
Union (∪), 186
Universal bound property 
Boolean algebra, 225 
logic, 35, 222
sets, 215, 222 
Universal set, 183 
Universe, 183 
Unordered list, 371, 413, 510 
Utilities puzzle, 536, 543, 546





Valid argument structure, 74 

Validity of arguments, 71 

Value of card, 613 

Vandermonde, A., 608 

Variable, 27, 40, 466 

Venn diagram, 189 

Vertex (vertices), 506 
degree, 510, 513, 564 

Vertex-greedy algorithm, 600 

Virginia, 276, 585 


Walk, 506 
length, 508 
trivial, 508 
Wason’s selection task, 84 
Water puzzle, 579 
Weak induction, 120 
Weighted graph, 527, 599 
Well-ordering principle, 140 
Welles, O., 276 
Wild goose chase (See red herring) 
Wolf, 567 
Word, 300 
World Series, 462 





Z, 45, 182
Zahlen, 45 
Zero matrix (0_n), 620


Did you know that games and puzzles have given birth to some of today's deepest mathematical subjects? Now, with Douglas Ensley and Winston Crawley's Discrete Mathematics, you can explore mathematical writing, abstract structures, counting, discrete probability, and graph theory through games, puzzles, patterns, magic tricks, and real-world problems. You will discover how new mathematical topics can be applied to everyday situations, learn how to work with proofs, and develop your problem-solving skills along the way.

Online applications help improve your mathematical reasoning.

Intriguing, interactive Flash-based applications illustrate key mathematical concepts and help you develop your ability to reason mathematically, solve problems, and work with proofs.

Explore More on the Web icons in the text direct you to online activities at www.wiley.com/college/ensley.





Improve your grade with the Student Solutions Manual. 

A supplementary Student Solutions Manual contains more detailed solutions to selected 
exercises in the text. If your bookstore doesn't carry the Student Solutions Manual, you can 
purchase a copy online at www.wiley.com/college/ensley. 





WILEY 


www.wiley.com/college/ensley





















ISBN 0-471-47502-1 


9 780471475023

















